I am a Research Scientist working on Computational Linguistics at Facebook AI Research. Previously, I completed a PhD in Computational Linguistics at the University of Cambridge Computer Laboratory, supervised by Stephen Clark.
My research area is Machine Learning applied to Natural Language Processing. My interests include NLP for low-resource and endangered languages, machine translation and speech translation.
My doctoral research was kindly funded by the EPSRC, and I am a St John’s scholar.
When I am not working on language, I can usually be found hiking, sailing or backcountry skiing somewhere around Northern Italy or California.
My name is pronounced [ˈʒɑ̃ maˈjaʁ], as in the character Jean Valjean in Victor Hugo’s Les Misérables.
S Lusito, E Ferrante, J Maillard (2023). “Text normalization for low-resource languages: the case of Ligurian”. Proceedings of the 6th Workshop on the Use of Computational Methods in the Study of Endangered Languages. PDF
NLLB Team (2022). “No Language Left Behind: scaling human-centered machine translation”. PDF
J Maillard, V Karpukhin, F Petroni, W Yih, B Oğuz, V Stoyanov, G Ghosh (2021). “Multi-task retrieval for knowledge-intensive tasks”. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics. PDF BibTeX
F Petroni, A Piktus, A Fan, P Lewis, M Yazdani, N De Cao, J Thorne, Y Jernite, V Karpukhin, J Maillard, V Plachouras, T Rocktäschel, S Riedel (2020). “KILT: a benchmark for knowledge intensive language tasks”. To appear in: 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics. PDF
See the full list of publications here.
University of Cambridge, St John’s College
- PhD in Computer Science, NLIP group (2015–2019)
- MPhil in Advanced Computer Science, distinction, best research project award (2013–2014)
- MASt (Part III) in Applied Mathematics and Theoretical Physics (2012–2013)
Imperial College London
- BSc in Physics with Theoretical Physics, first class honours (2009–2012)
Snail mail:
University of Cambridge Computer Laboratory
William Gates Building
15 JJ Thomson Avenue