I am currently an ELLIS PhD student advised by Gerard de Melo at the Hasso Plattner Institute and ELLIS Unit Potsdam, Germany, and co-advised by Desmond Elliott at the University of Copenhagen.
My current research focuses on multilingual NLP, tokenizers, and embeddings. In particular, I am working towards “freeing” pretrained large language models from their static vocabularies by developing better methods for tokenizer transfer and embedding initialization of new tokens [1,2]. My main focus is crosslingual transfer of pretrained models, where a tokenizer mismatch with new languages can be very detrimental.
Apart from this focus, I have a broader set of research interests: I have published work on computationally efficient training of large language models [3,4], extending tokenizers with a special “I Don’t Know” token for uncertainty quantification [5], and conditional image generation using GANs [6]. I have also worked on multimodal protein modeling during a research internship with InstaDeep.
I’m currently interning at Apple with the Global Siri team in Barcelona, Spain. Previously, I completed an ML research internship with InstaDeep in Paris focusing on multimodal generative protein design and interned with SAP in Newport Beach, California as a software engineer after my undergrad. In my spare time, I enjoy playing my saxophone, chess, alpine hiking, and solo traveling.
Countries I have worked from during my PhD: 12 🇩🇪🇸🇬🇲🇾🇹🇭🇳🇴🇯🇵🇮🇩🇫🇷🇰🇷🇬🇧🇺🇸🇪🇸
March 31, 2025 Starting as an ML Intern at Apple in the Global Siri team!
Feb 18, 2025 Interviewed for an article in the Wall Street Journal about our “I Don’t Know” token paper.
Dec 6, 2024 Presented our “I Don’t Know” token paper at the NeurIPS@Paris ELLIS poster session.
Nov 13, 2024 Talk on FOCUS and the future of embedding initialization for language adaptation at the Lee Language Lab @ OntarioTech University.
Oct 7, 2024 Presented “Efficient Parallelization Layouts for Large-Scale Distributed Model Training” at COLM 2024!
Sep 25, 2024 Our paper on “I Don’t Know” tokens is accepted at NeurIPS 2024 (work with Roi Cohen, Eden Biran and Gerard de Melo).
Jul 27, 2024 Presented a new paper on “Language Adaptation on a Tight Academic Compute Budget” at the WANT ICML 2024 workshop.
Jul 1, 2024 Started my research internship at InstaDeep based out of Paris working on multimodal protein modeling!
Apr 7, 2024 Attended two amazing ML research schools: MLSS in Okinawa, Japan, and ALPS in Aussois, France, where I presented my FOCUS paper on embedding initialization.
Feb 12, 2024 Attended the HPLT & NLPL winter school in Skeikampen, Norway.
Dec 17, 2023 Our work on efficient distributed model training won Best Paper at the WANT@NeurIPS workshop!
Oct 7, 2023 FOCUS is accepted at EMNLP 2023!
PhD in Computer Science (Machine Learning)
Hasso Plattner Institute, University Potsdam
2022 - present
MSc in Computer Science (IT-Systems Engineering)
Hasso Plattner Institute, University Potsdam
2020 - 2022
BSc in Computer Science (IT-Systems Engineering)
Hasso Plattner Institute, University Potsdam
2016 - 2020