Meet the talented engineers and students from ETHZ and EPFL building Apertus, along with researchers from several labs at EPFL and ETHZ. We're creating impactful models that embody Swiss values of transparency and collaboration, serving both our research community and society at large.
Allen Huang
Research Engineer
Pretraining
Hanna Yukhymenko
Research Engineer
Post-training
Robert Smith
Research Engineer
Applications
Oleg Lavrovsky
Research Engineer
Community and Outreach
Valentina Pyatkin
Research Engineer
Post-training
Current Student Researchers
| Tommy Chu | Student Researcher | Multimodality - Health |
| Ahmed Rockey Saikia | Student Researcher | Audio |
| Ahmad Fraji | Student Researcher | Applications |
| Yuhei Fukuhara | Student Researcher | Inference and Multimodal Evaluations |
| Alex Padula | Student Researcher | Coding Environments |
| Alessandro Tazza | Student Researcher | RL Pipeline Optimisation |
| Khanh Nguyen | Student Researcher | Coding Environments |
| Aryan Ahadinia | Student Researcher | Inference Optimisation |
| Anunay Yadav | Student Researcher | Multimodal Evals |
| Bich (Rubi) Ngoc Doan | Student Researcher | Multicultural Evals |
| Marian Schneider | Student Researcher | Alignment |
| Davit Melikidze | Student Researcher | Alignment |
| Ender Dogan Isik | Student Researcher | Data Generation |
Thesis Students
Supervised by members of the team or close academic collaborators.
| Raphael Kreft | Master Thesis | Multimodality |
| Nicola Dall'Acqua | Bachelor Thesis | Synthetic Data Generation |
| Cédric Laubacher | Master Thesis | Long-Context RL |
| Juan Garcia Giraldo | Master Thesis | Steering and Interpretability |
| Wanja Pletscher | Bachelor Thesis | LLM Interpretability |
| Matteo Santelmo | Master Thesis | RLVR |
| Elena Lyulina | Master Thesis | Memorization in Attention Mechanisms |
| Guanshujie Fu | Master Thesis | MoE Pretraining |
| Petr Grinberg | Master Thesis | Multimodal Language Models |
Student Projects
Supervised by members of the team or close academic collaborators.
| Bartosz Szostakiewicz | Semester Project | Coding Environments |
| Francesco Monti | Semester Project | Multilingual Reasoning |
| Mauro Pellonara | Semester Project | Multilingual Reasoning |
| Youssef Boughizane | Semester Project | Scalable RL |
| Mahdi Atallah | Semester Project | Scalable RL |
| Sacha Godey | Semester Project | Audio-Text Alignment |
| Clément Rousseau | Semester Project | RL for Hard-to-Verify Domains |
| Badr Al Mahouri | Semester Project | Vision RL |
| Loic Deslarzes | Course Project | Tool Gym |
| Leonard Mantel | Course Project | Speech Tokenizers |
| Aleks Stepancic | Course Project | Hallucination Steering |
| Tanguy Dieudonné | Course Project | Memorization in Language Models |
| Tobias von Arx | Course Project | Memorization in Language Models |
| Rada Kamysheva | Course Project | Personalized Data Generation |
| Luca Baumann | Course Project | Post-training Data |
| Jenny (Yizhen) Wang | Course Project | Agentic LLMs |
In addition, a larger list of MSc students, PhD students, and Postdocs contribute and lead parts of the Apertus development as part of their education and research.
Collaborations
Beyond our team, we collaborate with 30+ MSc students, PhD students, and Postdocs from various academic labs. Our collaborations include but are not limited to the following groups:
- Machine Learning and Optimization Lab - Prof. Martin Jaggi, EPFL
- Natural Language Processing Lab - Prof. Antoine Bosselut, EPFL
- Learning and Adaptive Systems Group - Prof. Andreas Krause, ETH Zurich
- Scalable Parallel Computing Lab - Prof. Torsten Hoefler, ETH Zurich
- Law, Economics, and Data Science Lab - Prof. Elliott Ash, ETH Zurich
- Caglar Gulcehre Laboratory of Artificial Intelligence Research - Prof. Caglar Gulcehre, EPFL
- Efficient Architectures and Systems Lab - Prof. Ana Klimović, ETH Zurich
- ETH AI Center - Fellows and researchers
- EPFL AI Center - Fellows and researchers
- Swiss National Supercomputing Centre (CSCS) - Several full-time employees
Alumni
Former team members who have moved on to new opportunities:
| Mukhammadali Sayfiddinov | Student Researcher | Reinforcement Learning | 2026 |
| Antoni Solergibert | Research Engineer | Pretraining (joined Nvidia) | 2025 |
| Nicola Irmiger | Master Thesis | Multimodality | 2025 |
| Dr. Kyle Matoba | Research Engineer | Pretraining | 2025 |
| Vanya Pavlov | Student Researcher | Alignment | 2025 |
| Marco Scialanga | Student Project | Alignment | 2025 |
| Ilia Badanin | Student Researcher | Serving & Post-training | 2025 |
| Mathieu Sauser | Student Researcher | Data Processing | 2025 |
| Dhia Garbaya | Student Researcher | Efficiency | 2025 |
| Camille Challier | Master Thesis | RL with verifiable Rewards | 2025 |
| Juan Garcia Giraldo | Student Researcher | Alignment | 2025 |
| Luca Mouchel | Master Thesis | Distributed RL Training | 2025 |
| Nathan Ranchin | Student Researcher | RL with verifiable Rewards | 2025 |
| Jakhongir Saydaliev | Master Thesis | RL with verifiable Rewards | 2025 |
| Jiaming Jiang | Student Researcher | RL with verifiable Rewards | 2025 |
| Clément Charmillot | Master Thesis | RL with verifiable Rewards | 2025 |
| Rongxiao Qu | Student Researcher | Multimodality | 2025 |