Meet the talented engineers and students from ETHZ and EPFL building Apertus, along with researchers from several labs at EPFL and ETHZ. We're creating impactful models that embody Swiss values of transparency and collaboration, serving both our research community and society at large.
Allen Huang
Research Engineer
Pretraining
Hanna Yukhymenko
Research Engineer
Post-training
Robert Smith
Research Engineer
Applications
Oleg Lavrovsky
Research Engineer
Community and Outreach
Valentina Pyatkin
Research Engineer
Post-training
Current Student Researchers
| Mukhammadali Sayfiddinov | Student Researcher | Reinforcement Learning |
| Tommy Chu | Student Researcher | Multimodality - Health |
| Ahmed Rockey Saikia | Student Researcher | Audio |
| Ahmad Fraji | Student Researcher | Applications |
| Yuhei Fukuhara | Student Researcher | Inference and Multimodal Evaluations |
| Alex Padula | Student Researcher | Coding Environments |
| Alessandro Tazza | Student Researcher | RL with verifiable Rewards |
| Khanh Nguyen | Student Researcher | Coding Environments |
| Aryan Ahadinia | Student Researcher | Inference Optimisation |
| Anunay Yadav | Student Researcher | Multimodal Evals |
| Bich (Rubi) Ngoc Doan | Student Researcher | Multicultural Evals |
| Raphael Kreft | Thesis | Multimodality |
| Juan Garcia Giraldo | Thesis | Steering and Interpretability |
| Wanja Pletscher | Thesis | LLM Interpretability |
| Matteo Santelmo | Thesis | RLVR |
| Nicola Dall'Acqua | Thesis | Synthetic Data Generation |
In addition, a larger list of MSc students, PhD students, and Postdocs contribute and lead parts of the Apertus development as part of their education and research.
Collaborations
Beyond our team, we collaborate with 30+ MSc students, PhD students, and Postdocs from various academic labs, such as the ones listed below.
- Machine Learning and Optimization Lab - Prof. Martin Jaggi, EPFL
- Natural Language Processing Lab - Prof. Antoine Bosselut, EPFL
- Learning and Adaptive Systems Group - Prof. Andreas Krause, ETH Zurich
- Scalable Parallel Computing Lab - Prof. Torsten Hoefler, ETH Zurich
- Law, Economics, and Data Science Lab - Prof. Elliott Ash, ETH Zurich
- Caglar Gulcehre Laboratory of Artificial Intelligence Research - Prof. Caglar Gulcehre, EPFL
- Efficient Architectures and Systems Lab - Prof. Ana Klimović, ETH Zurich
- ETH AI Center - Fellows and researchers
- EPFL AI Center - Fellows and researchers
- Swiss National Supercomputing Centre (CSCS) - Several full-time employees
Alumni
Former team members who have moved on to new opportunities:
| Antoni Solergibert | Research Engineer | Pretraining (joined Nvidia) |
| Nicola Irmiger | Student Researcher | Multimodality |
| Dr. Kyle Matoba | Research Engineer | Pretraining |
| Vanya Pavlov | Student Researcher | Alignment |
| Marco Scialanga | Student Researcher | Alignment |
| Ilia Badanin | Student Researcher | Serving & Post-training |
| Mathieu Sauser | Student Researcher | Data Processing |
| Dhia Garbaya | Student Researcher | Efficiency |
| Camille Challier | Student Researcher | RL with verifiable Rewards |
| Juan Garcia Giraldo | Student Researcher | Alignment |
| Luca Mouchel | Student Researcher | Distributed RL Training |
| Nathan Ranchin | Student Researcher | RL with verifiable Rewards |
| Jakhongir Saydaliev | Student Researcher | RL with verifiable Rewards |
| Jiaming Jiang | Student Researcher | RL with verifiable Rewards |
| Clément Charmillot | Student Researcher | RL with verifiable Rewards |
| Rongxiao Qu | Student Researcher | Multimodality |