Job Description
The Allen Institute for Artificial Intelligence (AI2) is seeking a Senior Research Engineer to join its OLMo team in Seattle. The role involves building and optimizing infrastructure for large language model research. The engineer will collaborate with colleagues to design and implement scalable machine learning pipelines for training these models.
Role Involves:
- Building infrastructure for LLM research.
- Optimizing training and inference for language models.
- Triaging proposed experiments and executing the most impactful ones.
- Supporting and collaborating with an open-source community.
- Bridging the gap between research and product.
- Releasing contributions as open-source software.
Requirements:
- 4+ years of experience building ML infrastructure.
- Deep experience with the model development cycle.
- Knowledge of deep learning and NLP.
- Strong Python skills and experience with PyTorch, JAX, or TensorFlow.
- Familiarity with cloud compute resources and containerization.
- Strong collaboration and communication skills.
What AI2 offers:
- Generous paid vacation and sick leave.
- Family leave.
- Team member bonus.
- Long-term incentive plan.
- $125/month for commuting or internet expenses.
- $200/month for fitness and wellbeing expenses.