Job Description
Anthropic is seeking a Research Engineer to join their Pretraining team in San Francisco. The Research Engineer will contribute to the development of safe, steerable, and trustworthy AI systems. This role involves working at the intersection of cutting-edge research and practical engineering, focusing on the next generation of large language models.Responsibilities include:
- Designing and implementing high-performance ML training infrastructure.
- Developing and maintaining core ML framework primitives in JAX, PyTorch, etc.
- Creating robust automated evaluation and benchmarking systems.
- Implementing comprehensive monitoring and debugging tools.
- Designing and optimizing data loading pipelines.
- Building MLOps tooling to support reproducible research.
- Collaborating with research teams to prototype and scale novel training architectures.
- Developing infrastructure for efficient hyperparameter sweeps and architecture search.
Requirements:
- Strong software engineering skills with experience in building distributed systems.
- Expertise in Python and experience with distributed computing frameworks.
- Deep understanding of cloud computing platforms and distributed systems architecture.
- Experience with high-throughput, fault-tolerant system design.
- Strong background in performance optimization and system scaling.
- Excellent problem-solving skills and attention to detail.
- Strong communication skills and ability to work in a collaborative environment.
- Bachelor's degree in a related field or equivalent experience.
Anthropic offers:
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- A collaborative office space.