Job Description
Anthropic is seeking a Staff Infrastructure Engineer to join their AI Scientist Team. This role is based in San Francisco, CA, and focuses on building an AI scientist capable of solving long-term reasoning challenges. The engineer will work end-to-end, addressing key infrastructure blockers to advance scientific AGI. Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.Role involves:
- Designing and implementing large-scale infrastructure systems for AI scientist training, evaluation, and deployment.
- Identifying and resolving infrastructure bottlenecks.
- Developing robust evaluation frameworks for measuring progress towards scientific AGI.
- Building scalable VM/sandboxing/container architectures.
- Developing large-scale data pipelines for advanced language model training.
- Optimizing large-scale training and inference pipelines for reinforcement learning.
Requirements:
- 3+ years of experience in infrastructure engineering with expertise in large-scale distributed systems.
- Strong communication and collaboration skills.
- Deep knowledge of performance optimization techniques and system architectures for high-throughput ML workloads.
- Experience with containerization technologies (Docker, Kubernetes) and orchestration at scale.
- Proven track record of building large-scale data pipelines and distributed storage systems.
- Experience collaborating with researchers to scale experimental ideas.
- Bachelor's degree in a related field or equivalent experience.
Role offers:
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Lovely office space in San Francisco.