Job Description
Anthropic is seeking a Research Engineer to join its Horizons team, which is dedicated to reinforcement learning research and development. This team plays a crucial role in advancing Anthropic's AI systems and has contributed significantly to Claude models. The role involves collaborating with researchers and engineers to enhance the capabilities and safety of large language models.
As a Research Engineer, the candidate will blend research and engineering responsibilities, implementing novel approaches and contributing to the research direction. The work spans fundamental research in reinforcement learning, building 'agentic' models that use tools to carry out open-ended tasks, improving reasoning abilities, and developing prototypes for internal use.
Responsibilities include:
- Architecting and optimizing core reinforcement learning infrastructure.
- Designing, implementing, and testing novel training environments and methodologies.
- Driving performance improvements through profiling, optimization, and benchmarking.
- Collaborating across research and engineering teams to develop automated testing frameworks and scalable infrastructure.
Requirements:
- Proficiency in Python and async/concurrent programming.
- Experience with machine learning frameworks (PyTorch, TensorFlow, JAX).
- Industry experience in machine learning research.
- Strong systems design and communication skills.
- Passion for the potential impact of AI and commitment to developing safe and beneficial systems.
The role offers:
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- A collaborative office space.