Job Description
Anthropic is dedicated to creating reliable, interpretable, and steerable AI systems, ensuring AI is safe and beneficial for users and society. The Applied Finetuning team builds on Anthropic’s Finetuning research to make Anthropic’s business and products successful.
The Research Manager will lead a team of researchers and research engineers focused on training flagship models launched to the public via Claude.AI and Anthropic's API. This role involves designing and iterating on state-of-the-art finetuning techniques, such as Constitutional AI and RLHF, to train production Claude models. The team will implement new algorithms, run experiments on data mixes, design evaluations, and improve the production model finetuning pipeline.
Responsibilities:
- Lead research and engineering efforts to train production models through post-training techniques
- Become familiar enough with the team’s technical stack to make targeted contributions as an individual contributor
- Manage day-to-day execution of the team's work
- Prioritize the team’s work and manage projects to support fast iteration on research projects and training runs
- Coach and support your reports in understanding and pursuing their professional growth
- Maintain a deep understanding of the team's technical work and its implications for AI safety
Requirements:
- Have 3-5 years of management experience in a research or technical environment
- Have a background in machine learning, AI, or a related technical field
- Are deeply interested in the potential transformative effects of advanced AI systems and are committed to ensuring their safe development
- Excel at building strong relationships with stakeholders at all levels
- Are capable of understanding and contributing to discussions on complex technical topics
- Have experience managing teams through periods of rapid growth and change
- Are comfortable working in a fast-paced, research-driven environment where priorities may shift quickly
- Are a quick study: this team sits at the intersection of many complex technical systems that you’ll need to understand (at a high level of abstraction) to be effective
Anthropic offers:
- Competitive compensation and benefits
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
- A lovely office space in which to collaborate with colleagues