Research Manager, Production Model Training

Lead research team training production models at Anthropic.

Anthropic

Hybrid

On-Site

United States

USD 340,000 - 560,000

Job Description

Anthropic is dedicated to creating reliable, interpretable, and steerable AI systems, ensuring AI is safe and beneficial for users and society. The Applied Finetuning team builds on Anthropic’s Finetuning research to make Anthropic’s business and products successful.

The Research Manager will lead a team of researchers and research engineers focused on training flagship models launched to the public via Claude.AI and Anthropic's API. This role involves designing and iterating on state-of-the-art finetuning techniques, such as Constitutional AI and RLHF, to train production Claude models. The team will implement new algorithms, run experiments on data mixes, design evaluations, and improve the production model finetuning pipeline.

Responsibilities:

Lead research and engineering efforts to train production models through post-training techniques
Become familiar with the team’s technical stack enough to make targeted contributions as an individual contributor
Manage day-to-day execution of the team's work
Prioritize the team’s work and manage projects to support fast iteration on research projects and training runs
Coach and support your reports in understanding, and pursuing, their professional growth
Maintain a deep understanding of the team's technical work and its implications for AI safety

Requirements:

Have 3-5 years of management experience in a research or technical environment
Have a background in machine learning, AI, or a related technical field
Are deeply interested in the potential transformative effects of advanced AI systems and are committed to ensuring their safe development
Excel at building strong relationships with stakeholders at all levels
Are a quick learner, capable of understanding and contributing to discussions on complex technical topics
Have experience managing teams through periods of rapid growth and change
Are comfortable working in a fast-paced, research-driven environment where priorities may shift quickly
Are a quick study: this team sits at the intersection of a large number of different complex technical systems that you’ll need to understand (at a high level of abstraction) to be effective

Anthropic offers:

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
A lovely office space in which to collaborate with colleagues

Apply Manually

Anthropic

All Jobs at Anthropic (208)

Clash

of Jobs

Research Manager, Production Model Training

Job Description

Anthropic

This feature is not ready yet

Sign up for the newsletter to get notified when it's available

Research Manager, Production Model Training

Job Description

Anthropic