Research Scientist, Frontier Red Team (Autonomy)

Research Scientist for autonomy evaluations on the Frontier Red Team.

Anthropic

USD 280,000 - 425,000

Job Description

Anthropic is seeking a Research Scientist to join its Frontier Red Team, focusing on advanced autonomy evaluations. The role involves developing and implementing evaluations to determine the AI Safety Level (ASL) of Anthropic's models. These determinations are crucial for training, deploying, and securing its models, as outlined in its Responsible Scaling Policy (RSP).

The Research Scientist will lead the end-to-end development of autonomy evaluations, from risk and capability modeling through designing, implementing, and regularly running the evaluations. They will iterate on experiments to evaluate autonomous capabilities and forecast future capabilities. The role also involves providing technical leadership to Research Engineers to build scalable and secure infrastructure for large-scale experiments.

Anthropic values collaboration and communication, and the Research Scientist will communicate evaluation outcomes to relevant teams, policy stakeholders, and research collaborators. They will also collaborate with other projects to improve infrastructure and design safety techniques for autonomous capabilities.

Responsibilities:

Lead the end-to-end development of autonomy evals and research.
Quickly iterate on experiments to evaluate autonomous capabilities.
Provide technical leadership to Research Engineers.
Communicate the outcomes of the evaluations to relevant Anthropic teams.
Collaborate with other projects to improve infrastructure and design safety techniques.

Requirements:

ML background and experience leading experimental research on LLMs/multimodal models and/or agents
Strong Python-based engineering skills
Driven to find solutions to ambiguously scoped problems
Experience designing and running experiments
Ability to thrive in a collaborative environment
Experience training, working with, and prompting models
Bachelor's degree in a related field or equivalent experience

What Anthropic Offers:

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
