Job Description
Anthropic is seeking a Research Engineer/Scientist to join its Safeguards Research Team. The team conducts critical safety research and engineering to ensure AI systems can be deployed safely. The role spans immediate safety challenges and longer-term research initiatives, including jailbreak robustness, automated red-teaming, monitoring techniques, and applied threat modeling. The ideal candidate will take a pragmatic approach to machine learning experiments, helping Anthropic understand and steer the behavior of powerful AI systems. They will focus on risks from powerful future systems while also working to better understand the risks that arise today.
The role involves:
- Testing the robustness of safety techniques.
- Running multi-agent reinforcement learning experiments.
- Building tooling to evaluate LLM-generated jailbreaks.
- Writing scripts and prompts to test models’ reasoning abilities.
- Contributing to research papers, blog posts, and talks.
- Running experiments that inform Anthropic's broader AI safety efforts.
Requirements:
- Significant software, ML, or research engineering experience.
- Experience contributing to empirical AI research projects.
- Familiarity with technical AI safety research.
- Bachelor's degree in a related field or equivalent experience.
Anthropic offers:
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.