Browse All Jobs
Job Description
Anthropic is seeking a Research Engineer to join their Reinforcement Learning Fundamentals team. This role involves collaborating with researchers and engineers to enhance the capabilities and safety of large language models through reinforcement learning research. The engineer will focus on improving reasoning abilities in areas like code generation and mathematics, and exploring reinforcement learning for agentic tasks.Role involves:
  • Developing and implementing novel reinforcement learning techniques.
  • Creating tools and environments for model interaction.
  • Designing and running experiments to enhance models' reasoning capabilities.
Requirements:
  • 5+ years of industry-related experience.
  • Proficiency in Python and experience with deep learning frameworks (PyTorch or Jax).
  • Strong software engineering background.
  • Interest in pair programming.
  • Commitment to code quality, testing, and performance.
  • Passion for AI and its safe development.
Role offers:
  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
Apply Manually