Job Description
Anthropic is seeking a TPU Kernel Engineer to optimize machine learning systems, including research, training, and inference, with a focus on designing and optimizing kernels for TPUs. The engineer will provide feedback to researchers on the performance impact of model changes. Anthropic values collaboration and communication skills.
Role involves:
- Identifying and addressing performance issues in ML systems.
- Designing and optimizing kernels for TPUs.
- Providing feedback to researchers on model performance.
- Implementing low-latency, high-throughput sampling for large language models
- Adapting existing models for low-precision inference
- Building quantitative models of system performance
- Designing and implementing custom collective communication algorithms
- Debugging kernel performance at the assembly level
Requirements:
- Experience optimizing ML systems for TPUs, GPUs, or other accelerators.
- Results-oriented with a bias towards flexibility and impact.
- Enjoy pair programming.
- Interest in machine learning research.
- Care about the societal impacts of work.
- Bachelor's degree in a related field or equivalent experience.
Anthropic offers:
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.