Machine Learning Engineer

ML Engineer to develop systems for LLM inference and fine-tuning.

Job Description

Together AI is seeking a Machine Learning Engineer to help build the systems and APIs that let customers run inference on and fine-tune Large Language Models (LLMs). The ideal candidate has experience implementing runtime systems that perform inference at scale, on models ranging from simple AI/ML models to the largest LLMs. The role involves designing and building the production systems behind the Together Cloud inference and fine-tuning APIs, with a focus on reliability and performance at scale.

Responsibilities:

Design and build the production systems that power the Together Cloud inference and fine-tuning APIs
Partner with researchers, engineers, product managers, and designers
Analyze and improve the efficiency, scalability, and stability of system resources
Conduct design and code reviews
Create services, tools, and developer documentation
Create testing frameworks for robustness and fault-tolerance
Participate in an on-call rotation to respond to critical incidents as needed

Requirements: