Machine Learning Engineer - Inference

Machine Learning Engineer for AI inference optimization at Together AI.

Job Description

Together AI is seeking a Machine Learning Engineer to optimize AI inference systems. The successful candidate will work with large language models to ensure efficient and effective performance at scale. This role offers the chance to collaborate with AI researchers and engineers, contributing to cutting-edge AI solutions.Together AI believes open and transparent AI systems will drive innovation for better outcomes. The company aims to significantly lower the cost of modern AI systems.Responsibilities:

Design and build production systems for the Together AI inference engine.
Develop and optimize runtime inference services for large-scale AI applications.
Collaborate with researchers, engineers, product managers, and designers.
Conduct design and code reviews.
Create services, tools, and developer documentation.
Implement robust and fault-tolerant systems for data ingestion and processing.

Requirements: