Browse All Jobs
Job Description
Perplexity is seeking an AI Systems Engineer to join their expanding team. The company has experienced significant growth since launching its conversational answer engine and serves numerous enterprise clients. The AI Systems Engineer will focus on the large-scale deployment of machine learning models for real-time inference, working with technologies like Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes.Responsibilities include:
  • Developing robust APIs for AI inference.
  • Designing, deploying, and maintaining scalable infrastructure.
  • Benchmarking system performance and implementing improvements.
  • Enhancing system reliability and observability.
  • Responding to system outages and collaborating with other teams.
Qualifications include:
  • Experience in developing APIs and managing distributed systems.
  • Strong understanding of Kubernetes and container orchestration.
  • Experience with deploying reliable, distributed, real-time systems at scale.
  • Familiarity with LLM architecture.
Perplexity offers:
  • Comprehensive health, dental, and vision insurance.
  • A 401(k) plan.
  • Equity may be part of the total compensation package.
Apply Manually