Browse All Jobs
Job Description
Cresta is seeking a Senior Software Engineer, Backend (AI Platform) to join their team. This role involves designing, building, and maintaining low-latency, highly-available serving stacks for in-house ML model serving and integrating with LLM serving partners. The engineer will automate training pipelines, orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices. They will also optimize at scale by profiling and tuning throughput, memory, and cost, introducing caching, sharding, batching, and GPU/CPU autoscaling.Role involves:
  • Owning model serving
  • Automating training pipelines
  • Optimizing at scale
  • Building platform primitives
  • Raising the bar for engineering standards
Requirements:
  • 5+ years writing production software; 2+ years focused on ML platform or infra
  • Expert Python (async, typing, packaging, performance)
  • Working Golang knowledge for systems components
  • Proven experience with one or more serving frameworks (e.g., vLLM, Triton, TorchServe)
  • Kubernetes and cloud-native ops
  • Solid grasp of distributed systems, networking, and container security
  • Culture of rigorous testing, code review, and continuous delivery
Cresta offers:
  • Medical, dental, and vision plans
  • Paid parental leave
  • Monthly Health & Wellness allowance
  • Work from home office stipend
  • Lunch reimbursement for in-office employees
  • PTO: 3 weeks in Canada
Apply Manually