Job Description
Perplexity is offering an Internship Program for Master’s and PhD students (or recent graduates) in AI or Computer Science in the UK. The intern will work directly with Perplexity's AI Inference team to support the inference engines serving the models behind Perplexity. This program offers a chance to gain experience in a rapidly growing AI startup, with potential full-time offers for outstanding performers.
Responsibilities:
- Work with the inference team to improve serving latency and throughput
- Bring up support for new models and accelerate inference for existing ones
- Optimize inference across the entire stack, from GPU kernels to serving endpoints
Qualifications:
- Pursuing a Master's or PhD (or recently graduated) in Computer Science with a focus on Artificial Intelligence or Performance
- Experience with ML frameworks (Torch, JAX)
- Experience with GPU programming (CUDA, Triton)
- Experience with High-Performance Computing (OpenMPI)
Perplexity offers:
- Laptop
- Hybrid schedule: 3 days from the office, 2 days WFH
- Potential full time position at the end of the program