Job Description
Tenstorrent is seeking an experienced engineer for an AI Model Productization and Benchmarking role. The position is based in either Warsaw or Gdansk, Poland, with a hybrid work arrangement. The engineer will focus on making models customer-ready and developing benchmarking infrastructure.
Role involves:
- Designing and executing model testing protocols.
- Developing and executing performance and accuracy benchmarking tests.
- Analyzing and optimizing system performance.
- Conducting competitive analysis.
- Collaborating with cross-functional teams.
- Integrating LLMs with inference server platforms.
- Tracking AI model accuracy and performance in a CI/CD environment.
- Identifying and triaging regressions.
Requirements:
- Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, Machine Learning, or a related field.
- Strong background in AI model benchmarking and profiling.
- Experience with scalable AI infrastructure, including distributed computing environments.
- Proficiency in Python for AI workload optimization.
- Familiarity with LLM frameworks, AI accelerators, and performance tuning methodologies.
- Familiarity with GitHub CI/CD environments.
- Familiarity with LLM inference servers (e.g., vLLM) is a bonus.
- Ability to interpret and analyze hardware/software interactions.
Tenstorrent offers:
- Highly competitive compensation package and benefits.