Browse All Jobs
Job Description
Tenstorrent is seeking an experienced engineer for an AI Model Productization and Benchmarking role. The position is based in either Warsaw or Gdansk, Poland, on a hybrid work arrangement. The engineer will focus on making models customer-ready and developing benchmarking infrastructure.Role involves:
  • Designing and executing model testing protocols.
  • Developing and executing performance and accuracy benchmarking tests.
  • Analyzing and optimizing system performance.
  • Conducting competitive analysis.
  • Collaborating with cross-functional teams.
  • Integrating LLMs with inference server platforms.
  • Tracking AI model accuracy and performance in a CI/CD environment.
  • Identifying and triaging regressions.
Requirements:
  • Bachelor's, Master’s, or PhD in Computer Science, Electrical Engineering, Machine Learning, or a related field.
  • Strong background in AI model benchmarking and profiling.
  • Experience with scalable AI infrastructure, including distributed computing environments.
  • Proficiency in Python for AI workload optimization.
  • Familiarity with LLM frameworks, AI accelerators, and performance tuning methodologies.
  • Familiarity with Github CI/CD environments.
  • Familiarity with LLM inference servers (e.g. vLLM) is bonus.
  • Ability to interpret and analyze hardware/software interactions.
Tenstorrent offers:
  • Highly competitive compensation package and benefits.
Apply Manually