Job Description
Tenstorrent is seeking a Lead, Acceleration Kernel Developer to guide the optimization of low-level workloads and enhance software performance for machine learning applications. The role is based in Santa Clara, CA; Austin, TX; or Toronto, ON, and is a hybrid position. The ideal candidate will contribute to software development and lead a team of developers.
Responsibilities:
- Lead the design, development, and maintenance of acceleration kernel software components.
- Develop and optimize kernels and kernel libraries for efficient machine learning and HPC applications.
- Lead the implementation and optimization of tensor compute and tensor data movement kernels.
- Drive efforts in analyzing and optimizing low-level code, with a strong emphasis on tensor optimization.
- Collaborate with machine learning engineers to integrate optimized kernels into machine learning frameworks.
- Identify performance bottlenecks, conduct performance profiling, and develop strategies to resolve them.
- Oversee the creation of comprehensive unit tests and ensure the stability of kernel-level code.
- Create and maintain documentation for code, APIs, and best practices.
- Stay abreast of the latest developments in kernel development and machine learning to propose innovative solutions.
Requirements:
- Bachelor’s degree in Computer Science, Software Engineering, or a related field.
- Proven experience in kernel development, with a strong focus on low-level optimizations and tensor optimization.
- Proficiency in C/C++ programming languages.
- Familiarity with machine learning frameworks and concepts.
- Strong problem-solving skills and the ability to analyze and debug complex issues.
- Experience with performance profiling and optimization tools.
- Excellent communication and leadership skills.
- Self-motivated, detail-oriented, and able to work independently as well as lead a team.
Tenstorrent offers:
- Highly competitive compensation package and benefits.
- Equal opportunity employment.