Job Description
Xometry is seeking a Sr. Manager of Site Reliability Engineering (SRE) to define the strategic direction for SRE teams and initiatives. This role involves building cost-effective, secure, fast, and reliable systems for Xometry's global manufacturing marketplace. The Sr. Manager will collaborate with engineering, product, and program management leaders to improve operational rigor, efficiency, and engineering velocity.
Responsibilities include:
- Defining standards, metrics, and practices to improve operational rigor and engineering velocity.
- Establishing automated and self-service strategies to improve operational efficiency.
- Championing and measuring observability, monitoring, and metrics practices.
- Supervising the development, configuration, and maintenance of underlying platforms and tools.
Requirements:
- 7+ years of experience in software development and site reliability.
- Experience in defining and operationalizing SLOs, SLAs, and error budgets.
- Strong understanding of infrastructure automation and observability within distributed systems.
- Proven track record of building and growing a high-performing SRE team.
- Must be a US person (citizen or green card holder).
Xometry offers:
- Opportunity to define strategic direction for SRE teams.
- Chance to build cost-effective, secure, and reliable systems.
- Collaboration with engineering, product, and program management leaders.