Browse All Jobs
Job Description
Pismo is seeking a Site Reliability Engineer to join their Observability Squad. This role focuses on maintaining and improving the tooling used for monitoring Pismo services, as well as guiding engineers in creating effective observability for their systems. The ideal candidate will have a strong background in software development and distributed systems, with expertise in monitoring tools and OpenTelemetry.

What this role involves:
  • Managing and improving observability services for Pismo engineers.
  • Developing standards and best practices for engineers to create observability for their APIs.
  • Guiding engineers on tuning their monitoring to reduce noise.
  • Providing guidance on applying machine learning to observability.
  • Helping engineers conduct root cause analysis using observability tooling.


Requirements:
  • 8+ years of experience as a Site Reliability Engineer.
  • Background in software development with experience in languages like Python, Golang, Java, or Javascript.
  • Experience designing distributed systems and understanding concepts like REST and enterprise integration patterns.
  • Proficiency in using monitoring tools to gain observability into distributed systems.
  • Familiarity with OpenTelemetry, including configuring and managing OpenTelemetry Collectors.


What this role offers:
  • Opportunity to work with cutting-edge observability technologies.
  • Chance to influence the observability practices of a growing engineering team.
Apply Manually