Browse All Jobs
Job Description

Elastic is seeking a Site Reliability Engineer to join its Platform Engineering department. This role focuses on designing, building, and scaling the multi-cloud platform that hosts both internal and external services, including Elastic Cloud Hosted and Serverless. The SRE team develops new software and tools to support the infrastructure, enabling rapid product deployment across Elastic.

The ideal candidate will take an engineering approach to automate system engineering efforts, ensuring the reliability of Elastic's global infrastructure. They will grow the global platform infrastructure to meet increasing scaling demands by developing and maintaining software, tooling, and automations. The role involves championing a collaborative environment focused on operational excellence and uplifting others, as well as responding to and preventing repeated customer impact in response to major incidents and prioritized problem management.

What this role involves:

  • Leading technical initiatives for automating system engineering efforts.
  • Developing and maintaining software, tooling, and automations to meet scaling demands.
  • Championing a collaborative environment focused on operational excellence.
  • Responding to and preventing customer impact from major incidents.

Requirements:

  • Experience in striving for platform reliability.
  • A customer-first approach to solving operational problems with an SRE perspective.
  • A background in software engineering.
  • Experience in public cloud and managed Kubernetes services.
  • Passion for developing solutions that involve inclusive communication methods.

What this role offers:

  • Competitive pay based on the work you do.
  • Health coverage for you and your family in many locations.
  • Ability to craft your calendar with flexible locations and schedules for many roles.
  • Generous number of vacation days each year.
  • Company-matched 401k with dollar-for-dollar matching up to 6% of eligible earnings.
Apply Manually