Browse All Jobs
Job Description
ClickHouse is seeking a Senior Site Reliability Engineer to join their team. This role involves building and leading processes to ensure the reliability, availability, scalability, and performance of ClickHouse Cloud's infrastructure. The Senior Site Reliability Engineer will collaborate with various engineering teams, including Control Plane, Dataplane, Core, Security, Support, and Operations, to design and implement scalable, secure, highly available, and fault-tolerant distributed systems. They will also manage incident response, post-mortem analysis, and continuous improvement of ClickHouse services.

Role involves:
  • Collaborating with engineering teams to design and implement scalable, secure systems.
  • Establishing and managing service level objectives (SLOs) and service level agreements (SLAs).
  • Ensuring monitoring and alerting for all infrastructure components.
  • Enhancing incident response processes and post-mortem analysis.
  • Continuously improving the reliability and performance of ClickHouse services.
  • Planning and driving Chaos initiatives across Engineering teams.
  • Managing on-call processes and establishing best practices for issue resolution.

Requirements:
  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • At least 8 years of experience in Site Reliability Engineering or a related field.
  • Previous experience using ClickHouse in production.
  • Coding experience with Go and/or Python.
  • Strong knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
  • Excellent understanding of distributed databases and SQL, particularly ClickHouse is a major plus.
  • Hands-on experience with container orchestration tools such as Kubernetes or Docker Swarm.
  • Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet.
  • Strong problem-solving and production debugging skills.
  • Passion for efficiency, availability, scalability, and data governance.
  • Ability to thrive in a fast-paced environment as part of a global team.
  • High level of responsibility, ownership, and accountability.
  • Excellent communication and interpersonal skills.

ClickHouse offers:
  • Flexible work environment.
  • Healthcare contributions.
  • Equity in the company.
  • Flexible time off.
  • $500 Home office setup (for remote employees).
  • Global Gatherings.
Apply Manually