Reddit is seeking a Senior Site Reliability Engineer to enhance the reliability and performance of its engineering platforms and services. The successful candidate will work within the Infrastructure SRE team, collaborating closely with the Compute, Traffic, and Observability infrastructure teams. This role offers a unique opportunity to impact one of the internet's most influential platforms by improving its foundational infrastructure.Reddit is rapidly innovating, and the SRE teams are working to meet the needs of infrastructure and development teams as they evolve the product faster than ever before.Responsibilities:
Work closely with engineering teams in designing and developing resilient and highly performant systems.
Identify and build capabilities into foundational Infrastructure and Platform services.
Deliver software to improve the availability, scalability, latency, and efficiency of observability components.
Automate repetitive, manual, or risky tasks.
Diagnose and fix network, system, and service-level issues.
Practice sustainable incident response and drive structural improvement with blameless postmortems.
Observe and improve performance, reduce cost, and improve user experience.
Requirements:
5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
Proficiency in one or more programming languages (Go and Python preferred).
Experience with Kubernetes and Cloud systems.
Familiarity with distributed systems development (Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, Loki).
Experience with the development and operation of high-traffic backend systems.
Strong debugging, fixing, and optimization skills.
Troubleshooting skills across applications, networking (TCP/IP), and systems.
Strong working knowledge of Linux and containers.
Excellent communication and collaborative skills.
What Reddit Offers:
Private Medical, Dental and Vision Benefits
Retirement Savings plan with matching contributions
Reddit is a prominent online platform characterized by its community-driven structure, enabling users to engage in authentic conversations centered around shared interests. Functioning as a vast network of over 100,000 active communities, Reddit serves as a significant source of information, where users contribute, evaluate, and discuss diverse topics daily. The platform fosters open dialogue and collaboration, establishing itself as a leading destination for online interaction and information exchange.