Job Description
Alchemy is seeking a Site Reliability Engineer to join its Infrastructure department. The engineer will collaborate with the engineering team to design, deploy, and continuously improve the infrastructure supporting Alchemy's globally used developer platform. The role focuses on enhancing developer productivity and ensuring product reliability as the company scales. The Infrastructure team's mission is to provide the infrastructure, tooling, and expertise needed to allow Alchemy engineers to ship, scale, and operate high-quality products in a fast, safe, and cost-efficient manner.
What this role involves:
- Setting high standards for Reliability at Alchemy.
- Developing and owning company-wide Reliability best practices.
- Architecting production infrastructure and tools that encourage and enforce high reliability.
- Inspiring the broader engineering organization to ensure Reliability.
- Collaborating, partnering, advising, reviewing, and mentoring engineering teams on Reliability topics.
- Improving critical infrastructure and systems used to operate infrastructure at scale.
- Developing and owning best practices for managing production infrastructure and developer processes.
- Providing input into long-term platform requirements and operational guidelines.
- Continuously raising the standard of engineering excellence.
- Building and maintaining documentation around processes and workflows.
Requirements:
- 5+ years of experience as an Infrastructure Engineer focused on Reliability.
- Experience leading and driving company-wide reliability efforts and engineering initiatives.
- Experience with observability best practices and tooling.
- Experience designing and operating large-scale, multi-region production systems.
- Experience working with AWS or other cloud infrastructures.
- Experience with container schedules and runtimes such as Docker and Kubernetes.
- Experience building deployment pipelines leveraging common CI/CD tools.
- Experience with Infrastructure-as-Code.
- Strong communication and collaboration skills.
- (Preferred) Experience with running production services on bare-metal.
- (Preferred) Experience with Typescript and Python.
- (Preferred) Excellent understanding of web applications and architecture.
What Alchemy offers:
- Competitive compensation, including base salary and equity.
- Comprehensive medical, dental, and vision coverage.
- 401k.
- Unlimited flexible time off.