Job Description
Rent the Runway is seeking a Site Reliability Engineer to join their team in Galway, Ireland. This role involves contributing to technology initiatives in cloud infrastructure, software delivery, and observability. The Site Reliability Engineer will be responsible for building and developing tooling, policies, and processes to advance Rent the Runway to higher levels of scale and performance. He/She will lead assigned projects and ensure their overall delivery.
What This Role Involves: - Utilizing technologies like Terraform, Helm, Python, Go, Docker, and Kubernetes to drive service reliability.
- Implementing software development practices to build observability, alerting, tracing, automation, and self-healing capabilities.
- Coordinating across platforms, supporting, identifying, responding to, and reporting issues.
- Developing maintenance and operations automation through CI/CD.
Requirements: - 2 years of hands-on experience with orchestration tools such as Kubernetes and/or Helm.
- Proficiency in Terraform, Ansible, or Helm, with an understanding of CI/CD tools like GitHub, GitLab, and Artifactory.
- Practical experience with monitoring, alerting, and logging tools, including Splunk and GCP Monitoring.
- 2 years of experience in maintaining production environments across cloud platforms like GCP, AWS, or Azure.
- Experience working within Agile teams.
- Willingness to participate in an on-call rotation.
What Rent the Runway Offers: - Generous Paid Time Off.
- Universal Paid Parental Leave + flexible return to work program.
- Paid Sabbatical after 5 years of continuous service.
- Competitive Stakeholder Pension.
- Comprehensive health and dental care.
- Company-wide events and outings.
- Hybrid Work (2-3 days per week in the Galway, Ireland office).