Job Description
GitLab is seeking an Intermediate Site Reliability Engineer to join their Foundations team. This role focuses on maintaining the smooth operation of user-facing services and production systems. The ideal candidate will blend operational skills with software engineering principles to enhance GitLab's environments and codebase.
Role Involves:
- Designing and implementing scalable networking infrastructure.
- Collaborating with cross-functional teams on projects.
- Responding to incidents on an on-call rotation during daytime hours.
- Leading initiatives through problem definition, design, and project management.
- Acting as a subject matter expert in networking and rate limiting services.
- Automating operational tasks.
Requirements:
- Google Cloud Platform expertise, especially in networking and GKE configuration.
- Experience with Terraform and configuration management tools like Ansible and Chef.
- Experience with the Kubernetes ecosystem, including Helm.
- Programming skills in Ruby or Go.
- Understanding of network protocols and familiarity with network observability tools.
- Comfort with scripting languages for automation.
- Experience with GitLab CI or equivalent.
- Strong problem-solving and communication skills.
- Proactive and self-organized mindset.
GitLab Offers:
- The opportunity to work on a large-scale, single-tenancy open-source SaaS site.
- Challenging and rewarding problems that directly impact users.
- A focus on increasing automation and enabling other teams.
- A transparent work environment.