Wellhub, formerly Gympass, is seeking a Lead Site Reliability Engineer to join its Platform team in Portugal. This role is focused on building a global, secure, recoverable, and cost-efficient infrastructure that enables engineering teams to scale the Wellhub product autonomously.
The Lead Site Reliability Engineer will build tooling and automation to minimize operational processes, focusing on a reliable and real-time logistics platform. They will continuously evaluate infrastructure-related operational processes, seeking opportunities for frictionless operations and delivering intuitive tools for other teams.
Responsibilities:
- Build a global, secure, scalable, and cost-effective cloud platform using Kubernetes on AWS.
- Develop and evolve Kubernetes operators and other cloud-native automation.
- Build products and tools enabling engineering teams to create and maintain their cloud resources autonomously.
- Help ensure security and compliance by delivering secure products and implementing DevSecOps integrations.
- Improve observability, reliability, and cost awareness.
- Support engineering teams in using these products and tools.
- Build and maintain a modern set of CI/CD tools and services.
- Keep all Kubernetes clusters highly available and reliable.
- Contribute to Wellhub's product documentation.
- Participate in defining standards, RFCs, guidelines, and best practices.
Requirements:
- Proven technical experience with AWS cloud services, Kubernetes, and software engineering.
- Deep knowledge of Kubernetes and its ecosystem.
- Solid knowledge of observability systems.
- Experience with operator-managed Infrastructure as Code, preferably Crossplane or Kubernetes operators.
- Ability to write software for production environments.
- Excellent analytical and problem-solving skills.
- Collaboration and learning-driven mindset.
- Excellent verbal and written communication skills in both English and Portuguese.
What Wellhub Offers:
- Access to the Wellhub platform with premium plans available at a discount.
- Fitness subsidy for onsite gyms and fitness studios.
- Flexible work environment with remote options.
- Home office stipend and monthly flexible work allowance.
- Minimum of 25 days of paid holiday per year, plus an additional day for each year of tenure (up to 5 additional days).
- Paid parental leave.
- Career growth opportunities.
- Supportive and inclusive work environment.