Wellhub is seeking a Lead Site Reliability Engineer to join their Platform team. This role is focused on building a global, secure, recoverable, and cost-efficient infrastructure. The Lead Site Reliability Engineer will enable engineering teams to scale Wellhub's product autonomously by building tooling and automation to eliminate operational processes. They will also evaluate infrastructure-related operational processes. The position involves using technologies such as Kubernetes, Kafka, AWS, Github Actions, ArgoCD, Hashicorp Vault, Istio, Knative, Prometheus, and Grafana.Responsibilities:
- Help build a global, secure, scalable, and cost-effective Cloud platform using Kubernetes in AWS.
- Develop and evolve Kubernetes operators and other cloud-native automation in Kubernetes.
- Build products and tools enabling engineering teams to create and maintain their cloud resources autonomously.
- Help ensure security and compliance by delivering secure products and implementing DevSecOps integrations.
- Improve observability, reliability, and cost awareness.
- Support engineering teams in the products and tools usage.
- Build and maintain a modern CI/CD set of tools and services.
- Keep all the Kubernetes clusters highly available and reliable.
- Contribute to Wellhub's product documentation.
- Participate in the definition of standards, RFCs, guidelines, and best practices.
Requirements:
- Proven technical experience with AWS cloud services, Kubernetes, and software engineering.
- Deep knowledge of Kubernetes and its ecosystem.
- Solid knowledge of observability systems.
- Experience with operator-managed Infrastructure as Code.
- Ability to write software for production environments.
- Excellent analytical and problem-solving skills.
- Excellent communication skills in both English and Portuguese.
Wellhub offers:
- Health, dental, and life insurance.
- Flexible work options (remote).
- Home office stipend and a monthly flexible work allowance.
- Flexible schedule.
- Access to onsite gyms and fitness studios, digital fitness programs, and online wellness resources.
- Paid time off, including vacations, days off, and a birthday day off.
- Paid parental leave.
- Opportunities for personal and career growth.