Job Description
CookUnity is seeking a Senior Site Reliability Engineer (SRE) to join their Infrastructure team. The Infrastructure team is responsible for maintaining highly available infrastructure that services millions of customers, guaranteeing availability, reliability, and confidentiality. The team services the requests of the engineering organization related to CICD pipelines, builds, infrastructure, and security.This role involves architecting, implementing, and maintaining robust cloud-native infrastructure and deployment pipelines with a focus on reliability, scalability, and automation. The ideal candidate will collaborate closely with software development and operations teams to ensure continuous delivery, system reliability, and rapid incident response.
Responsibilities: - Architect, deploy, and manage highly available and scalable infrastructure on AWS.
- Design, implement, and maintain Kubernetes clusters (EKS).
- Develop and manage GitOps workflows using ArgoCD.
- Write and maintain infrastructure as code (IaC) using tools such as Terraform.
- Build, optimize, and troubleshoot CI/CD pipelines.
- Develop robust automation scripts and tools in languages such as Kotlin, Python, and/or Bash.
- Proactively monitor system performance, reliability, and security.
- Collaborate with software engineers to improve deployment strategies and system observability.
- Implement and enforce security best practices.
- Maintain comprehensive documentation.
- Experience using GitHub and GitHub Actions to automate, testing and deployments.
Requirements: - 7+ years in SRE, or related roles in cloud-native environments, with at least 5 years of direct experience managing AWS infrastructure at scale
- Proficiency in deploying, managing, and troubleshooting Kubernetes clusters, especially AWS EKS.
- Advanced English Level
- Advanced hands-on experience with ArgoCD for GitOps-based Kubernetes deployments.
- Strong development and scripting skills in Kotlin, Python, and Bash.
- Deep knowledge of CI/CD concepts and tools.
- Demonstrated ability to design and implement infrastructure as code using Terraform and/or AWS CloudFormation.
- Strong problem-solving skills.
- Excellent communication and collaboration abilities.
CookUnity offers: - Payment in USD, Crypto, Euro, or ARS.
- Remote work.
- 15 days of vacation each year.
- 16 fully paid Argentinean holidays.
- Healthcare Benefit: Monthly stipend to use in your preferred healthcare provider
- 5- year Sabbatical: After 5 years with CookUnity, you get a 4-week paid sabbatical
- Paid Family leave
- Compassionate Leave: 3-5 days each time the need arises
- Personalized English coach