Job Description
Zup is seeking a passionate and experienced DevOps/Site Reliability Engineer (SRE) to ensure the stability and scalability of its critical credit decision services. The ideal candidate will be responsible for maintaining the efficiency and sustainability of the products, working in a challenging environment with constant knowledge exchange.
This role involves:
- Ensuring high availability and resilience of credit decision services.
- Designing and implementing scalable solutions using AWS services, containers, and CI/CD tools.
- Collaborating with development, data, and product teams.
- Participating in plannings, development, code reviews, and deployments.
- Monitoring systems, responding to incidents, and preventing failures.
- Contributing to continuous improvement of processes.
- Automating tasks and ensuring quality with continuous testing.
The requirements are:
- Experience with cloud infrastructure (AWS) and container orchestration (Docker, Kubernetes).
- Experience with observability tools (CloudWatch, Prometheus, Grafana, Datadog, etc.).
- Knowledge of CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, etc.).
- Good communication, organization, teamwork, and responsibility.
- Practical experience in incident resolution and a focus on automation and reliability.
Zup offers:
- Remote work flexibility.
- Career development opportunities.
- Health and wellness benefits, including medical and dental plans.
- Financial benefits such as meal vouchers and life insurance.