Job Description
Veeam, the leading provider of data resilience solutions, is seeking a Site Reliability Engineer to join its expanding team in Prague. The role focuses on the company's SaaS platform, which is built on Microsoft Azure and delivers top-tier data protection services. The Site Reliability Engineer will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions on Microsoft Azure, automating deployments, and ensuring the resilience, security, and efficiency of the SaaS application platform.
The role involves:
- Designing, implementing, and maintaining scalable infrastructure solutions on Microsoft Azure.
- Automating deployments and maintaining a resilient SaaS platform.
- Supporting delivery and release pipelines.
- Improving system reliability, performance, and scalability.
- Developing monitoring and alerting solutions.
- Responding to incidents in production environments (on-call rotations).
- Meeting information security and compliance standards.
- Defining and improving internal standards.
Requirements:
- 3+ years in 24x7 production operations for a SaaS or cloud service provider.
- Experience with infrastructure and application monitoring tools (Azure Monitor, Application Insights, Elastic Cloud).
- Experience managing Azure IaaS and PaaS solutions.
- Strong problem-solving skills in distributed environments.
- Experience with container orchestration platforms.
- System programming skills in languages such as Python, PowerShell, Bash, or Go.
- Experience with CI/CD practices and tools (Azure DevOps or similar).
- Experience with distributed, event-based messaging architectures.
- English proficiency for international team communication.
Veeam offers:
- Premium healthcare program.
- Annual vacation and sick days.
- Meal vouchers.
- Public transportation subscription.
- MultiSport card.
- Cafeteria Benefit Plan.
- Veeam Care Days for volunteering.
- Professional training and education.