BlueConic, a Dutch-founded international SaaS company, is seeking a Site Reliability Engineer to maintain and improve the reliability and scalability of its customer data operating system. The ideal candidate will be proactive, analytical, and capable of resolving issues in collaboration with development teams.
The Site Reliability Engineer will play a crucial role in ensuring the production platforms on AWS are highly reliable, available, and performing optimally. This involves analyzing bottlenecks, implementing optimizations, responding to incidents, and defining strategies for increased visibility, auto-healing, and auto-recovery.
Role involves:
- Ensuring platform reliability and availability.
- Scaling infrastructure and software.
- Analyzing and resolving issues.
- Implementing improvements and optimizations.
- Responding to incidents.
Requirements:
- At least 3 years of experience as a Software Engineer or Site Reliability Engineer.
- Knowledge of AWS services (EC2, ECS, CloudFront, Config, GuardDuty, Lambda, SES, SNS).
- Professional knowledge of Java and related tooling (YourKit).
- Excellent software engineering skills.
- Expertise in Docker.
- Experience with observability tooling (Splunk, OpenTelemetry, Grafana).
- Familiarity with infrastructure-as-code tooling.
- Deep sense of security.
- Flexibility for on-call duties.
BlueConic offers:
- Opportunities for career advancement.
- A remote-first team environment.
- A multi-cultural and inclusive work culture.