Job Description
Aerospike is seeking a Senior Site Reliability Engineer to join their Aerospike Cloud team in Bengaluru. The ideal candidate will play a crucial role in designing, building, and optimizing scalable and resilient cloud-based Aerospike deployments. He will focus on enhancing reliability, performance, and automation, ensuring the platform efficiently supports multiple cloud product offerings.
Role involves:
- Designing, implementing, and managing large-scale Aerospike deployments across multiple cloud environments.
- Developing expertise in Aerospike and its cloud deployment patterns.
- Automating infrastructure and service configurations.
- Building and maintaining monitoring, alerting, and observability solutions.
- Implementing and enforcing security best practices for cloud infrastructure.
- Participating in incident response and continuous improvement initiatives.
- Collaborating with development teams to align new deployments with SRE best practices.
- Being part of a 24/7 on-call rotation.
Requirements:
- 6+ years of experience in Site Reliability Engineering (SRE), DevOps, or related fields.
- Hands-on experience designing, deploying, and optimizing production-grade systems in cloud environments.
- Expertise with at least one major public cloud provider (AWS, Google Cloud, or Azure).
- Strong proficiency in infrastructure-as-code (IaC) tools such as Terraform.
- Experience in CI/CD pipeline design and implementation.
- Deep understanding of Linux/Unix systems, networking fundamentals, and distributed system architectures.
- Proficiency in scripting and software development using Python, Bash, or Go.
- Experience with containerization and orchestration technologies such as Docker and Kubernetes.
- Hands-on experience with monitoring, logging, and observability tools.
- Strong problem-solving skills with an engineering-first mindset.
- Experience implementing security best practices for cloud infrastructure.
- Excellent English communication skills (verbal and written).
Role offers:
- Opportunity to work on a real-time data platform.
- Chance to enhance reliability, performance, and automation of cloud-based Aerospike deployments.
- Collaborative environment with development teams.