Job Description
The Site Reliability Engineer (SRE) role is based in Jakarta, Indonesia. The company's SRE team architects, builds, and maintains the infrastructure that applications rely on. The team collaborates with development teams to ensure scalability, reliability, and efficiency, which supports the customer experience and lets developers focus on building features.
What this role involves:
- Deploy, automate, maintain, and manage cloud-based and on-premises production systems.
- Document new and existing requirements for smooth project delivery.
- Work with security and infrastructure teams to adopt security best practices.
- Ensure the availability, performance, scalability, and security of production systems.
- Troubleshoot and resolve system issues across platform and application domains.
- Suggest architectural improvements and recommend process optimizations.
- Evaluate new technologies to enhance the infrastructure stack.
- Ensure system security policies are enforced and any violations are remediated.
- Drive and implement automated provisioning and scaling of servers.
- Handle operational tasks, including on-call duties, alerts, and incident management.
Requirements:
- Minimum 2 years of engineering experience.
- Bachelor’s or Master’s degree in a relevant field (e.g., IT, Computer Science) or a proven track record in DevOps.
- Willingness to continuously upgrade skills and stay up to date with the latest DevOps trends.
- Experience with cloud-native tools (e.g., Kubernetes, Docker, Nginx, OpenTelemetry) is a plus.
- Experience managing cloud servers (AWS, GCP).
- Interest in transitioning into engineering management is a plus.
- Experience with on-premises physical servers, databases, and storage solutions (MySQL, PostgreSQL, Redis) is a plus.
- Familiarity with Infrastructure as Code (IaC) tools (Terraform, Pulumi) is a plus.