Job Description
Visier is seeking a Senior Site Reliability Engineer to join its Shared Services SRE team. This team operates the cloud infrastructure underlying Visier's technology platform and works with development teams to use these technologies effectively in production. The role involves managing AWS integration, the API gateway, the Cassandra, Kafka, Vault, and Consul implementations, the data science workbench, and network infrastructure security.
Responsibilities:
- Deploying and maintaining highly available services in AWS using Terraform, CloudFormation, and Jenkins.
- Debugging production issues across hardware, OS, and application layers.
- Working with the Kong API gateway for secure API access.
- Writing secure code to protect Visier and customer data.
- Optimizing diagnostics infrastructure components such as Splunk, CloudWatch, and Prometheus.
- Supporting large clusters of third-party systems such as Cassandra, Postgres, and Kafka.
- Preparing for and simulating disasters.
- Collaborating with development teams to design infrastructure for application features.
Requirements:
- Extensive experience in networking, network security, firewalls, routing, DNS, and Linux.
- Hands-on proficiency with AWS services (EC2, S3, RDS, IAM, Lambda, VPC).
- Strong knowledge of deployment and configuration management tools.
- Strong experience writing Terraform code and modules for Infrastructure as Code (IaC).
- Experience in troubleshooting and root cause analysis.
- Experience in system security patching.
- Strong experience with container technologies such as Kubernetes or ECS.
- Coding skills in Java, Scala, Python, or Groovy.
Visier offers: