Flow Traders is seeking a Site Reliability Engineer to contribute to the construction, upkeep, and expansion of its cloud platform. The candidate will ensure the platform's smooth operation, reliability, and scalability using technologies like Kubernetes, Kafka, containers, and automation tools. They will also collaborate with systems engineering teams and remain updated on cloud-native trends.
What the role involves: - Designing, developing, and implementing platform solutions using containerization technologies and container orchestration tools.
- Deploying and managing cloud-native applications on the platform using Infrastructure as Code (IaC) tools.
- Building and maintaining automation tools and scripts for infrastructure provisioning, configuration management, and deployments.
- Implementing and managing industry-standard monitoring solutions to collect and analyze platform metrics for performance optimization and troubleshooting.
- Integrating and managing message streaming platforms like Kafka for real-time data pipelines.
- Collaborating with developers and operations teams to ensure a smooth development and deployment lifecycle.
- Staying up-to-date with the latest trends and technologies in the cloud-native space and identifying opportunities for improvement.
Requirements: - Proven experience as a Platform Engineer or similar role in a DevOps environment.
- In-depth knowledge of containerization technologies (Docker, etc.) and container orchestration tools (Kubernetes).
- Experience with Infrastructure as Code (IaC) tools like Terraform or Ansible.
- Familiarity with cloud platforms (AWS, GCP, Azure) is a plus.
- Experience with automation tools (Bash scripting, Python, etc.) for infrastructure management.
- Experience with monitoring tools for infrastructure and application health.
- Experience with distributed systems such as Kafka, Kubernetes, Hazelcast, or Hadoop is a major plus.
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration skills.