Job Description
The Site Reliability Engineering team dedicated to Efficiency and Performance at Cisco ThousandEyes is seeking a Senior Site Reliability Engineer. The role is central to driving AWS cost intelligence and managing the ThousandEyes infrastructure: the engineer will lead efforts to streamline infrastructure management, optimize cloud expenditures, and ensure efficient resource utilization and performance. The position includes participation in a "follow the sun" incident response and on-call rotation.
What the role involves:
  • Designing and implementing scalable, well-tested solutions.
  • Streamlining operations within the ThousandEyes infrastructure.
  • Optimizing cloud expenditures.
  • Streamlining infrastructure management.
  • Ensuring efficient resource utilization.
  • Participating in, and helping to improve, our "follow the sun" incident response and on-call rotation.
Requirements:
  • Strong hands-on experience in cloud, preferably AWS.
  • Strong Infrastructure as Code skills, ideally with Terraform and Kubernetes.
  • Previous experience in AWS cost management.
  • Understanding of Prometheus and its ecosystem, including Alertmanager.
  • Ability to write high-quality code in Python, Go, or equivalent languages.
  • Good understanding of Unix/Linux systems, the kernel, system libraries, file systems, and client-server protocols.
What the role offers:
  • Opportunity to work with a Digital Experience Assurance platform.
  • Being part of a team focused on driving AWS cost intelligence and managing infrastructure.
  • Contributing to the overall performance and reliability of ThousandEyes services.

Cisco ThousandEyes

Cisco ThousandEyes is a Digital Experience Assurance platform that helps organizations ensure optimal digital experiences across all networks. Leveraging AI and comprehensive telemetry data from cloud, internet, and enterprise networks, ThousandEyes enables proactive detection, diagnosis, and remediation of issues. Integrated within Cisco's technology portfolio, it delivers AI-driven insights for networking, security, collaboration, and observability, facilitating scalable deployments and enhanced end-user experiences.
