Job Description

BTIG is seeking a Senior Site Reliability Engineer (SRE) to join its engineering team and take responsibility for the infrastructure that runs the algorithmic trading systems. This is an opportunity to apply your skills and lead in a fast-paced, high-performance environment. The trading systems operate in a latency-sensitive, high-throughput on-prem environment, integrating with global exchanges, market data feeds, and client order flows. The SRE will work closely with developers and DevOps to design and maintain systems that are highly available, scalable, and fault tolerant.

The role involves:

  • Building and managing reliable, scalable infrastructure using infrastructure-as-code principles
  • Automating operational processes: monitoring, deployment, incident response, and recovery
  • Instrumenting and monitoring systems to proactively identify and fix bottlenecks and failure points
  • Participating in incident response and postmortems
  • Driving improvements in service reliability, performance, and observability
  • Collaborating with software engineers to embed reliability into application design and deployment
  • Continuously improving CI/CD pipelines to support faster, safer deployments

Requirements:

  • 3+ years of experience in SRE, DevOps, or infrastructure engineering roles
  • Strong experience with both on-prem and cloud infrastructure management
  • Proficiency with scripting or programming (Python, Go, Bash, etc.)
  • Experience with containerization and orchestration (Docker, Kubernetes)
  • Solid understanding of Linux internals, networking, and system performance tuning
  • Deep experience with monitoring and alerting tools (Prometheus, Grafana, Datadog, etc.)
  • Familiarity with CI/CD tooling (Jenkins, GitHub Actions, ArgoCD, etc.)

BTIG offers:

  • Competitive compensation and benefits package