Job Description
Lightspeed is seeking a Senior Site Reliability Engineer to join their SRE team in Montreal. This role focuses on empowering data teams with a scalable, secure, and high-performance infrastructure. The ideal candidate will be passionate about data security, reliability, and high availability.
Responsibilities:
- Collaborate with Data teams to design and implement scalable, reliable, secure, and cost-efficient Cloud infrastructure.
- Ensure security in a holistic manner, including infrastructure, supply chain, and interaction with third-party systems.
- Contribute to the development of data and infrastructure self-service workflows.
- Advocate for best practices in terms of Infrastructure as Code, High Availability, Disaster Recovery, and Security.
- Perform competitive analysis for infrastructure frameworks and data processing solutions.
- Participate in day-to-day support and troubleshooting.
Requirements:
- Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
- Strong experience managing production environments.
- Strong experience with Google Cloud Platform.
- Strong experience managing infrastructure with code, preferably Terraform.
- Proficiency in Bash, Go, or Python.
- Understanding of GCP Data and Security Components.
- Expertise in Linux/Unix and Networking.
- Hands-on experience with Docker, Kubernetes, MySQL, and PostgreSQL.
- Experience with networking tools (VPN, VPC, VPC-SC).
Lightspeed offers:
- Equity for all Lightspeeders.
- Flexible paid time off and hybrid work policies (3 days in Montreal Office).
- Health insurance.
- Contributions to your pension plan - RRSP.
- Health and wellness benefit of $500 per year.
- Paid leave and assistance for new parents.
- Mental health online platform and counseling & coaching services.
- Training opportunities to grow your skills and career.
- Fully stacked kitchen (hot and cold beverages, meals served).
- Happy hours.