Site Reliability Engineer

Broadridge

📍 toronto, on, Canada

Full-time Engineering

Job Description

At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you’re passionate about developing your career, while helping others along the way, come join the Broadridge team.

Key Responsibilities

Monitoring & Incident Response: Develop and enhance monitoring systems (e.g., Datadog) and lead incident response for production outages, working on root cause analysis and prevention.

Infrastructure as Code (IaC): Design and maintain scalable infrastructure using tools like Chef, Terraform, Ansible, or CloudFormation.

System Reliability: Ensure the stability, performance, and scalability of Linux‑based infrastructure and services, leveraging SRE practices to achieve reliability targets (SLAs, SLOs, SLIs).

CI/CD Pipelines: Build, manage, and maintain CI/CD pipelines to automate code deployment and testing, ensuring rapid and safe release cycles.

Automation: Develop and implement scripts and toolin...
Apply for this Position