Senior Site Reliability Engineer at Confluent

IBM

📍 Toronto, ON, Canada

Full-time Other-General

Job Description

Join Confluent as a Senior Site Reliability Engineer and set new reliability standards. Focus on engineering improvements and incident management in a multi-cloud environment.

In this senior role, you will spend 75% of your time on engineering tasks, enhancing tools, and analyzing failure patterns, and the remaining 25% coaching teams and promoting incident response practices. Your contributions are essential for reducing incidents in Confluent's dynamic cloud landscape.

Key Responsibilities:
• Analyze failure patterns for proactive reliability design
• Manage configuration of Rootly and key integrations
• Define and uphold SLO/SLA frameworks
• Edit incident documents for customer-facing quality
• Create training programs and guide teams through post-mortems

Requirements:
• 10+ years in SRE, incident management, or reliability engineering
• Proficiency with cloud platforms: AWS, GCP, or Azure
• Expertise in management tools like Rootly
• Strong knowledge of distributed syste...