L1 Site Reliability Engineer Position

Net2Source (N2S)

📍 winnipeg, mb, Canada

Full-time Engineering

Job Description

Contribute to L1 Site Reliability Engineering at a leading enterprise, focusing on monitoring and automating IT operations. Your expertise in Kubernetes and APIs is key to maintaining performance.
This entry-level engineer role requires up to five years of experience in IT operations, NOC, or Site Reliability Engineering. You'll actively monitor systems with Grafana, Splunk, and Prometheus while triaging incidents and executing runbooks. Automation skills in Python or Bash will enhance operational workflows and streamline processes effectively.
Key Responsibilities:
• Monitor systems using Grafana, Datadog, and AIOps tools
• Execute predefined runbooks for quick incident resolution
• Validate Kubernetes performance with dashboard metrics
• Collect and analyze logs for proactive issue detection
• Communicate with stakeholders during incidents
Requirements:
• 2–5 years in IT operations or SRE roles
• Strong knowledge of Linux and networking principles
• F...
Apply for this Position