Net2Source (N2S)
Site Reliability Engineer H/F (IT) (Winnipeg)
Job Description
Become a crucial part of L1 Site Reliability Engineering focused on monitoring and automating operational tasks across enterprise applications. Leverage your skills with Kubernetes, APIs, and multi-cloud environments to ensure seamless performance. This L1 Site Reliability Engineer role demands up to five years in IT operations, NOC, or SRE roles. You will be involved in monitoring systems using Grafana, Splunk, and Prometheus, while also triaging incidents and following standard runbooks for rapid resolution. Your expertise in automation with Python or Bash will streamline processes, enhancing operational workflow. Key Responsibilities
Monitor systems with Grafana, Datadog, and AIOps tools Execute predefined runbooks for quick incident resolution Validate Kubernetes performance using dashboard metrics Collect and analyze logs for proactive issue detection Communicate effectively with stakeholders throughout incidents Requirements
2–5 years in IT operations or SRE rol...
Monitor systems with Grafana, Datadog, and AIOps tools Execute predefined runbooks for quick incident resolution Validate Kubernetes performance using dashboard metrics Collect and analyze logs for proactive issue detection Communicate effectively with stakeholders throughout incidents Requirements
2–5 years in IT operations or SRE rol...