Kaseya Limited

Kaseya Production Site Reliability Engineer

📍 Location

Toronto, ON

⏰ Job Type

Full-time

📅 Posted

June 05, 2026


Job Description

Become a Site Reliability Engineer at Kaseya and safeguard our production systems as we scale. Your role includes leading incident response and implementing automation that keeps those systems performing reliably.

As a Site Reliability Engineer, you will define SLOs, lead critical incident response, and manage cloud infrastructure using Terraform or CloudFormation. You will also improve observability and work closely with development teams so that our services meet the reliability standards that MSPs depend on.

Key Responsibilities:
• Set and monitor SLOs and SLIs, and manage error budgets
• Facilitate incident resolution and lead preventive postmortems
• Automate infrastructure management with Infrastructure as Code
• Manage cloud infrastructure while balancing cost and scalability
• Foster system observability with proactive monitoring solutions

Requirements:
• 4 to 5 years of experience with AWS in production
• Expertise in Terraform or CloudFormation for...
