Kaseya Limited
Kaseya Production Site Reliability Engineer
Job Description
Become a Site Reliability Engineer at Kaseya and safeguard our production systems as we scale. Your role includes leading incident responses and implementing automation for optimal performance.
As a Site Reliability Engineer, you will define SLOs, lead critical incident responses, and manage cloud infrastructures using Terraform or CloudFormation. Enhancing observability and working closely with development teams are crucial aspects of this role, ensuring our services meet the reliability standards that MSPs depend on.
Key Responsibilities: • Set and monitor SLOs, SLIs, and manage error budgets • Facilitate incident resolution and lead preventive postmortems • Automate infrastructure management with Infrastructure as Code • Manage cloud infrastructure while balancing costs and scalability • Foster system observability with proactive monitoring solutions
Requirements: • 4 to 5 years of experience in AWS production • Expertise in Terraform or CloudFormation for...
As a Site Reliability Engineer, you will define SLOs, lead critical incident responses, and manage cloud infrastructures using Terraform or CloudFormation. Enhancing observability and working closely with development teams are crucial aspects of this role, ensuring our services meet the reliability standards that MSPs depend on.
Key Responsibilities: • Set and monitor SLOs, SLIs, and manage error budgets • Facilitate incident resolution and lead preventive postmortems • Automate infrastructure management with Infrastructure as Code • Manage cloud infrastructure while balancing costs and scalability • Foster system observability with proactive monitoring solutions
Requirements: • 4 to 5 years of experience in AWS production • Expertise in Terraform or CloudFormation for...