Principal Cloud Site Reliability Engineer - United Kingdom

Remote Full-time
We're looking for a Principal Cloud Site Reliability Engineer - United Kingdom

This role is Remote, United Kingdom

We are seeking a Principal Cloud Site Reliability Engineer with strong Incident Management, Kubernetes, and Terraform expertise to ensure the reliability, scalability, and operational excellence of our production platforms. The ideal candidate will combine software engineering, infrastructure automation, and operational excellence to maintain highly available systems while leading and coordinating responses to critical production incidents.

This role requires someone comfortable operating in high-availability cloud environments, managing large-scale distributed systems, and driving incident response, post-incident analysis, and reliability improvements.

In this role you will...

Site Reliability Engineering
- Maintain and improve system reliability, scalability, and performance for production environments.
- Implement Infrastructure as Code (IaC) using Terraform to manage and automate cloud infrastructure.
- Design, deploy, and operate Kubernetes clusters and containerized workloads.
- Build and maintain observability frameworks including monitoring, logging, and alerting.
- Automate operational tasks to reduce manual interventions and improve system resilience.

Incident Management
- Lead and coordinate Major Incident Management (MIM) during production outages.
- Act as Incident Commander or technical lead during high severity incidents.
- Facilitate incident triage, mitigation, communication, and resolution across engineering teams.
- Drive Root Cause Analysis (RCA) and ensure corrective and preventive actions are implemented.
- Develop and improve runbooks, playbooks, and operational procedures.

Platform & Cloud Operations
- Manage cloud infrastructure on platforms such as AWS, Azure, or GCP.
- Optimize cluster performance, scaling, and availability in Kubernetes environments.
- Implement high availability and disaster recovery strategies.
- Support CI/CD pipelines and deployment automation.

Reliability & Engineering Excellence
- Define and monitor SLIs, SLOs, and error budgets.
- Implement proactive reliability improvements and capacity planning.
- Collaborate with development teams to improve application resilience and observability.
- Advocate for DevOps and SRE best practices across engineering teams.

You've got what it takes if you have...
- 5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure.
- Strong experience with Terraform (Infrastructure as Code).
- Hands-on experience with Kubernetes (EKS, AKS, GKE, or self-managed clusters).
- Experience with Major Incident Management and production incident response.
- Strong knowledge of Linux systems and networking fundamentals.
- Experience with cloud platforms (AWS preferred).
- Familiarity with monitoring tools such as Prometheus, Grafana, Datadog, or ELK.
- Experience with CI/CD tools such as Jenkins, GitHub Actions, GitLab CI, or similar.
- Strong scripting skills in Python, Bash, or Go.

Preferred Qualifications
- Experience managing large-scale distributed systems in production.
- Experience implementing chaos engineering or resilience testing.
- Knowledge of security best practices in cloud-native environments.

Our Culture:
Spark Greatness. Shatter Boundaries. Share Success. Are you ready? Because here, right now, is where the future of work is happening. Where curious disruptors and change innovators like you are helping communities and customers enable everyone, anywhere, to learn, grow and advance. To be better tomorrow than they are today.

Who We Are:
Cornerstone powers the potential of organizations and their people to thrive in a changing world. Cornerstone Galaxy, the complete AI-powered workforce agility platform, meets organizations where they are. With Galaxy, organizations can identify skills gaps and development opportunities, retain and engage top talent, and provide multimodal learning experiences to meet the diverse needs of the modern workforce. More than 7,000 organizations and 100 million+ users in 180+ countries and in nearly 50 languages use Cornerstone Galaxy to build high-performing, future-ready organizations and people today.

Total Rewards:
At Cornerstone, we are dedicated to inspiring excellence and pushing boundaries in everything we do. Our compensation strategy is based on three fundamental principles: equitable pay, market-driven research, and skill-based appraisals. As part of our mission to share success and empower individuals to thrive in an ever-changing world, the listed salary range is just one element of Cornerstone’s comprehensive compensation package. This compensation package may also include annual bonuses, short- and program-specific awards depending on the role, and a comprehensive benefit offering. The disclosed salary range reflects the geographic differential based on the location of the position if applicable. The starting salary for the successful applicant will depend on several job-related factors, including education, training, experience, certifications, location, business needs, and market demands. This range is based on a full-time position and may be adjusted in the future.

Join us in shaping the future of work: tomorrow, together. Experience flexibility and empowerment in your career at Cornerstone.

The BASE salary range for this position is: 64600 - 103400 GBP.

Check us out on LinkedIn, Comparably, Glassdoor, and Facebook!