SRE - Monitoring & Observability (M&O): W2 role

Remote Full-time
Job Description

SRE - Monitoring & Observability (M&O)
Remote. Candidates must be willing to travel to Knoxville, TN occasionally.

As a Senior Specialist in Monitoring & Observability, you will design, implement, and standardize enterprise-grade monitoring and alerting solutions across complex, cloud-based environments. This role sits at the intersection of Observability, SRE, and Incident Management, with a focus on ensuring systems are reliable, measurable, and proactively monitored. You'll collaborate with Cloud Operations, Architecture, and Platform Engineering teams to define best practices and build resilient, insight-driven infrastructure that supports business-critical services.

Your Impact
• Implement and standardize monitoring and alerting tools across multiple cloud platforms to ensure consistent observability practices.
• Architect observability solutions with Splunk, OpenTelemetry, AWS CloudWatch, GuardDuty, Wiz, and other modern monitoring stacks.
• Design and build incident response workflows, playbooks, and dashboards for actionable insights and faster recovery.
• Define and operationalize SLOs, SLIs, and error budgets to align with reliability goals.
• Integrate observability tools with ServiceNow ITOM and CMDB for automated incident management and asset tracking.
• Collaborate with Cloud Operations and Architecture teams to ensure observability is embedded in design, build, and run phases.
• Automate monitoring configurations and embed observability into CI/CD pipelines.
• Optimize performance and reliability through log analysis, metrics correlation, and distributed tracing.
• Drive initiatives to improve MTTR, incident detection, and proactive issue prevention.
• Provide technical leadership and mentorship, sharing best practices across engineering and operations teams.

Skills & Experience
• 5-10 years of experience in infrastructure engineering, with a significant focus on monitoring and observability.
• Proven expertise with observability platforms such as Splunk, OpenTelemetry, AWS CloudWatch, GuardDuty, and Wiz.
• Strong knowledge of logging, metrics, tracing, and open standards for observability.
• Experience designing and managing incident response workflows and escalation processes.
• Hands-on experience with ServiceNow ITOM and CMDB integrations.
• Proficiency in cloud-native monitoring (AWS, Azure, GCP) and container observability (Docker, Kubernetes).
• Familiarity with SRE principles: defining SLOs, SLIs, and error budgets.
• Knowledge of automation practices and Infrastructure as Code (Terraform, CloudFormation, ARM templates).
• Strong problem-solving skills with the ability to troubleshoot complex distributed systems.
• Excellent communication, presentation, and leadership skills.

Set Yourself Apart With
• Cloud certifications such as AWS DevOps Engineer, Azure DevOps Engineer Expert, or Google Professional Cloud DevOps Engineer.
• Experience in AIOps, predictive analytics, and security-driven observability.
• Exposure to chaos engineering or performance engineering practices.
• Experience in multi-cloud and hybrid environments with advanced observability patterns.

Apply to this job
