Specialist - Software Engineering (MX)

Remote Full-time
Job Title: Site Reliability Engineer (SRE)

Role Description

We are seeking an experienced Site Reliability Engineer (SRE) with strong DevOps and automation expertise to ensure the reliability, scalability, and performance of distributed systems. This role focuses on CI/CD automation, monitoring, observability, and system troubleshooting across cloud-native and Kubernetes-based environments. You will play a critical role in building and maintaining monitoring platforms, automating operational processes, and improving system reliability across multiple application domains.

Key Responsibilities

- Apply Site Reliability Engineering (SRE) and DevOps best practices to improve system availability, performance, and scalability.
- Design, build, and maintain CI/CD pipelines with a strong focus on automation.
- Implement and manage metrics collection, monitoring, and alerting across platforms.
- Perform system troubleshooting and problem-solving across infrastructure and application layers.
- Create, operate, and maintain Prometheus and Grafana clusters for monitoring Kubernetes environments.
- Implement and support observability standards, including OpenTelemetry.
- Develop and maintain automation tools and scripts using Python, Groovy, and Shell.
- Collaborate with engineering and platform teams to improve reliability, deployment processes, and operational efficiency.

Required Skills & Qualifications

- Hands-on experience in Site Reliability Engineering (SRE) and DevOps roles.
- Strong expertise in CI/CD pipelines, automation, and deployment strategies.
- Experience with metrics collection, monitoring, and alerting systems.
- Proven ability in system troubleshooting and root cause analysis across platforms and applications.
- Hands-on experience managing Prometheus and Grafana for Kubernetes cluster monitoring.
- Strong automation and scripting skills using Python, Shell scripting, and Groovy.
- Experience working with OpenTelemetry for distributed tracing and observability.
Key Skills

- SRE experience managing Google Cloud services and accounts.
- Strong Prometheus and Grafana querying and dashboarding skills.
- Observability and monitoring best practices.
- Automation-first mindset with strong scripting capabilities.
- Kubernetes monitoring and cloud-native operations experience.
