Sumo Logic observability Engineer - Nityo Infotech Corporation

Remote Full-time
Role : Lead Observability Engineer Sumo Logic & SRE Location : Remote We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will design and implement scalable dashboards, alerts, and tracing strategies, drive service-level reliability, and enable a steady-state SRE operations model. Key Responsibilities: Lead the end-to-end implementation of Sumo Logic observability platform for AWS and EKS environments. Migrate monitoring and alerting assets from Dynatrace to Sumo Logic. Define and implement SLIs/SLOs, error budgets, and reliability metrics for containerized services. Deploy and configure Sumo Logic collectors across AWS and Kubernetes workloads (EKS). Configure log, metric, and trace ingestion pipelines using OpenTelemetry and Sumo Logic apps. Design and maintain dashboards for service health, performance, and reliability insights. Implement intelligent alerting and notification workflows, using thresholds, baselines, and anomaly detection. Collaborate with DevOps, SRE, and development teams to ensure complete tracing coverage across services. Ensure best practices for alert noise reduction, escalation policies, and incident response are in place. Contribute to observability runbooks, operational handover, and training for the client SRE team. Focus Areas on Sumo Logic: Strong knowledge of the new UI navigation. Proven expertise in building and optimizing queries. Advanced troubleshooting skills. The ability to go beyond task execution and provide proactive recommendations to improve our setup and overall efficiency. Required Skills & Qualifications: Expert-level experience with Sumo Logic, including dashboarding, alerting, collector deployment, and ML features. Strong background in Site Reliability Engineering (SRE), including SLIs/SLOs, error budgets, MTTR/MTTD metrics. Proficiency in AWS services (especially CloudWatch, CloudTrail, Lambda, RDS) and EKS (Amazon Kubernetes Service). Hands-on experience with OpenTelemetry for distributed tracing and service maps. Strong understanding of Kubernetes metrics, pod health, container resource usage, and cluster monitoring. Proven ability to define alert thresholds, configure notification routing (e.g. Slack, PagerDuty, ServiceNow), and manage alert fatigue. Strong scripting experience with tools like Terraform, Helm, YAML, and GitOps workflows. Experience with incident triage, RCA documentation, and building operational maturity in observability teams. Excellent communication and stakeholder engagement skills. Preferred Qualifications: Sumo Logic certifications (Admin, Advanced Analytics) are a plus. Experience with Dynatrace (for migration purposes). Familiarity with integrating observability into CI/CD pipelines. Exposure to service mesh (Istio/Linkerd) and monitoring microservices in that context. Deliverables This Role Will Drive: Sumo Logic observability reference architecture EKS and AWS observability configuration SLI/SLO documentation and tracking Alerting and tracing setup across services Production-ready dashboards and runbooks Knowledge transfer and enablement sessions for SRE/DevOps teams Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Medical Record/Data Entry Clerk/Project Support

Remote Full-time

Threat Intelligence Analyst, Quantitative Methods

Remote Full-time

Underwriting Assistant Manager - Commercial Lines

Remote Full-time

Experienced Data Entry Specialist – Remote Work Opportunity with Comprehensive Training and Development at arenaflex

Remote Full-time

**Experienced Customer Service Representative/Data Entry Specialist – Remote Opportunity with arenaflex**

Remote Full-time

Experienced Customer Service Representative – Full or Part Time Remote Opportunity for Career Growth and Development

Remote Full-time

**Experienced Telecommute Clerk: Online Data Entry & Typing Support Specialist – Flexible Full-Time or Part-Time Opportunity**

Remote Full-time

Head of Agentic AI Consultancy

Remote Full-time

Experienced Full Stack Software Engineer – Big Data, Cloud Application Development, and Technical Leadership at arenaflex

Remote Full-time

Utilization Management Administration Coordinator (SNF)

Remote Full-time
← Back to Home