Infra Lead

Remote Full-time
About us

Our mission is to reinvent the way people learn, starting with language. We begin by teaching the next billion people English, Spanish, and French.

English is the global language of business, culture, and communication, and over 1.5 billion people around the world are actively trying to learn right now. Others dream of communicating with the half-billion native Spanish speakers across the globe. The problem is that it's nearly impossible to learn to speak a language without constant access to a speaking partner. Grammar and vocab apps don't really help; you need to actually converse with someone.

Speak is on a journey to fix this. We're creating an AI-powered experience that replicates the flow of a conversation, without needing a human on the other end. The goal is to make it radically more accessible to be able to have conversations in a foreign language and eventually help hundreds of millions of people gain fluency who otherwise wouldn't be able to.

We started on this journey over five years ago and we've still got a long ways to go. We're thoughtfully adding new team members only when we think they can truly play a big role in our mission.

Speak launched first in South Korea where we have quickly grown to become the top grossing education app in the country. We have now delivered this winning product to more than 40 countries globally and are continuing to expand to more markets in the coming months. The company is well funded, and as of December 2024, we've reached a $1B valuation with our Series C round, through key partners like Accel, OpenAI, Founders Fund, Y Combinator, Khosla Ventures, Lachy Groom, Josh Buckley, and more. We’re a team of more than 90 based throughout San Francisco, Seoul, Tokyo, Taipei, and Ljubljana.

About this role

As an SRE Engineer, Lead at Speak, you’ll be the driving force behind the reliability and resilience of the systems that power our global language learning experience. You’ll lead efforts to scale our infrastructure, harden our platform, and ensure that our services are fast, available, and reliable for millions of users around the world.

You’ll work across our stack, from Kubernetes on GCP to our Node.js APIs, Postgres, and Redis, building robust infrastructure and operational tooling. You’ll own incident response, observability, and SLOs while embedding a culture of reliability throughout the engineering org.

Speak is growing rapidly, and we’re pushing our systems harder every day. This is a unique opportunity to shape the future of our platform as we scale to the next 10x of users.

What you’ll be doing

- Own the reliability of Speak’s infrastructure across GCP, Kubernetes, and our Node.js/Postgres stack
- Lead response for P0/P1 incidents, drive postmortems, and ensure we’re learning from every outage
- Improve observability, alerting, and on-call processes so we catch issues before users do
- Define and drive adoption of SLOs/SLAs for core systems and services
- Build tools and frameworks to make reliability easier for product engineers (think safer deploys and infrastructure automation)
- Collaborate cross-functionally with Product, Engineering, and ML teams to ensure reliability is baked into everything we build
- Set short-term and long-term roadmaps to ensure stability for our growing userbase
- Be a thought leader and coach around SRE principles: blameless culture, operational maturity, and continuous improvement

What we’re looking for

- 7+ years of experience in SRE, DevOps, or infrastructure-focused engineering roles, ideally with experience leading or mentoring others
- Strong experience with GCP, Kubernetes, Terraform, Node.js, Python, PostgreSQL, Redis, and observability tooling like Prometheus and Sentry
- Proven track record of improving reliability, scaling systems, and reducing incident frequency and severity with high traffic systems
- Strong incident management and root cause analysis skills; you know how to lead under pressure
- Experience building and maintaining CI/CD pipelines and deployment safety tooling
- Strong systems thinking, with the ability to identify failure points and proactively harden services
- Deep sense of ownership and a desire to make infrastructure a force multiplier for the rest of the org

Bonus

- Familiarity with cost optimization strategies in cloud-native environments
- Background in security, chaos engineering, or disaster recovery planning
- Contributions to internal tooling, automation, or developer productivity initiatives

Why work at Speak

- Join a fantastic, tight-knit team at the right time: we're growing very quickly, we've most recently raised our Series C from some of the top investors in the valley, and we've achieved product-market fit in our initial markets. You'd join at a magical time when a single person could significantly change the course of the company.
- Do your life's work with people you’ll love working with: we care strongly about our craft and want every person at Speak to feel like they're growing every day. We believe in the idea that working with people you both enjoy and have respect for makes everything better. We hire thoughtfully and only work with people we admire deeply.
- Global in nature: We're live in over 40 countries and launching in a number of new markets soon. We have dedicated offices in San Francisco, Ljubljana, Seoul, and Tokyo, and you’ll have the opportunity to talk to users in each of these regions on a regular basis as well as travel.
- Impact people's lives in a major way: Learning a language is one of the single most life-changing skills one can learn, and right now 99% of people never achieve their goal because the process is broken. We’re helping millions of people achieve their goals and improve their lives.

Speak does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

