Senior Site Reliability Engineer - Networking

Remote Full-time
In 2012, Lambda started with a crew of AI engineers publishing research at top machine-learning conferences. We began as an AI company built by AI engineers. That hasn't changed. Today, we're on a mission to be the world's top AI computing platform. We equip engineers with the tools to deploy AI that is fast, secure, affordable, and built to scale. Whether they need powerhouse GPU hardware on-site or the flexibility of cloud-based solutions, we've got the horsepower to make it happen. Lambda’s AI Cloud has been adopted by the world’s leading companies and research institutions including Anyscale, Rakuten, The AI Institute, and multiple enterprises with over a trillion dollars of market capitalization. Our goal is to make computation as effortless and ubiquitous as electricity.
If you'd like to build the world's best deep learning cloud, join us.
What You'll Do


Help scale Lambda’s high performance cloud network


Contribute to the reproducible automation of network configuration and deployments


Contribute to the implementation and operations of Software Defined Networks


Help to deploy and manage Spine and Leaf networks


Ensure high availability of our network through observability, failover, and redundancy


Ensure clients have predictable networking performance through the use of network engineering and other applicable technologies


Help with deploying and maintaining network monitoring and management tools


You


Have 5+ years of experience being SWE, SRE or Network Reliability Engineering


Been part of the implementation of production-scale networking projects


Experience being on-call and incident response management


Have experience building and maintaining Software Defined Networks (SDN), experience with OpenStack, Neutron, OVN


Are comfortable on the Linux command line, and have an understanding of the Linux networking stack


Have experience with multi-data center networks and hybrid cloud networks


Have Python programming experience and configuration management tools like Ansible


Have experience with CI/CD tools for deployment and GIT. Operated network environment with GitOps practices in place.


Experience with application lifecycle and deployments on Kubernetes


Nice To Have


Operated production-scale SDNs in a cloud context (e.g. helped implement or operate the infrastructure that powers an AWS VPC-like feature)


Have Software development experience with C, GO, Python


Experience automating network configuration within public clouds, with tools like kubentetes, HELM, Terraform, Ansible


Deep understanding of the Linux networking stack and its interaction with network virtualization, SR-IOV and DPDK


Understanding of the SDN ecosystem (e.g. OVS, Neutron, VMware NSX, Cisco ACI or Nexus Fabric Controller, Arista CVP)


Have experience with Spine and Leaf (Clos) network topology


Have experience and understanding of BGP EVPN VXLAN networks


Experience with building and maintaining multi-data center networks, SD-WAN, DWDM


Experience with Next-Generation Firewalls (NGFW)


Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda


Founded in 2012, ~350 employees (2024) and growing fast


We offer generous cash & equity compensation


Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.


We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability


Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG


Health, dental, and vision coverage for you and your dependents


Commuter/Work from home stipends for select roles


401k Plan with 2% company match (USA employees)


Flexible Paid Time Off Plan that we all actually use


A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.


Apply Now
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

AI Engineer, Integrations - Bellevue

Remote Full-time

State Policy Internship (Summer)

Remote Full-time

Experian, Data Science Summer Intern (Remote & Paid) - Application via WayUp

Remote Full-time

Strategic Finance Manager

Remote Full-time

[Fully Remote] Amazon Customer Service - Work F...

Remote Full-time

Customer Complaint Resolution Agent

Remote Full-time

**Experienced Full Stack Customer Support Specialist – Global Apple Product Support**

Remote Full-time

Jr. Data Entry Operator / Part Time (Remote)

Remote Full-time

Experienced Remote Bookkeeper – Financial Management and Accounting Expertise for Dynamic Business Growth

Remote Full-time

AI Developer

Remote Full-time
← Back to Home