Engineering Manager, Fleet Clusters

Remote Full-time
About the TeamOur team runs the GPU fleet that serves the models backing ChatGPT and the API. We build automation to provision and manage one of the largest cutting edge GPU inference fleets in the world, exposing it as a singular platform for other OpenAI teams to seamlessly run production applied AI workloads. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.About the RoleWe are looking for an experienced engineering manager to help lead our Fleet Clusters team. You’ll be responsible for building, scaling, and operating the massive GPU fleet clusters that power AI inference and general purpose training at OpenAI. This role focuses on designing and managing large-scale, high-availability GPU clusters across multiple environments, ensuring reliability, scalability, and efficiency. You will partner closely with product, research, and infrastructure teams to rapidly ship and support advanced AI products at global scale.In this role, you will:Manage and build a team of high performing infrastructure engineersGuide the roadmap for automation for a fleet that can grow an order of magnitude in size or moreBuild a world-class, secure compute fleet that serves users at scaleSet technical direction on evolving our compute and abstractions to support a growing businessCollaborate closely with a broad set of stakeholders, including product engineering, inference, security, research and financeWork with external partners to unlock bleeding edge compute and making it available as a turnkey resource for scheduling workloadsCoach and nurture engineers to accelerate their growth and learningYou might thrive in this role if you:10+ years of experience in infrastructure software engineering, including 5+ years in engineering management.Proven track record of building high-performance computing infrastructure teams at scale.Hands-on experience provisioning bare-metal server data centers interconnected across WANs.Experience designing and operating hybrid-cloud platforms.Ownership mentality: willing to pick up new skills and knowledge to solve problems end-to-end. Comfortable being hands-on when needed to help debug systems and support the team.Ability to operate effectively in fast-paced environments with loosely defined priorities and competing deadlines..About OpenAIOpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.OpenAI Global Applicant Privacy PolicyAt OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Apply Now
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Senior Software Engineer - Data Infrastructure, Quora (Remote)

Remote Full-time

Associate Consultant - 2026 Bootcamp

Remote Full-time

Client Partner (Salesforce Consulting Partner) - 100% Remote

Remote Full-time

UI Solution Engineer 1 (Mekari Officeless)

Remote Full-time

Pre-Licensing Training Agent - Remote

Remote Full-time

Room Attendant/Housekeeping - Able to Work Weekends

Remote Full-time

Certified Financial Planner ®

Remote Full-time

Discharge Coordinator Consultant (Freelance, $500 per Consultation)

Remote Full-time

Customer Enhancement Account Manager

Remote Full-time

Urgently Hiring: Recruiter (USA), Remote Job

Remote Full-time
← Back to Home