VP of Product, Research and Training Infrastructure

Remote Full-time
About the position

As CoreWeave continues to solidify its position as the Essential Cloud for AI, we are seeking a visionary VP of Research Training Infrastructure. This executive leader will own the product strategy and engineering execution for the services that power the most ambitious AI research labs in the world. You will bridge the gap between "the metal" and the researcher, delivering a seamless, high-performance environment where frontier models are born.
The Role: Architect of the AI Factory
You will lead the product strategy of our Research Training Stack, focusing on the specialized orchestration, evaluation, and iteration tools required for massive-scale pre-training and post-training. This is a mission-critical role at the intersection of high-performance computing (HPC) and cloud-native agility.
In 2026, CoreWeave is the foundation of the largest infrastructure buildout in human history. We are building AI Factories, not just data centers.

Responsibilities
• Frontier Orchestration: Oversee the evolution of SUNK (Slurm on Kubernetes) to provide researchers with deterministic, bare-metal performance through a cloud-native interface.
• Holistic Training Services: Beyond Slurm, drive the development of next-generation orchestrators and automated training-based evaluation frameworks that ensure model quality throughout the lifecycle.
• Post-Training Excellence: Build the infrastructure required for sophisticated Reinforcement Learning (RL) and RLHF pipelines, enabling labs to refine foundation models with maximum efficiency.
• Customer Advocacy: Act as the primary technical partner for lead researchers at global AI labs, translating their "future-state" requirements into actionable product roadmaps.

Requirements
• Proven Leadership: 15+ years of experience in engineering leadership, with at least 5+ years managing large-scale infrastructure at a top-tier research lab or an AI-native cloud provider.
• Domain Expertise: Deep, hands-on knowledge of Slurm, Kubernetes, and the specific networking requirements (InfiniBand/RDMA) for distributed training clusters.
• Research Mindset: You likely come from a background supporting frontier model research (pre-training and post-training) and understand the "pain points" of a research scientist.
• Scaling Experience: A track record of delivering mission-critical services on multi-thousand GPU clusters (H100/Blackwell/Rubin architectures).
• Strategic Vision: Ability to define "what’s next" in the AI stack, from automated RL loops to specialized sandbox environments.

Benefits
• Medical, dental, and vision insurance - 100% paid for by CoreWeave
• Company-paid Life Insurance
• Voluntary supplemental life insurance
• Short and long-term disability insurance
• Flexible Spending Account
• Health Savings Account
• Tuition Reimbursement
• Ability to Participate in Employee Stock Purchase Program (ESPP)
• Mental Wellness Benefits through Spring Health
• Family-Forming support provided by Carrot
• Paid Parental Leave
• Flexible, full-service childcare support with Kinside
• 401(k) with a generous employer match
• Flexible PTO
• Catered lunch each day in our office and data center locations
• A casual work environment
• A work culture focused on innovative disruption

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

[Remote] Senior Solutions Architect - Emerging Technologies (AI, GenAI, ML)

Remote Full-time

Employment and Compliance Counsel

Remote Full-time

Marketing Operations Manager (LATAM)

Remote Full-time

**Experienced Data Entry Customer Service Representative – Work from Home Opportunity with arenaflex**

Remote Full-time

Academic Advisor I or Academic Advisor II

Remote Full-time

Claims Processor I - (Remote)

Remote Full-time

Senior Tax Accountant (with Blockchain Experience) Part Time, Equity based

Remote Full-time

Manager, Marketing Analytics

Remote Full-time

Data Analyst

Remote Full-time

**Experienced Pharmacy Customer Service Associate – Delivering Exceptional Patient Care in New Bedford, MA at arenaflex**

Remote Full-time
← Back to Home