Software Engineer, Inference AI/ML

Remote Full-time
CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost. Responsibilities Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve) Write tests, code comments, and short design docs; participate in code reviews Add basic metrics and dashboards; assist with alarms and runbooks Follow on-call runbooks and learn incident response in a guided rotation Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance Skills BS/MS in CS, EE, or related field, or equivalent practical experience Foundations in data structures, algorithms, and networked services Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics Exposure to containers and Kubernetes (coursework or projects welcome) Curiosity about GPU inference concepts (micro-batching, KV cache, streaming) Internship or project that deployed a microservice or ML inference demo Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling Benefits Medical, dental, and vision insurance - 100% paid for by CoreWeave Company-paid Life Insurance Voluntary supplemental life insurance Short and long-term disability insurance Flexible Spending Account Health Savings Account Tuition Reimbursement Ability to Participate in Employee Stock Purchase Program (ESPP) Mental Wellness Benefits through Spring Health Family-Forming support provided by Carrot Paid Parental Leave Flexible, full-service childcare support with Kinside 401(k) with a generous employer match Flexible PTO Catered lunch each day in our office and data center locations A casual work environment A work culture focused on innovative disruption Company Overview CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017, and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Product Designer

Remote Full-time

Experienced Customer Service Representative for Advanced Care Team - Providing Exceptional Support and Retention Services in a Dynamic and Fast-Paced Environment

Remote Full-time

Part-Time Data Entry Clerk - Remote Opportunity with blithequark

Remote Full-time

SR. PROJECT MANAGER-DESIGN & CONSTRUCTION (REMOTE-NY, VA, TX)

Remote Full-time

Remote - ServiceNow Architect (CSM & FSO) 24-10492

Remote Full-time

Business Development Manager, Amazon Pharmacy

Remote Full-time

Circuit: Product Designer – Remote

Remote Full-time

Experienced Healthcare Customer Service Representative – Remote Opportunity for Compassionate and Tech-Savvy Individuals to Deliver Exceptional Patient Support

Remote Full-time

Lead Data Engineer (Remote Work Eligible)

Remote Full-time

Experienced Part-Time Data Entry Associate – Remote Work Opportunity with blithequark for Flexible Scheduling and Career Growth

Remote Full-time
← Back to Home