Senior Deep Learning Engineer – Autonomous Vehicles

Remote Full-time
Job Description: • Crafting, scaling, and hardening deep learning infrastructure libraries and frameworks for training on multi-thousand GPU clusters. • Improving efficiency throughout the training stack: data loaders, distributed training, scheduling, and performance monitoring. • Building robust training pipelines and libraries to handle massive video datasets and enable rapid experimentation. • Collaborating with researchers, model engineers, and internal platform teams to enhance efficiency, minimize stalls, and improve training availability. • Owning core infrastructure components such as orchestration libraries, distributed training frameworks, and fault-resilient training systems. • Partnering with leadership to ensure infrastructure scales with growing GPU capacity and dataset size while maintaining developer efficiency and stability. Requirements: • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, or a related field, or equivalent experience. • 12+ years of professional experience building and scaling high-performance distributed systems, ideally in ML, HPC, or large-scale data infrastructure. • Extensive knowledge in deep learning frameworks (PyTorch is preferred), large scale training (DDP/FSDP, NCCL, tensor/pipeline parallelism), and performance profiling. • Strong systems background: datacenter networking (RoCE, IB), parallel filesystems (Lustre), storage systems, schedulers (Slurm, Kubernetes, etc.). • Proficiency in Python and C++, with experience writing production-grade libraries, orchestration layers, and automation tools. • Ability to work closely with multi-functional teams (ML researchers, infra engineers, product leads) and translate requirements into robust systems. Benefits: • equity • benefits Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Senior Software Engineer

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Homebased Data Management and Analysis**

Remote Full-time

Experienced Remote Data Entry Specialist – Full Time/Part Time Opportunities with Competitive Pay and Benefits at arenaflex

Remote Full-time

Experienced Customer Service Representative for Medicaid, SNAP, and TANF Application Support – Fully Remote Opportunity in Virginia

Remote Full-time

**Experienced Live Chat Support Specialist – Customer Service Representative for blithequark**

Remote Full-time

Experienced Remote Data Entry Specialist – Accurate and Efficient Data Management Professional

Remote Full-time

Principal Engineer: AI/ML Innovation- REMOTE

Remote Full-time

**Experienced Part-Time Data Entry Specialist – Remote Opportunity with arenaflex**

Remote Full-time

[Sign On Bonus] HYBRID Field Based Behavioral Health Specialist - PHILADELPHIA

Remote Full-time

Experienced Remote Chat Online Greeter – Delivering Exceptional Customer Support through Live Chat for a Leading Online Retailer at blithequark

Remote Full-time
← Back to Home