AI Researcher — Training Optimization

Remote Full-time
About the Role

We’re looking for an AI Researcher focused on training optimization to help us push the efficiency, stability, and scalability of large-scale model training. You’ll work at the intersection of research and systems, developing novel techniques to reduce training cost, accelerate convergence, and improve model quality—while validating ideas through rigorous experiments and publications.

This role is ideal for someone who enjoys turning research insights into practical training wins, and who has a track record (or strong ambition) of publishing applied ML research.

What You’ll Work On
• Design and evaluate training optimization techniques for large models (e.g. optimization algorithms, schedulers, normalization, curriculum strategies)
• Improve training efficiency and stability across long runs and large datasets
• Research and implement methods such as:
• Optimizer and scheduler innovations
• Mixed-precision, low-precision, and memory-efficient training
• Gradient noise reduction, scaling laws, and convergence analysis
• Training-time regularization and robustness techniques
• Run large-scale experiments, analyze results, and translate findings into actionable improvements
• Author or co-author research papers, technical reports, or blog posts
• Collaborate closely with infrastructure and inference teams to ensure training decisions translate to real-world performance

What We’re Looking For
• Strong background in machine learning research, with emphasis on training dynamics and optimization
• Experience training large neural networks (LLMs, multimodal models, or large sequence models)
• Publication experience in ML venues (e.g. NeurIPS, ICML, ICLR, ACL, EMNLP, COLM, arXiv) or equivalent high-quality open research
• Solid understanding of:
• Optimization theory and practice
• Backpropagation, gradient flow, and training stability
• Distributed and large-batch training
• Proficiency in Python and modern ML frameworks (PyTorch preferred)
• Ability to independently design experiments and reason from data

Nice to Have
• Experience with non-standard architectures (e.g. RNN variants, long-context models, hybrid systems)
• Experience optimizing training on GPUs at scale (FSDP, ZeRO, custom kernels)
• Contributions to open-source ML or research codebases
• Comfort operating in fast-moving, ambiguous startup environments

Why This Role
• Real influence over core model training decisions
• Freedom to pursue and publish novel research
• Direct access to large-scale experiments and real production constraints
• A small, senior team that values thinking deeply and shipping thoughtfully

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

**Experienced Full Stack Spanish Bilingual Remote Customer Service Representative – Web & Cloud Application Development**

Remote Full-time

[Remote] Client Experience Associate - Virginia

Remote Full-time

Experienced Assistant Teaching Faculty Position in Advertising, Public Relations, and Social Justice – Education and Community Leadership Opportunity

Remote Full-time

Bookkeeper (California tax laws)

Remote Full-time

Senior Projektleiter (w/m/d) Integrierte Projektabwicklung (IPA)

Remote Full-time

Website Merchandiser to optimize our digital storefront and streamline the product discovery journey

Remote Full-time

Experienced Licensed Pharmacy Technician for Data Entry and Pharmacy Operations – Remote Opportunity in Texas

Remote Full-time

**Experienced Part-Time Evening Remote Data Entry Specialist – Flexible Work Schedule and Career Growth Opportunities at arenaflex**

Remote Full-time

Training Specialist - Remote Career Transition

Remote Full-time

Consignment Analyst I

Remote Full-time
← Back to Home