Research Engineer (Machine Learning)

Remote Full-time
About Aldea Aldea is a multi-modal foundational AI company reimagining the scaling laws of intelligence. We believe today's architectures create unnecessary bottlenecks for the evolution of software. Our mission is to build the next generation of foundational models that power a more expressive, contextual, and intelligent human–machine interface. The Role We are hiring a Research Engineer (Machine Learning) to build the infrastructure that powers Aldea's multi-modal AI research. You will design, optimize, and scale the training and inference systems that enable our research team to explore next-generation architectures across language, speech, and multi-modal domains. This is a high-leverage role where your work directly enables breakthrough research. You'll build production-grade systems supporting rapid experimentation at billion-parameter scale and real-time deployment of speech and language models. If you're passionate about building the systems that accelerate AI research, this role is for you. What You'll Do • Build and maintain distributed training infrastructure supporting researchers across language and speech domains at a billion-plus-parameter scale. • Optimize training and inference performance across the stack, delivering significant speedups through framework optimization, custom kernels, and system-level improvements. • Design experiment infrastructure including automated evaluation pipelines, experiment tracking, and monitoring systems that enable rapid iteration. • Scale infrastructure from single-node to multi-node distributed training and deploy production inference systems for real-time applications. • Support researchers with fast turnaround on infrastructure issues and maintain high reliability across all systems. • Collaborate with research scientists, data engineers, and leadership to define technical priorities and infrastructure roadmap. Minimum Qualifications • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience. • 3+ years of experience with PyTorch and distributed training frameworks (DDP, FSDP, DeepSpeed, or similar). • Experience training large-scale deep learning models at 1B+ parameters. • Deep understanding of training optimization techniques including mixed precision, gradient checkpointing, and memory management. • Proven ability to build production-grade ML infrastructure with high reliability. • Track record of delivering significant performance optimizations in ML training or inference systems. Preferred Qualifications • Experience with custom kernel development (CUDA, Triton) or GPU optimization. • Hands-on experience with large-scale pretraining (100B+ tokens, ideally trillion+ scale). • Experience optimizing inference for production: quantization, vLLM, TensorRT, or custom serving engines. • Familiarity with speech/audio ML systems and real-time inference constraints. • Experience building automated evaluation frameworks and experiment tracking systems. • Knowledge of profiling tools and multi-node training across 8-32+ GPUs. • Exposure to job orchestration systems (SLURM, Kubernetes, Ray). • Master's or PhD in Computer Science, Machine Learning, or related field. Compensation & Benefits • Competitive base salary • Performance-based bonus aligned with research and model milestones • Equity participation • Comprehensive health, dental, and vision coverage • Flexible paid time off Aldea is proud to be an equal-opportunity employer. We are committed to building a diverse and inclusive culture that celebrates authenticity to win as one. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, disability, protected veteran status, citizenship or immigration status, or any other legally protected characteristics. Aldea uses E-Verify to confirm employment eligibility in compliance with federal law. For more information please visit: Please note: We do not accept unsolicited resumes from recruiters or employment agencies and will not be responsible for any fees related to unsolicited resumes. Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Experienced Full Stack Customer Service Representative – Work From Home Opportunity with Comprehensive Benefits and Growth Potential at Blithequark

Remote Full-time

Hybrid Urgent Care Credentialed Vet Tech

Remote Full-time

Logistics Specialist

Remote Full-time

Data and Operations Analytics Manager

Remote Full-time

FP&A Analyst - Remote (MIDDLETOWN, PA, US, 17057)

Remote Full-time

Experienced Full Stack Chat Operator – Online Customer Engagement and Lead Generation for blithequark in Idaho Falls, ID

Remote Full-time

Medical Scribe (COH/West County) - Dermatology

Remote Full-time

Documentation Specialist II

Remote Full-time

Associate Analyst-US RM Enablement- Data Protection

Remote Full-time

Officer / Assistant Vice President, AML Quality Assurance Specialist

Remote Full-time
← Back to Home