Applied AI Researcher, Post-Training

Remote Full-time
About Distyl AI Distyl AI develops production-grade AI systems to power core operational workflows for Fortune 500 companies. Powered by a strategic partnership with OpenAI, in-house software accelerators, and deep enterprise AI expertise, we deliver working AI systems with rapid time to value – within a quarter. Our products have helped Fortune 500 customers across diverse industries, from insurance and CPG to non-profits. As part of our team, you will help companies identify, build, and realize value from their GenAI investments, often for the first time. We are customer-centric, working backward from the customer’s problem and holding ourselves accountable for creating both financial impact and improving the lives of end-users. Distyl is led by proven leaders from top companies like Palantir and Apple and is backed by Lightspeed, Khosla, Coatue, Dell Technologies Capital, Nat Friedman (Former CEO of GitHub), Brad Gerstner (Founder and CEO of Altimeter), and board members of over a dozen Fortune 500 companies. What We Are Looking For At Distyl we’re pushing the envelope of AI utilization in enterprise. This requires creative researchers who don’t just want to drive incremental improvements on benchmarks or optimize an existing process but instead are looking to creatively redefine how software is used. Our researchers come from many academic backgrounds but have strong research track records, operate in an AI-native way, and would be bored staying on the rails of a traditional research org. Key Responsibilities • The Post-Training team focuses on adapting foundation models to real-world performance and alignment requirements. Researchers develop and evaluate techniques such as supervised fine-tuning, preference optimization (DPO, RLHF, RLAIF), and continual adaptation to align models with Distyl’s enterprise systems. The goal is to bridge raw model capability with trustworthy, contextually aligned system behavior • Researchers in Post-Training investigate new methods for aligning large models with human and system-level objectives. They explore trade-offs between generalization and specialization, data efficiency and robustness, capability and controllability. Their work informs how Distyl leverages foundation models safely, effectively, and at scale across industries What We Require • Deep Understanding of Post-training Techniques: Familiarity with supervised fine-tuning, preference optimization (RLHF/DPO), LoRA/PEFT, and instruction-tuning pipelines. • Experience Adapting Frontier Models: You’ve tuned or adapted LLMs/SLMs to specialized domains or behaviors through data curation, reward modeling, or continual pretraining. • Experience Building with Models, Not Just Building Models: We develop intelligent systems using models rather than training or fine-tuning them. Ideal candidates have expertise in compound AI systems, agentic collaboration, and associated techniques (ensembling, ReAct, graph-of-thoughts, etc.). • Proven Track Record of Research Results: Whether you’ve published in top journals, posted amazing work on twitter, or somewhere else we want to see what you've done. • Uses AI Every Day: Before you can revolutionize someone else’s workflow, you need to revolutionize yours. You should be using tools like ChatGPT, Cursor, and Perplexity to accelerate your workflow. • Strong Programming and Data Analysis Skills: While you might not consider yourself a software engineer you need to be able to build prototypes of your ideas and then perform the experiments to prove the effectiveness to a F500 Head of AI. • Biases Towards Showing vs Telling: Our customers want to see the power of AI today vs discuss the most elegant idea that will take 5 years to realize. What We Offer • The base salary range for this role is $130K – $250K, depending on experience, location, and level. In addition to base compensation, this role is eligible for meaningful equity, along with a comprehensive benefits package • 100% covered medical, dental, and vision for employees and dependents • 401(k) with additional perks (e.g., commuter benefits, in‑office lunch) • Access to state‑of‑the‑art models, generous usage of modern AI tools, and real‑world business problems • Ownership of high‑impact projects across top enterprises • A mission‑driven, fast‑moving culture that prizes curiosity, pragmatism, and excellence Distyl has offices in San Francisco and New York. This role follows a hybrid collaboration model with 3+ days per week (Tuesday–Thursday) in‑office. Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Data Entry Career Opportunity at Delta Airlines - Join Our Team as a Detail-Oriented Data Entry Specialist

Remote Full-time

[Remote] MedTech Credentialing and Scheduling Coordinator

Remote Full-time

Online English Teacher at First Leap

Remote Full-time

**Experienced Customer Service Representative – Virtual Support Agent – Work from Home Opportunity**

Remote Full-time

Amazon Delivery Driver

Remote Full-time

Care Manager - Registered Nurse, PRN

Remote Full-time

**Experienced Chat Support Specialist – Delivering Exceptional Customer Experiences in a Dynamic Remote Environment**

Remote Full-time

PATIENT COORDINATOR (REMOTE/NON-CLINICAL) ACCESSNURSE 1/13/25

Remote Full-time

Sr Analyst, IKC Program Management

Remote Full-time

Experienced STEAM and Sports Instructor for After-School Programs - Remote Opportunity with KidzToPros

Remote Full-time
← Back to Home