[Remote] Research Staff, Voice AI Foundations

Remote Full-time
Note: This is a remote position open to candidates in the USA.

Deepgram is the leading voice AI platform for developers building advanced speech technologies. As a member of the Research Staff, you will pioneer the development of Latent Space Models to address challenges in voice AI, focusing on creating innovative neural audio codecs and generative models that enhance human-machine interaction.

Responsibilities
• Build next-generation neural audio codecs that achieve extreme low-bit-rate compression and high-fidelity reconstruction across a world-scale corpus of general audio.
• Pioneer steerable generative models that can synthesize the full diversity of human speech from the codec latent representation, from casual conversation to highly emotional expression to complex multi-speaker scenarios with environmental noise and overlapping speech.
• Develop embedding systems that cleanly factorize the codec latent space into interpretable dimensions of speaker, content, style, environment, and channel effects, enabling precise control over each aspect and the ability to massively amplify an existing seed dataset through 'latent recombination'.
• Leverage latent recombination to generate synthetic audio data at previously impossible scales, unlocking joint model and data scaling paradigms for audio.
• Endeavor to train multimodal speech-to-speech systems that can 1) understand any human irrespective of their demographics, state, or environment and 2) produce empathic, human-like responses that achieve conversational or task-oriented objectives.
• Design model architectures, training schemes, and inference algorithms that are adapted to hardware at the bare-metal level, enabling cost-efficient training on billion-hour datasets and powering real-time inference for hundreds of millions of concurrent conversations.
Skills
• Strong mathematical foundation in statistical learning theory, particularly in areas relevant to self-supervised and multimodal learning
• Deep expertise in foundation model architectures, with an understanding of how to scale training across multiple modalities
• Proven ability to bridge theory and practice: someone who can both derive novel mathematical formulations and implement them efficiently
• Demonstrated ability to build data pipelines that can process and curate massive datasets while maintaining quality and diversity
• Track record of designing controlled experiments that isolate the impact of architectural innovations and validate theoretical insights
• Experience optimizing models for real-world deployment, including knowledge of hardware constraints and efficiency techniques
• History of open-source contributions or research publications that have advanced the state of the art in speech/language AI

Company Overview
• Deepgram specializes in AI-powered speech-to-text technology and also offers audio intelligence, text-to-speech, and voice agent APIs. It was founded in 2015 and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees.

Company H1B Sponsorship
• Deepgram has a track record of offering H1B sponsorships, with 2 in 2025, 1 in 2024, and 1 in 2022. Please note that this does not guarantee sponsorship for this specific role.

Apply to this job
