[Remote] Student Researcher [Seed Vision – Multimodal Video Generation] – 2026 Start (PhD)

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. ByteDance is a pioneering company dedicated to advanced AI foundation models. The role involves conducting research on multimodal video generation and collaborating with researchers and engineers to enhance generative models for visual content. Responsibilities Conduct research on multimodal video generation, with a focus on improving semantic alignment between inputs and generated content Integrate vision-language models (e.g., CLIP, pre/post-trained VLMs) into video generation architectures to enhance input understanding Explore and implement joint training or fine-tuning approaches that couple VLMs with video generation backbones Evaluate model performance on tasks requiring high-level reasoning or detailed semantic control over generation Collaborate with researchers and engineers to iterate on prototypes within an existing infrastructure Skills Currently pursuing a PhD in Computer Vision, Machine Learning, or a related field Research experience in one or more of the following areas: Vision-language models (VLMs); Multimodal or joint model training; Video generation Solid coding ability and clean research implementation style, and expected to work with a production-grade codebase (e.g., PyTorch) Demonstrated research ability, with first-author publications in top-tier ML/CV/AI conferences such as CVPR, ICCV, ECCV, and ICLR Experience in training or fine-tuning autoregressive or diffusion-based video generation models Background in multimodal instruction-following, alignment, or conditioning for generation tasks Understanding of evaluation techniques for assessing semantic consistency in generated video Benefits Health insurance Life insurance Wellbeing benefits 10 paid holidays per year Paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year) Housing allowance Company Overview ByteDance is a technology company that develops content creation platforms and services. It was founded in 2012, and is headquartered in Beijing, Beijing, CHN, with a workforce of 10001+ employees. Its website is Company H1B Sponsorship ByteDance has a track record of offering H1B sponsorships, with 1350 in 2025, 1123 in 2024, 775 in 2023, 487 in 2022, 417 in 2021, 245 in 2020. Please note that this does not guarantee sponsorship for this specific role.
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

**Experienced Remote Customer Handling Assistant – Delivering Exceptional Customer Service in a Dynamic Work-from-Home Environment**

Remote Full-time

**Experienced Data Entry Specialist – Unlock Flexibility and Growth Opportunities at arenaflex**

Remote Full-time

Clinical Pharmacist

Remote Full-time

Regulatory Compliance Attorney – Professional Licensing

Remote Full-time

Remote Bilingual (English/Spanish) Physical Therapy Receptionist

Remote Full-time

SDET

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Remote Work Opportunity with arenaflex**

Remote Full-time

Experienced Automation Solutions Software Engineer – Netflix Remote Job Opportunity ($27-$40/Hour) - Digital Content Protection (DCP)

Remote Full-time

UX Designer - UX Design, Figma, Site Migration (LC9-18301471)

Remote Full-time

Executive Director, GMP Quality Assurance

Remote Full-time
← Back to Home