AI Resident - Learning From Videos (LFV)
Toyota Research Institute (TRI) is on a mission to improve the quality of human life through advanced research in AI and robotics. TRI is seeking an AI Resident to join the Learning From Videos (LFV) team, focusing on developing foundation models for multi-modal data to enhance task performance in embodied AI applications.

Responsibilities
- Develop, integrate, and deploy algorithms for multi-modal and 4D reasoning targeting physical applications
- Handle the ingestion of large-scale datasets for training, including streaming, online, and continual learning
- Contribute innovative solutions at the intersection of machine learning, computer vision, and robotics to improve real-world task performance
- Work closely with robotics and machine learning researchers and engineers to understand theoretical and practical needs
- Follow best practices for producing maintainable code, both for internal use and for open-sourcing to the scientific community
- Contribute to research publications and technical reports

Skills
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, Robotics, or a related technical field
- Exceptional candidates with equivalent research experience (e.g., strong publication record, open-source contributions, or industry research experience) are encouraged to apply
- Strong background in computer vision and its applications to robotics and embodied systems
- Demonstrated research experience through publications, technical projects, or open-source contributions
- Strong communication skills and a collaborative mindset, with the ability to learn quickly and contribute to team research efforts
- Passionate about assisting and amplifying older adults and those in need through dexterous manipulation, human-robot collaboration, and physical assistance innovation
- Spatio-temporal (4D) computer vision, including multi-view geometry, 3D/4D reconstruction, video generation, self-supervised learning, occlusion reasoning, etc.
- Large-scale training of multi-modal deep learning methods, in terms of both dataset size and model complexity, including context length extension, efficient attention, distributed computing, etc.
- Application of machine learning and computer vision to embodied applications

Benefits
- Medical
- Dental
- Vision insurance
- Paid time off
- Holiday pay
- Sick time

Company Overview
Toyota Research Institute is an R&D enterprise with an initial focus on artificial intelligence and robotics. It was founded in 2016 and is headquartered in Palo Alto, California, USA, with a workforce of 201-500 employees. Its website is http://www.tri.global.

Company H1B Sponsorship
Toyota Research Institute has a track record of offering H1B sponsorships, with 10 in 2025 and 7 in 2020. Please note that this does not guarantee sponsorship for this specific role.