[Remote] Research Intern (LLM)

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. 2077AI Open Source Foundation is looking for a Research & Evaluation Intern to help build advanced QA datasets and evaluate large language models. This role is ideal for students passionate about LLMs, evaluation science, and the intersection of research and applied data work. Responsibilities Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers Evaluate large language models on reasoning, factuality, and problem-solving benchmarks Develop review pipelines and quality-control criteria for expert-level question generation Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases Skills Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems 1+ years of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass) Excellent written and verbal English skills and analytical reasoning Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes Experience with formal methods, chain-of-thought evaluation, or curriculum generation Relevant publications in top conferences Company Overview The 2077AI Foundation, is at the forefront of AI data standardization and progression. It was founded in undefined, and is headquartered in Singapore, SG, with a workforce of 51-200 employees. Its website is
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Telehealth Veterinary Technician - Part Time

Remote Full-time

Senior Director, AI Server Platforms & Rack-scale Solutions Engineering

Remote Full-time

Senior Product Manager – Retail Media Platform (m//f/d)

Remote Full-time

Google Cloud Platform Big Data Engineer/Developer

Remote Full-time

[Remote] Senior/Principal Biostatistician FSP

Remote Full-time

**Experienced Staff Customer Success Manager, Enterprise (Italian Speaker) – Drive Strategic Partnerships and Deliver Exceptional Outcomes**

Remote Full-time

Work From Home Amazon Customer Care Representative - No Degree Needed

Remote Full-time

Experienced Remote Customer Service Representative – Chat Support and Home-Based Client Service Expert

Remote Full-time

A.I. Integration Specialist / Developer (Commission)

Remote Full-time

Experienced Phone and Data Entry Specialist for Remote Work Opportunity in Nevada – Supporting Essential Healthcare Workers with Exceptional Customer Service Skills

Remote Full-time
← Back to Home