[Hiring] Staff Applied Researcher, AI Quality @GitHub, Inc.

Remote Full-time

Role Description

At GitHub, we’re building the next generation of AI‑powered developer experiences. We’re looking for a Staff Applied Researcher with deep expertise in Large Language Model (LLM) evaluation and LLM agents, strong engineering instincts, and a bias for action to help shape the future of GitHub Copilot and our AI platform. This is a high‑impact role where you will design evaluation systems that directly influence how millions of developers experience AI every day.

Responsibilities
• Lead Model Quality & Evaluation
• Design next‑generation evaluation frameworks for code generation, reasoning, safety, multimodal tasks, and agentic workflows.
• Develop scalable automatic metrics, LLM‑judge systems, reward models, and human‑in‑the‑loop evaluation pipelines.
• Establish high‑signal, repeatable methodologies that influence product decisions across GitHub AI.
• Drive Applied Research & Engineering
• Build and optimize evaluation tooling, datasets, benchmarking systems, and experimentation pipelines.
• Create and onboard new benchmarks covering the hardest tasks for coding agents.
• Collaborate closely with engineering teams to productionize research, validate improvements, and accelerate model iteration cycles.
• Own end‑to‑end quality insights for the models behind GitHub Copilot and new AI features.
• Work closely with product development, engineering, and design teams to integrate advanced research findings into practical applications, ensuring alignment with product goals and user needs.
• Influence, Mentor & Lead
• Shape GitHub’s strategy for model quality, alignment, and evaluation.
• Mentor other researchers and engineers, helping elevate technical standards across the organization.
• Drive clarity in ambiguous problem spaces and champion fast, high‑quality execution.

Qualifications
• Required Qualifications
• Bachelor's degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 8+ years' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field,
• OR master's degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 6+ years' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field,
• OR doctorate in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 4+ years' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field,
• OR equivalent experience.
• 3+ years of engineering experience in Python/TypeScript and experience building production-grade evaluation or data/ML pipelines at scale.
• Proven track record shipping research or evaluation systems in production environments.
• Strong cross‑functional communication and influence skills.
• Preferred Qualifications
• Experience with LLM judge systems, reward modeling, alignment, or safety evaluations.
• Background in code generation, developer tools, or AI‑assisted programming.
• Experience with large‑scale experimentation and online/offline evaluation strategies.
• Open‑source contributions or experience working with developer communities.
• Experience designing and leading complex research projects from ideation to implementation.
• Ability to define and articulate data-driven strategies that consider cross-functional impacts and align with organizational priorities, particularly in a software development platform context.

Compensation
• The base salary range for this job is USD $140,400.00 - USD $372,300.00 per year.
• These pay ranges are intended to cover roles based across the United States. An individual's base pay depends on various factors, including geographical location and a review of the applicant's experience, knowledge, skills, and abilities.
• At GitHub, certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role.
• In addition, certain roles also have the opportunity to earn sales incentives based on revenue or utilization, depending on the terms of the plan and the employee's role.

Benefits
• Competitive pay
• Generous learning and growth opportunities
• Excellent benefits to support you, wherever you are
