LLM Performance Researcher

Remote Full-time

Full-time • San FranciscoAt Endeavor, we’re rebuilding ERP from first principles for $1B+ manufacturing and distribution companies. These companies run on PDFs, spreadsheets, and semi-structured chaos — and we’re building LLM-powered systems to parse, match, and reason through all of it with human-level reliability.We’re looking for a researcher with deep experience in LLM performance on document tasks — especially extraction, entity linking, and record matching. You’ve likely published papers on it. You’ve probably run head-to-head evals on OpenAI, Claude, and open-source models. You’re fluent in both academic benchmarks and in the weird, grimy failure modes that only show up in production.Your work will directly improve the core performance of our agentic ERP. You’ll prototype new techniques, run structured evals, improve few-shot + tool-augmented performance, and help shape how LLMs interface with structured business systems.What You’ll DoDesign and run experiments to improve extraction, normalization, and matching across real-world documentsEvaluate LLM performance on noisy, multi-format inputs like scanned PDFs, OCR output, and Excel sheetsImprove model accuracy and reliability in the face of rare formats, abbreviations, bad formatting, and domain-specific vocabBuild and own our eval infrastructure for matching, linking, extraction, and schema alignment tasksWork with the Applied AI Researcher and Backend Engineers to deploy improvements into productionContribute to long-term strategy around fine-tuning, retrieval augmentation, tool use, or structured memory (if and when needed)You Might Be a Fit If YouHave deep experience with document understanding and information extraction using LLMsHave worked on schema alignment, record linking, or entity resolution at scaleHave published papers on LLM performance (e.g. extraction, evals, few-shot prompting, matching)Understand both academic benchmarks and real-world weirdnessKnow how to make evals meaningful, tight, and fast to iterate onWant to work in a setting where research turns into production code fastHave a PhD or equivalent research background in NLP, ML, or similar (but we care more about what you’ve done than what your title says)Bonus PointsExperience with post-OCR workflows or noisy doc normalizationDeep intuition for failure modes in enterprise-scale matching/linking systemsObsession with eval quality and reproducibilityComfort implementing papers and benchmarking models at scalePast work in procurement, invoicing, logistics, or any doc-heavy vertical

Apply Now

Apply Now

LLM Performance Researcher

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Sr. Account Executive - Influencer Marketing - Full-time / Part-time

Looking for Title I Tutor - Part-time in Idaho

FT Data Entry Clerk (Driver Enrollment) - Work From Home

Experienced Part-Time Remote Data Entry Clerk – Flexible Work Schedule and Opportunity to Shape Market Trends

Experienced Remote Chat Agent – Deliver Exceptional Customer Experiences at arenaflex

Aetna Careers, Aetna Jobs Remote, Virtual Jobs

In-Home Product Tester – No Fees, No Experience, $25-$45/hr

Marketing Assistant (Entry Level)

Wellness Success Manager

Remote Sales Career Opportunity: Unlock Your Earning Potential and Define Success

LLM Performance Researcher

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Sr. Account Executive - Influencer Marketing - Full-time / Part-time

Looking for Title I Tutor - Part-time in Idaho

FT Data Entry Clerk (Driver Enrollment) - Work From Home

**Experienced Part-Time Remote Data Entry Clerk – Flexible Work Schedule and Opportunity to Shape Market Trends**

**Experienced Remote Chat Agent – Deliver Exceptional Customer Experiences at arenaflex**

Aetna Careers, Aetna Jobs Remote, Virtual Jobs

In-Home Product Tester – No Fees, No Experience, $25-$45/hr

Marketing Assistant (Entry Level)

Wellness Success Manager

Remote Sales Career Opportunity: Unlock Your Earning Potential and Define Success

Experienced Part-Time Remote Data Entry Clerk – Flexible Work Schedule and Opportunity to Shape Market Trends

Experienced Remote Chat Agent – Deliver Exceptional Customer Experiences at arenaflex