Generalist Evaluator Expert

Remote Full-time
Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text. * * * ### **Job Details:** - **Design and Optimize Prompts**: Create detailed prompts with multiple constraints and instructions. - **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubric. - **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations. - **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability before integration into official benchmarks. ### **Minimum Qualifications:** - BS or BA from a reputable institution completed or in progress - Strong writing and critical thinking skills. - Ability to work independently and meet deadlines. - Significant familiarity with ChatGPT or similar tools for personal decision-making or hobbies / general interests. - US or Canada based. ### **Preferred Qualifications:** - Experience in teaching or research. ### **Application & Onboarding Process:** - Complete an AI-led interview, this should take around 15 minutes. - Complete a 45-minute written assessment that will guide you through writing rubrics. - If selected, you will be invited to work on the project. ### **More Details About This Role:** - This is a **remote and asynchronous** role — work on your own schedule. - Expect to contribute at least **20 hours per week**. - Expect a commitment of around 1 month. - You’ll be working in a structured project environment with clear goals and tools. * * * ### **About** [**Mercor**]( - Our team is based in San Francisco, CA - We [specialize]( in recruiting experts for top AI labs - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

(USA) Principal, Data Scientist

Remote Full-time

[PART_TIME Remote] Freelance Writer - Remote

Remote Full-time

**Experienced Customer Service Representative – Work from Home Opportunity with arenaflex**

Remote Full-time

Content Writer

Remote Full-time

Remote Virtual Text Chat Operator (Flexible Hou...

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Remote Database Management and Customer Service**

Remote Full-time

**Part Time Customer Service Representative (Fully Remote / Entry Level) at blithequark**

Remote Full-time

**Experienced Full Stack Social Media Customer Support Specialist – Work From Home Opportunity at arenaflex**

Remote Full-time

Experienced Remote Medical Data Entry Clerk (Typist) – High-Volume Data Entry and Client Management Expertise in Healthcare and Manufacturing Sectors

Remote Full-time

AI Trainer Telerobotics Operator (Freelance, San Francisco)

Remote Full-time
← Back to Home