[Remote] AI Quality Analyst (LLM) | $30/hr Remote

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. Crossing Hurdles is seeking an AI Model Evaluator to assess outputs from large language models and autonomous agent systems. The role involves evaluating model performance, providing feedback for refinement, and ensuring consistent evaluation across reviewers. Responsibilities • Evaluate outputs from large language models and autonomous agent systems using defined rubrics and quality standards • Review multi-step agent workflows, including screenshots and reasoning traces, to assess accuracy and completeness • Apply benchmarking criteria consistently while identifying edge cases and recurring failure patterns • Provide structured, actionable feedback to support model refinement and product improvements • Participate in calibration sessions to ensure consistent evaluation alignment across reviewers • Adapt to evolving guidelines and ambiguous scenarios with sound judgment • Document findings clearly and communicate insights to relevant stakeholders Skills • Strong experience in LLM evaluation, AI output analysis, QA/testing, UX research, or similar analytical roles • Proficiency in rubric-based scoring, benchmarking frameworks, and AI quality assessment • Excellent attention to detail with strong decision-making skills in ambiguous cases • Proficient English communication skills (written and verbal) • Ability to work independently in a remote environment • Comfortable committing to structured evaluation workflows and evolving guidelines Company Overview • At Crossing Hurdles, we specialise in customised recruitment and staffing solutions designed to drive success for businesses and professionals. It was founded in 2022, and is headquartered in , with a workforce of 11-50 employees. Its website is Apply tot his job

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Call Center Sales Representative - Hybrid

Remote Full-time

Experienced Full-Time Customer Service Representative - Remote in Florida - Delivering Exceptional Pet Parent Experiences with a Fast-Growing E-commerce Retailer

Remote Full-time

**Experienced Live Chat Support Agent – Delivering Exceptional Customer Service in a Dynamic Remote Environment**

Remote Full-time

American Airlines – Developer/Senior Developer, IT Applications – Phoenix, AZ

Remote Full-time

**Experienced Remote Chat Operator – Deliver Exceptional Customer Support and Earn a Competitive Salary**

Remote Full-time

Project Manager

Remote Full-time

Administrative Assistant/Receptionist

Remote Full-time

Credit Risk Analyst – SME (m/f/d)

Remote Full-time

Experienced Remote Technical Support Assistant – Immediate Hiring Opportunity for Tech-Savvy Individuals with Excellent Communication Skills

Remote Full-time

Experienced Remote Benefits Customer Service Representative – Delivering Compassionate Support and Expertise in Health and Welfare Benefits Administration at arenaflex

Remote Full-time
← Back to Home