Lead QA Engineer (AI Agent Quality & Evaluation)

Remote Full-time
As a Lead QA Engineer, you’ll own quality strategy for AI-powered systems where correctness is probabilistic, outputs are structured (JSON), and evaluation requires real measurement (accuracy, cost, latency, edge-case handling, regression detection).

You’ll build automated evaluation harnesses, and partner closely with Engineering and Product to prevent silent quality regressions as the system evolves.

High autonomy, high leverage, and direct impact on the core product.

Ideal Profile
• Lead QA engineer who has moved beyond manual testing into automation, tooling, and quality systems.
• Comfortable testing systems where “expected output” is not always deterministic — and knows how to create evaluation strategies anyway.
• Strong Python + data mindset: can build repeatable harnesses, metrics pipelines, and regression suites.
• Product-minded and skeptical in the best way: notices failure modes, ambiguous cases, and risks early.
• Comfortable collaborating with engineers and shipping quality gates, not just filing bugs.
• Hands-on experience with AI developer / agent tooling (e.g., Claude Code, GitHub Copilot or similar) and building agents that amplify inputs and orchestrate multi-step workflows (prompt engineering, tool integration).

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Phone Services Advocate (Overnight Shift)

Remote Full-time

Data Entry Clerk Jr (No Experience) (Entry Level)

Remote Full-time

Remote/WAH - Resource Coordinator

Remote Full-time

[Remote] Corporate Strategy and FP&A Associate

Remote Full-time

IT Service Techniker (m/w/d) im Bereich Business Communication

Remote Full-time

Service BDC Representative

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Remote Workforce Management at arenaflex**

Remote Full-time

Markets and Distribution Career Coach (Part-Time)

Remote Full-time

Freelance Academic Writer - Research & Assignment Support (Remote)

Remote Full-time

Compliance Counsel

Remote Full-time
← Back to Home