Senior Agentic AI Test and Evaluation Engineer Remote / Telecommute Jobs

Remote Full-time

R-00165583

Description

At Leidos, you'll contribute to AI solutions that serve critical national and global missions—ranging from defense and intelligence to healthcare, energy, and space exploration. Our work emphasizes Trusted Mission AI: systems that are transparent, ethical, resilient, and accountable. You’ll collaborate with multidisciplinary teams to transition AI research into operational environments where accuracy, security, and reliability are non-negotiable. Joining Leidos means applying your expertise to solve some of the most complex and meaningful challenges of our time.

We are looking for a motivated Agentic AI Test and Evaluation Engineer who wants to work on challenging problems in a variety of domains – including enterprise IT, health, defense, intelligence, and energy – to get results that apply and go beyond the state of the art for measurably better outcomes. We apply our knowledge, capabilities, and experience to develop and deploy Trusted Mission AI – AI that deserves to be trusted by system owners, end users, and the public – to be accurate, ethical, reliable, and adaptable.

You will work with a team of AI scientists, agentic AI scientists, data scientists and data engineers to operationalize new approaches for test and evaluation of Agentic AI models that produce measurable advances over state-of-the-art solutions.

Primary Responsibilities
• Develops AI Models Test and Evaluation CONOPS
• Creates scalable Test and Model Evaluation plans for Agentic AI systems including process, techniques and tools.
• Works with AI scientists, agentic AI scientists, data scientists and data engineers to understand the AI system under test and develop test procedures for both positive and negative testing and evaluation
• Collects performance metrics as part of evaluation results documentation
• Works with MLOps engineers to integrate testing tools and procedures with the CI/CD pipeline
• Analyzes existing processes and resultant metrics to recommend potential improvements
• Collaborates with AI Governance team to maintain visibility and explainability through testing
• Implements testing process in the AI system design, development and deployment life cycle
• Identifies risks in project testing, particularly when assessing the limitations of planned tests on complex AI systems
• Works within teams of AI/ML researchers and engineers using Agile development processes

Basic Qualifications
• Bachelor's degree in Computer Science, Data Science or related field and over 8 years of relevant experience, or a Master's degree with 6 years of experience. Additional experience may be considered in lieu of degree.
• Strong Python programming fundamentals
• Experience with system and subsystem level test process and automation
• Experience with creating user acceptance test scenarios
• Experience with SecDevOps tooling and MLOps pipeline development
• Experience with software test automation techniques
• Experience with AI performance and vulnerability assessment
• Experience with AI model assurance evaluation
• Experience applying and automating AI interpretability & explainability tools and methods
• Experience with developing CONOPS and presentations
• Good understanding of machine learning algorithms, tools and platforms
• Self-starter with high intellectual curiosity
• Great communication skills, able to explain model and test results to a non-technical audience
• Proficient in data exploration techniques and tools
• Ability to obtain a Secret clearance

Preferred Qualifications
• Experience with data visualization libraries such as Plotly, Streamlit, and matplotlib.
• Experience with AI/ML tools, such as common Python packages (e.g., scikit-learn, NumPy, Pandas) and Jupyter notebooks
• Experience with database administration and data repositories
• Experience in data exploration techniques and tools
• Experience with building LLM and other Generative AI applications.
• Willing to learn new skills and platforms to support data analytics.
• Ability and willingness to obtain a Top Secret security clearance

At Leidos, we don’t want someone who "fits the mold"—we want someone who melts it down and builds something better. This is a role for the restless, the over-caffeinated, the ones who ask, “what’s next?” before the dust settles on “what’s now.”

If you’re already scheming step 20 while everyone else is still debating step 2… good. You’ll fit right in.

Original Posting: August 29, 2025

For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.

Pay Range: $104,650.00 - $189,175.00

The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.

#Remote
