Azure open AI Engineer

Remote Full-time
Role: AI EVAL Engineer Location: Bellevue, WA (Remote) Duration: 6+ months AI EVAL Engineering Azure OpenAI; EVAL; Bench Marking Required Skills - Strong understanding of LLMs and generative AI concepts, including model behavior and output evaluation - Experience with AI evaluation and benchmarking methodologies, including baseline creation and model comparison - Hands-on expertise in Eval testing, creating structured test suites to measure accuracy, relevance, safety, and performance - Ability to define and apply evaluation metrics (precisionrecall, BLEUROUGE, F1, hallucination rate, latency, cost per output)Prompt engineering and prompt testing experience across zero-shot, few-shot, and system prompt scenarios - Python other programming languages, for automation, data analysis, batch evaluation execution, and API integration - Experience with evaluation tools/frameworks (OpenAI Evals, HuggingFace evals, Promptfoo, Ragas, DeepEval, LM Eval Harness) - Ability to create datasets, test cases, benchmarks, and ground truth references for consistent scoring - Test design and test automation experience, including reproducible evaluation pipelines - Knowledge of AI safety, bias, security testing, and hallucination analysis Nice-to-Have - RAG evaluation experience - Azure OpenAI - OpenAI - Anthropic - Google AI platforms - Performance benchmarking (speed, throughput, cost) - Domain knowledge Office apps enterprise systems networking Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Digital Designer (Remote US) - Future Opening

Remote Full-time

WORK FROM HOME/IN-OFFICE INSURANCE BENEFITS REP in Warren, MI in Aston Carter (job Id: 1681365917)

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Remote Healthcare Operations**

Remote Full-time

IS Security Manager

Remote Full-time

HIM Coder I - HIM Financial - Full Time 8 Hour Days (Non-Exempt) (Non-Union)

Remote Full-time

Medical Writer II – Marketing & Digital Content (contract)

Remote Full-time

Comcast Cybersecurity: Software Development & Engineering 3 (Python or Rust)

Remote Full-time

[Remote] Senior Project Manager - Oncology (Radiopharmaceutical) - US/Canada - Remote

Remote Full-time

[Remote] Hiring for Salesforce Administration - Remote

Remote Full-time

Johnson City Flexible Role:Delta Airlines Flight Attendant Needed

Remote Full-time
← Back to Home