AI Engineer, Prompt Engineering, Python

Remote Full-time
Description: • Design & iterate prompts (system, tool/function-calling, task prompts) to boost voice AI agent success, reliability, and tone. • Build co-pilots for customers to author their own prompts: meta-prompted assistants that suggest structures, lint for risks, autocomplete tool schemas, critique drafts, and generate eval cases. • Work directly with customer feedback and conversation logs to identify failure modes; translate them into prompt changes, guardrails, and data improvements. • Build eval datasets (success labels, rubrics, edge cases, regressions) and run offline/online evaluations (A/B tests, canaries) to quantify impact. • Create Python utilities/services for prompt versioning, config-as-code, rollout/rollback, and guardrails (policies, refusals, redaction). • Partner with PM/Success to define success metrics (task completion, first-pass accuracy, cost, latency) and instrument dashboards/alerts. • Own LLM integration details: function/tool schemas, output parsing/validation (pydantic), retrieval-aware prompting, and fallback strategies. • Ensure privacy & compliance (PII handling, anonymization, regional data boundaries) in datasets and logs. • Share learnings via concise docs, playbooks, and internal demos. • Run a tight feedback loop with customers, turn real conversations into better prompts and eval datasets, and ship changes that measurably improve agent outcomes. Requirements: • Python: 3+ years writing clean, tested, production code (typing, pytest, profiling); experience building small services/APIs (FastAPI preferred). • Prompt Engineering: Hands-on experience designing system/tool prompts, meta-prompting, rubric graders, and iterative prompt tuning based on real user data. • LLM Integration: Comfortable with major APIs (OpenAI/Anthropic/Google/Mistral), function/tool calling, streaming, and robust output handling. • Evaluation Mindset: Ability to define measurable success, create labeled datasets, and run methodical experiments/A/B tests. • Product Sense: Comfortable talking with customers, turning qualitative feedback into shipped improvements. • Data Hygiene: Practical experience cleaning, labeling, and balancing datasets; awareness of privacy/PII constraints. • Nice-to-haves: Experience building prompt-authoring UIs/SDKs or internal tooling for prompt versioning and governance. • Nice-to-haves: Agentic frameworks & tooling: DSpy, MCP, LangGraph, LlamaIndex, Rasa; experience with agent/tool schemas and orchestration. • Nice-to-haves: Observability & eval tooling: Langfuse, LangSmith, Braintrust; building eval harnesses and experiment dashboards. • Nice-to-haves: RAG & vector stores: Qdrant/Weaviate/Pinecone and retrieval-aware prompting. • Nice-to-haves: Experimentation workflows: A/B testing, prompt diffing/versioning. • Nice-to-haves: Infra & analytics: light SQL/log analysis, metrics & tracing, simple Grafana/OTel dashboards. • Nice-to-haves: Writing public blog posts or talks about applied LLM techniques. Benefits: Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Part-Time Telehealth Therapist (LCSW, LMHC, LPC) - Maryland

Remote Full-time

Experienced Automotive Service Advisor – Customer-Focused Representative for Electric Vehicle Technology Leader

Remote Full-time

Salesforce Application Developer - Apex, Lightning Components, and Visualforce Pages - 3-Month Contract with Competitive Pay and Remote Work Opportunity

Remote Full-time

Entry Level Game Tester – Hiring Gamers (Work f...

Remote Full-time

Primary Care Provider, Nurse Practitioner / Physician Assistant

Remote Full-time

Compliance Consultant, Partnership Consulting Services

Remote Full-time

RN/LPN- Quality Clinical Management Lead Analyst United States Work at Home

Remote Full-time

Data Analyst

Remote Full-time

Administrative Specialist

Remote Full-time

Multi-Carrier Insurance Agent (Base salary + Uncapped commissions)

Remote Full-time
← Back to Home