AI Engineer, Prompt Engineering, Python

Remote Full-time
Description: • Design & iterate prompts (system, tool/function-calling, task prompts) to boost voice AI agent success, reliability, and tone. • Build co-pilots for customers to author their own prompts: meta-prompted assistants that suggest structures, lint for risks, autocomplete tool schemas, critique drafts, and generate eval cases. • Work directly with customer feedback and conversation logs to identify failure modes; translate them into prompt changes, guardrails, and data improvements. • Build eval datasets (success labels, rubrics, edge cases, regressions) and run offline/online evaluations (A/B tests, canaries) to quantify impact. • Create Python utilities/services for prompt versioning, config-as-code, rollout/rollback, and guardrails (policies, refusals, redaction). • Partner with PM/Success to define success metrics (task completion, first-pass accuracy, cost, latency) and instrument dashboards/alerts. • Own LLM integration details: function/tool schemas, output parsing/validation (pydantic), retrieval-aware prompting, and fallback strategies. • Ensure privacy & compliance (PII handling, anonymization, regional data boundaries) in datasets and logs. • Share learnings via concise docs, playbooks, and internal demos. • Run a tight feedback loop with customers, turn real conversations into better prompts and eval datasets, and ship changes that measurably improve agent outcomes. Requirements: • Python: 3+ years writing clean, tested, production code (typing, pytest, profiling); experience building small services/APIs (FastAPI preferred). • Prompt Engineering: Hands-on experience designing system/tool prompts, meta-prompting, rubric graders, and iterative prompt tuning based on real user data. • LLM Integration: Comfortable with major APIs (OpenAI/Anthropic/Google/Mistral), function/tool calling, streaming, and robust output handling. • Evaluation Mindset: Ability to define measurable success, create labeled datasets, and run methodical experiments/A/B tests. • Product Sense: Comfortable talking with customers, turning qualitative feedback into shipped improvements. • Data Hygiene: Practical experience cleaning, labeling, and balancing datasets; awareness of privacy/PII constraints. • Nice-to-haves: Experience building prompt-authoring UIs/SDKs or internal tooling for prompt versioning and governance. • Nice-to-haves: Agentic frameworks & tooling: DSpy, MCP, LangGraph, LlamaIndex, Rasa; experience with agent/tool schemas and orchestration. • Nice-to-haves: Observability & eval tooling: Langfuse, LangSmith, Braintrust; building eval harnesses and experiment dashboards. • Nice-to-haves: RAG & vector stores: Qdrant/Weaviate/Pinecone and retrieval-aware prompting. • Nice-to-haves: Experimentation workflows: A/B testing, prompt diffing/versioning. • Nice-to-haves: Infra & analytics: light SQL/log analysis, metrics & tracing, simple Grafana/OTel dashboards. • Nice-to-haves: Writing public blog posts or talks about applied LLM techniques. Benefits: Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Entry-Level Technical Support Representative - Immediate Start, No Experience Required - 100% Remote

Remote Full-time

Corporate Affairs and Patient Engagement Intern

Remote Full-time

[Remote] Virtual Visit Facilitator (Remote)

Remote Full-time

Experienced Full Stack Software Engineer – Web & Cloud Application Development

Remote Full-time

Account Director, Criminal Justice Systems (Southeast)

Remote Full-time

Experienced or Aspiring Remote Data Entry Specialist - Flexible Part-Time & Full-Time Opportunities with Unlimited Earning Potential

Remote Full-time

IT Contract Specialist; Hybrid

Remote Full-time

Region Operations Manager III

Remote Full-time

Customer Service Representative - Problem Solver & Client Advocate at blithequark

Remote Full-time

Social Media Content Specialist

Remote Full-time
← Back to Home