Staff Machine Learning Engineer - Agentic AI

Remote Full-time
Job Description Team: AI Agents | Location: Melbourne / Sydney What we have built We run production AI agents that autonomously resolve customer service tickets across 100,000+ Zendesk accounts. Each agent takes a customer issue, decomposes it into a multi-step plan, executes real actions refunds, order modifications, escalations through live APIs, and closes the ticket without a human in the loop. The core uses a proprietary iterative architecture: goals decompose into plans, reusable skills are pulled from a registry, execution is evaluated, and the result feeds the next attempt. Successful resolution patterns are synthesised into new skills and written back into the registry the system learns from its own execution history. On GAIA-class multi-step tool-use benchmarks, our agents match the best published results. Internally, 158+ scenario-based evals run continuously against real Zendesk tickets, scored through Braintrust with regression detection on every deploy. What you will own Architecture: The iterative planner works. What we have not solved: plan decomposition under ambiguous goals, memory-tier interference across concurrent sessions, over-eager skill acquisition, and multi-agent delegation via A2A. These are yours to take on. Domain-specialised training: We are building toward RL-trained models specialised for customer service resolution. The data pipeline is instrumented. The next step reward curricula, rollout systems, feedback loops is a 6–12 month build. You own both the science and the systems. Evaluation infrastructure: 158+ evals run continuously, but multi-turn evaluation and automated trajectory analysis are early. You will build the quality gates that block deploys when performance drops, integrated into CI from the start. Guardrails at scale: Tool misuse, cascading action chains, prompt injection, hallucination loops: the threat surface for autonomous agents at enterprise scale is real. You will design the multi-layered defences supervisor patterns, capabilities-based access control, output validation that work across thousands of concurrent sessions without adding latency. What we are looking for 5+ years building production ML/AI systems. You have shipped agent architectures that handle planning, tool dispatch, memory, and failure recovery. If your experience is LangChain tutorials, this is not the right fit. You have built internal evals because you know why public benchmarks lie, and you have the scars to prove it. Python and PyTorch fluency, plus at least one agent framework and the judgment to know when to throw it out and build custom. Bonus: genuine depth in RL for language models reward shaping, online/offline tradeoffs, reward hacking as a diagnostic. We are building toward domain-specialised training and need someone who can lead that work. The intelligent heart of customer experience Zendesk software was built to bring a sense of calm to the chaotic world of customer service. Today we power billions of conversations with brands you know and love. Zendesk believes in offering our people a fulfilling and inclusive experience. Our hybrid way of working, enables us to purposefully come together in person, at one of our many Zendesk offices around the world, to connect, collaborate and learn whilst also giving our people the flexibility to work remotely for part of the week. As part of our commitment to fairness and transparency, we inform all applicants that artificial intelligence (AI) or automated decision systems may be used to screen or evaluate applications for this position, in accordance with Company guidelines and applicable law. Zendesk is an equal opportunity employer, and we’re proud of our ongoing efforts to foster global diversity, equity, & inclusion in the workplace. Individuals seeking employment and employees at Zendesk are considered without regard to race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, military or veteran status, or any other characteristic protected by applicable law. We are an AA/EEO/Veterans/Disabled employer. If you are based in the United States and would like more information about your EEO rights under the law, please click here . Zendesk endeavors to make reasonable accommodations for applicants with disabilities and disabled veterans pursuant to applicable federal and state law. If you are an individual with a disability and require a reasonable accommodation to submit this application, complete any pre-employment testing, or otherwise participate in the employee selection process, please send an e-mail to [email protected] with your specific accommodation request.
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

[Remote] Sales Support and Data Coordinator

Remote Full-time

**Experienced Weekend Part-Time Customer Service Representative – Remote Opportunity at arenaflex**

Remote Full-time

Agricultural Finance Specialist (Home-based, SHORT-TERM)

Remote Full-time

**Experienced Remote Chat Support Representative – Delivering Exceptional Customer Service in a Dynamic Energy Industry**

Remote Full-time

Project Coordinator, Platform Services (Remote, CAN))

Remote Full-time

**Experienced Data Entry Specialist – Remote Opportunity with arenaflex**

Remote Full-time

Pension Plan Administrator

Remote Full-time

French Document Review Attorney

Remote Full-time

Volunteer Educational Content Developer – Remote & Hybrid Role for Environmental Learning & Sustainability Initiatives

Remote Full-time

**Experienced Data Entry Specialist – Remote Opportunity for Freshers**

Remote Full-time
← Back to Home