ML Engineer (LLM)
Synthflow AI is a no-code platform for deploying voice AI agents that automate phone calls across contact center operations and business process outsourcing (BPO) at scale. We help mid-market and enterprise companies manage routine calls to save teams time and resources. Our agents have already delivered measurable impact:Over 5 million hours of contact center operations saved35% more calls answered compared to non-AI operators45 million calls handled with a 99.9% uptimeBacked by Accel, Atlantic Labs, and Singular and trusted by over 1,000 customers, our growth leads an industry shift toward sophisticated and accessible conversational AI.The RoleWe are looking for a handsâon ML Engineer who lives at the intersection of TTS, STT and large language models. You will design and ship new lowâlatency voice capabilities, working closely with product, research and infrastructure teams to push the boundaries of natural, multilingual conversation.What Youâll DoArchitect & implement realâtime speech pipelines (ASR â LLM â TTS) that meet stringent latency and quality targets.Evaluate and fineâtune stateâofâtheâart ASR, LLM and TTS modelsâboth commercial and openâsourceâand integrate the best performers into production.Optimise inference through quantisation, distillation, hardwareâaware graph compilation and reinforcementâlearningâbased tuning.Expose scalable APIs & microâservices with Python/FastAPI, gRPC or WebSocket streaming, backed by robust observability and autoscaling.Own deployment across cloud and onâprem environments, collaborating on containerisation (Docker), orchestration (Kubernetes) and CI/CD workflows.Stay ahead of the curve by tracking research, running experiments and sharing learnings with the broader team.What weâre looking forPython Engineering: 5+ years writing productionâgrade, wellâtested Python; deep familiarity with async, typing and performance profilingSpeech / Audio: Handsâon experience building realâtime ASR, TTS, voice chat or streaming audio productsLLM Tooling: Fineâtuning, prompt design, evaluation, retrievalâaugmented generation; familiarity with frameworks such as Openpipe/ART, LangChain, LlamaIndex or similarSystems & MLOps: Containerisation, GPU scheduling, observability, DevOps on GCP or AWS; infrastructureâasâcode principlesAPI Design: Building and maintaining highâthroughput REST/gRPC/FastAPI services; securing and monitoring them in productionBonus PointsModel compression expertise (quantisation, pruning, ONNX/TensorRT)Knowledge of audio and acousticsExperience with reinforcementâlearningâfromâhumanâfeedback (RLHF) or direct preference optimisationContributions to openâsource ML/speech projects (share your GitHub!)Familiarity with GPU inference servers (Triton, KServe) or distributed compute frameworks (Ray)Founded in Berlin in 2023 by serial entrepreneurs Albert Astabatsyan, Hakob Astabatsyan, and Sassun Mirzakhan-Saky, Synthflow AI democratizes access to advanced voice AI with a no-code platform that lets enterprises easily create, deploy and scale natural-sounding, cost-effective voice agents tailored to their business needs.
Apply Now
Apply Now