ML Platform Engineer

Remote Full-time
Let me introduce... With Sonia, doctors are successful doctors. We create and deploy AI enhanced solutions that make doctors’ lives easier, patients’ care better, and healthcare systems more efficient. If you’re an intrinsically motivated self-starter who values impactful work, join us in revolutionizing healthcare. We’re looking for an experienced ML Platform Engineer (all) with deep Kubernetes expertise to support the infrastructure powering our AI and ML workloads. You’ll work closely with ML engineers on everything from deploying cutting-edge LLM inference to refining observability and automating workflows—always with reliability, scalability, and performance as your guiding principles. This role can be performed remotely from anywhere in Germany or Luxembourg, or in a hybrid setup from our offices in Luxembourg or Berlin. This is what you’ll own Support and enhance our Kubernetes-based infrastructure in cloud environments, running both ML/LLM workloads and general applications Deploy and optimize LLM inference systems Design, build, and improve MLOps/DevOps pipelines to support the entire development lifecycle Manage GPU scheduling and autoscaling with Kubernetes-native tooling Ensure observability and alerting across the platform Operate and troubleshoot supporting infrastructure Contribute to platform reliability, security, and performance through automation and best practices You’ll thrive in this role if you bring 5+ years of experience in MLOps or SRE Strong hands-on Kubernetes experience, including GitOps (Flux or ArgoCD), Kustomize, Helm and production troubleshooting Familiarity with LLM inference deployment and optimization in Kubernetes (e.g., vLLM, LMCache, llm-d) Experience with MLOps supporting tools such as MLflow or Argo Workflows Understanding of GPU resource orchestration in Kubernetes environments Profound knowledge of observability tools, such as VictoriaMetrics, VictoriaLogs and Grafana Knowledge of database and broker administration (PostgreSQL, Redis and RabbitMQ) Solid scripting skills in Python Comfortable working with cloud platforms (OVHcloud, AWS, GCP or Azure) Nice-to-Haves Experience with audio ML models or real-time inference Exposure to CI/CD practices tailored for ML systems Familiarity with Kubernetes networking, security, or performance tuning Why you’ll love working with us Full ownership of a mission-critical platform A team that values curiosity, learning, and experimentation Remote-first setup with the option to work in our Berlin office Competitive salary depending on experience Work on AI infrastructure that directly impacts healthcare innovation Ready to apply? If you're passionate about web development and want to work with cutting-edge technologies, we'd love to hear from you! I'm Margarita and will be guiding you through the application process.
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Experienced Part-Time Work from Home Data Entry Clerk and Market Research Participant – Flexible Remote Opportunity for Self-Motivated Individuals

Remote Full-time

Grants Compliance Senior Specialist, Community Partnerships & Investments

Remote Full-time

Experienced Data Science Manager – Strategic Planning, Team Leadership, and Data Analysis for Business Growth at blithequark

Remote Full-time

Junior Account Executive (AE)

Remote Full-time

**Experienced Customer Service Representative – Remote Opportunity at arenaflex**

Remote Full-time

Customer Success Manager – LATAM (One-to-Many)

Remote Full-time

**Experienced Part-Time Evening Work From Home Data Entry Specialist – Flexible and Convenient Opportunity to Enhance Your Income**

Remote Full-time

REMOTE - Guest Booking Coordinator/Producer (Part-Time)

Remote Full-time

**Experienced Remote Data Entry Specialist – Organizing and Maintaining arenaflex's Databases**

Remote Full-time

[Hiring] Senior Security Governance and Risk Consultant @Tenchi Security

Remote Full-time
← Back to Home