ML Ops Engineer, Machine Learning & AI

Remote Full-time
About the position

Machine Learning (ML) at the New York Times enhances the experience of our 150 million digital readers from around the globe and grows our subscriber base through content recommendations and personalizations.
The Machine Learning & AI team builds and maintains the infrastructure that hosts all of The New York Times real-time ML inference models, including both data and compute. Our partners are Data Scientists that build and deploy their ML models on the ML platform. On the other end, our partners are engineering systems that call these hosted models at scale with low-latency and Service Level Agreements guaranteed by our platform.
As an MLOps Engineer you will partner with product, data science and ML platform engineers to build and maintain the infrastructure that powers the machine learning lifecycle. You will automate and refine the training, deployment, monitoring, and management of our ML models.
This role reports to the Senior Engineering Manager of Data Management Infrastructure.

Responsibilities
• Build and Automate ML Pipelines: by owning robust CI/CD pipelines for automated model training, validation, deployment, and retraining.
• Productionalize Models: Build the process for packaging, containerizing, and deploying ML models as scalable, low-latency, and highly-available services.
• Monitoring and Operations: Implement and manage comprehensive monitoring for production models, tracking system health, data drift, and model performance degradation.
• Tooling and Infrastructure: Manage and evolve our MLOps toolchain, including model registries, feature stores, experiment tracking systems, and model serving platforms.
• Collaboration and Support: Partner with data scientists to understand model requirements and optimize them for production. Support software engineers in integrating with ML services.
• Best Practices and Governance: Champion and enforce MLOps best practices for reproducibility, versioning (data, code, model), testing, and governance.
• Demonstrate support and understanding of our value of journalistic independence and a strong commitment to our mission to seek the truth and help people understand the world.

Requirements
• 2+ years of software engineering or DevOps experience with a focus on MLOps, automation, and infrastructure
• 2+ years of experience programming in Python or Go
• Experience building and managing CI/CD pipelines (e.g., Github Actions, Jenkins, GitLab CI)
• Hands-on experience with containerization and orchestration (e.g., Docker, Kubernetes)
• Cloud platform experience (AWS, GCP) and familiarity with infrastructure-as-code (e.g., Terraform, CloudFormation)

Nice-to-haves
• Experience with MLOps tools (e.g., MLflow, Kubeflow)
• Experience with the machine learning model lifecycle, from experimentation to production
• Experience with data processing frameworks (e.g., Spark, Dask, or Ray)
• Experience with low-latency no-sql datastores (BigTable, Dynamo, etc)
• Familiarity with monitoring and observability stacks (e.g., Prometheus, Grafana, Datadog, or ELK)
• Knowledge of data engineering pipelines and orchestration tools (e.g., Airflow, Prefect)

Benefits
• dependent on your role, you may be eligible for variable pay, such as an annual bonus and restricted stock
• Benefits may include medical, dental and vision benefits, Flexible Spending Accounts (F.S.A.s), a company-matching 401(k) plan, paid vacation, paid sick days, paid parental leave, tuition reimbursement and professional development programs

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Diversity & Inclusion Specialist

Remote Full-time

Invoice Auditor Freight Audit

Remote Full-time

Engineering Manager, Revenue

Remote Full-time

QA Analyst

Remote Full-time

Remote Fraud Call Center Representative – Puerto Rico

Remote Full-time

Regional Sales Manager, Upper Midwest

Remote Full-time

Fraud Investigation Specialist I

Remote Full-time

Bilingual Customer Services Representative (Remote – Puerto Rico)

Remote Full-time

**Experienced Remote Call Center Customer Service Representative – Medicaid Member Support**

Remote Full-time

Staff Data Platform Engineer

Remote Full-time
← Back to Home