Data Engineer: Scalable Pipelines for ML Workflows

Remote Full-time
Roles and Responsibility - • Design, build, and maintain scalable and reliable data pipelines for dataset creation, transformation, and benchmarking • Own and optimize Airflow pipelines on AWS for data processing, orchestration, and evaluation workflows • Write efficient, production-grade SQL and Python code for large-scale data processing and analysis • Partner closely with ML engineers to enable model training, evaluation, and benchmarking pipelines • Improve pipeline performance, reliability, and observability, ensuring high data quality in production • Build and maintain systems to support model performance tracking and data drift monitoring • Troubleshoot and resolve data issues across pipelines, ensuring minimal impact on ML workflows • Contribute to data architecture decisions and best practices across the platform • Collaborate cross-functionally with ML, platform, and data teams to support scalable ML infrastructure What Were Looking For • 35 years of experience in Data Engineering, Data Platforms, or related roles • Strong proficiency in Python and SQL with experience in production systems • Hands-on experience with AWS services (S3, EC2, SageMaker or similar) • Solid experience building and managing Airflow (or similar orchestration tools) • Strong understanding of data engineering fundamentals (ETL/ELT, data modeling, pipeline design) • Experience working with large-scale datasets and distributed data systems • Experience supporting ML workflows, datasets, or evaluation pipelines • Strong problem-solving skills and ability to work independently in a fast-paced environment Nice to Have • Experience with ML infrastructure, MLOps, or model evaluation workflows • Exposure to biometric systems or computer vision datasets • Familiarity with data quality frameworks, monitoring, and observability tools • Experience working in SaaS or high-scale production environments
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

**Experienced Part-Time Remote Data Entry Specialist Opportunity at arenaflex**

Remote Full-time

Remote - Supply Chain Strategist

Remote Full-time

Statewide Business Development Executive

Remote Full-time

Retail Sales Associate - Crossing Smithfield – Amazon Store

Remote Full-time

Compliance Manager (Remote based in US) | Tenet Healthcare | Remote (United States)

Remote Full-time

Senior Auto Claims Advisor- Remote

Remote Full-time

Authorization Coordinator I, Clinical Pharmacy Management

Remote Full-time

Senior VDC Manager

Remote Full-time

Triage Nurse (Remote, Contact Center)

Remote Full-time

Assurance Associate II/ Audit Associate II

Remote Full-time
← Back to Home