Data Engineer - AI (REMOTE)

Remote Full-time
Description • Own the end-to-end data lifecycle for Upbound’s AI initiatives, from raw ingestion through model-ready datasets, powering the next generation of Crossplane and the Intelligent Control Plane. • Architect and maintain scalable, cloud-native data pipelines (batch + streaming) that collect, clean, and enrich telemetry from thousands of Kubernetes clusters, cloud APIs, and customer workloads worldwide. • Partner with ML engineers, product, and SRE teams to define data contracts, schema evolution strategies, and governance policies that keep petabyte-scale lakes reliable, secure, and compliant (SOC 2, GDPR, HIPAA). • Design real-time feature stores that feed both online inference services and offline training jobs, ensuring sub-second latency for critical control-plane decisions while guaranteeing reproducibility and version control. • Build self-service tooling (SDKs, notebooks, observability dashboards) that empowers analysts and data scientists to discover, profile, and experiment with datasets without bottlenecks. • Optimize compute and storage costs through intelligent partitioning, incremental processing, and auto-scaling clusters on AWS/GCP, cutting spend by double-digit percentages year-over-year. • Implement advanced data quality frameworks—unit tests, anomaly detection, lineage tracking—that surface issues before they reach production models or customer dashboards. • Contribute to open-source Crossplane providers and Upbound’s internal “Data as Infrastructure” codebase, turning repeatable patterns into reusable packages the community can adopt. • Champion a culture of documentation and knowledge sharing: run internal tech talks, write runbooks, and mentor junior engineers to raise the bar for data excellence across the company. • Stay ahead of the curve by evaluating emerging technologies (Iceberg, DuckDB, Flink, vector databases) and running proof-of-concepts that translate into competitive advantages for Upbound’s AI roadmap. Requirements • 5+ years building production-grade data pipelines in Python, SQL, and at least one JVM language (Scala/Java/Kotlin). • Deep expertise with cloud data stacks: S3/GCS, Redshift/BigQuery, EMR/Dataproc, Kinesis/PubSub, Airflow/Mage, dbt, Terraform. • Hands-on experience with Kubernetes, Docker, and infrastructure-as-code; familiarity with Crossplane is a strong plus. • Proven track record designing real-time streaming architectures (Kafka, Pulsar, Flink) and batch ETL at multi-terabyte scale. • Nice-to-have: contributions to open-source data projects, advanced SQL performance tuning, or prior work in ML feature engineering. ️ Benefits • Fully remote-first culture with quarterly off-sites in inspiring global locations. • Competitive salary + equity package that grows with the company’s valuation. • $3,000 annual learning stipend for conferences, courses, and certifications. • Flexible PTO policy and 16-week gender-neutral parental leave. • Home-office setup budget and monthly wellness stipend. Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Part Time Instructor - Cyber Security Engineering (US)

Remote Full-time

[Remote] Director, Strategic Finance (Product & Engineering)

Remote Full-time

HR - Business Partner

Remote Full-time

[Remote] Experienced Insurance Sales Producer | Remote, NC

Remote Full-time

Experienced Customer Service Representative – Remote Work Opportunity with arenaflex for Delivering Exceptional Customer Experiences

Remote Full-time

Consultant - Hedge Fund/Family Office Controller

Remote Full-time

Experienced Data Entry Professional for Remote Opportunities – Full-Time or Part-Time Data Management and Administration

Remote Full-time

Claims Processing Specialist

Remote Full-time

Experienced Remote Data Entry Clerk – Detail-Oriented and Organized Individual for Accurate Data Management and Entry

Remote Full-time

Experienced Customer Service Representative - Remote Opportunity at blithequark

Remote Full-time
← Back to Home