Data Engineer – AI

Remote Full-time
Job Description:
• Define and drive the technical vision for data platforms that support AI-powered features in Crossplane and Upbound Spaces
• Lead the design of data pipelines that transform infrastructure and data into training datasets for ML models
• Architect vector search and RAG systems that leverage Crossplane Control Planes & Upbound Marketplace as a knowledge store
• Build data infrastructure that processes resources, extensions, and compositions for semantic search
• Establish frameworks for collecting, processing, and analyzing infrastructure configuration data
• Design data pipelines that handle Crossplane-specific data
• Create infrastructure for indexing and searching Upbound Marketplace content, documentation, and community patterns
• Develop metrics and monitoring for AI features integrated with Upbound's control plane architecture
• Design data systems that power AI agents for infrastructure provisioning & operations, helping users generate and optimize Crossplane compositions
• Create feature engineering platforms that extract signals from control plane operations, resource status, and reconciliation patterns
• Implement data infrastructure for training models that predict infrastructure failures, optimize resource allocation, and suggest configuration improvements
• Drive the development of knowledge graph representations of infrastructure dependencies and relationships

Requirements:
• 10+ years of software/data engineering experience with at least 4 years in technical leadership roles
• Proven track record building data platforms that support production systems at scale
• Deep expertise in both traditional data engineering (Spark, Airflow, data lakes) and ML-specific infrastructure (feature stores, model serving)
• Experience with vector databases (Pinecone, Weaviate, Qdrant, Milvus, pgvector, Opensearch, ElasticSearch)
• Demonstrated experience with LLM applications, including RAG architectures and semantic search implementations
• Understanding of Kubernetes, cloud-native architectures, and infrastructure-as-code principles
• Strong understanding of data requirements for AI/ML systems: training pipelines, feature stores, and inference infrastructure
• Hands-on experience building knowledge bases and semantic search systems for technical documentation and code
• Experience with embedding models for code and technical documentation
• Knowledge of time-series data processing for infrastructure metrics and events
• Understanding of graph databases and their application to infrastructure dependency modeling

Benefits:
• Health insurance
• 401(k) matching
• Flexible work hours
• Paid time off
• Remote work options

Apply Now

Apply Now
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Order Management Specialist

Remote Full-time

Security Shift Supervisor

Remote Full-time

Freelance Data Entry Professional - Easy Income Source

Remote Full-time

Engineer - Linux - Military Veterans

Remote Full-time

Experienced Customer Service Representative – Remote Opportunity for Delivering Exceptional Support and Driving Customer Satisfaction

Remote Full-time

**Experienced Customer Experience Representative – INBOUND (Remote) at arenaflex**

Remote Full-time

Director, Customer Experience

Remote Full-time

**Experienced Remote Customer Support Specialist – Deliver Exceptional Service from the Comfort of Your Own Home**

Remote Full-time

English As A Second Language Teacher

Remote Full-time

Diligence QC Analyst II (Part-Time)

Remote Full-time
← Back to Home