AI Engineer (Synthetic Data Pipelines)

Remote Full-time

This a Full Remote job, the offer is available from: Europe V7 At V7, we’re building AI platforms that help humans do their best work, at incredible scale and speed. Our mission is to turn human knowledge into trustworthy AI, making complex tasks faster, smarter, and more accurate. We’re growing fast, backed by leading investors and AI pioneers (including the minds behind Transformers and Gemini). The team you’ll be joining and the impact you’ll have We are a high-impact team at the forefront of AI research and engineering, developing large-scale synthetic data generation pipelines to train cutting-edge machine learning models. Our work blends rigorous experimentation with robust engineering, bridging the gap between foundational research and production-quality systems. We are seeking a technically strong and scientifically grounded AI Engineer to lead the development and evaluation of synthetic data pipelines used to train frontier models. You will design modular, reproducible pipelines that can be evaluated using proxy performance metrics, while collaborating closely with researchers and ML practitioners. The role requires strong command of experimental methodology, comfort with ambiguity, and fluency in large language model (LLM) systems—especially context engineering, agentic execution strategies, and performance optimization. You will be expected to move quickly, maintaining high-quality standards and leveraging modern AI tooling to streamline every stage of development. What you’ll be doing from day one • Design, implement, and maintain synthetic data generation pipelines for multi-modal training tasks. • Evaluate pipeline output using well-grounded proxy metrics and sound statistical experiments. • Own the design and execution of experiments involving LLMs, ensuring high reproducibility and clarity of findings. • Apply agentic design patterns and context engineering techniques to maximize model performance. • Use tools like Cursor, GitHub Copilot, and LLM agents to accelerate iteration, debugging, and documentation. • Collaborate with researchers and engineers across the stack to translate experimental insights into scalable systems. Who you are • 3+ years of software engineering experience with at least one major programming language (Python or JavaScript preferred). • Strong academic background with an MS or higher in Computer Science, Engineering, Mathematics, or a related scientific field. • Deep familiarity with Git, DVC, shell environments, and data pipeline orchestration. • Solid foundation in statistics and experimental design, especially in the context of ML evaluation. • Experience working with LLM systems, including: • Prompt and context engineering • Agentic workflows • Output optimization and reliability strategies • Familiarity with recent research on LLM training datasets and evaluation benchmarks, including: • CoDA: Agentic Systems for Collaborative Data Visualization • ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation • Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data • ChartQA-X: Evaluation and Augmentation for Visual Chart Reasoning What We Value • Curiosity • A bias toward iteration and improvement—welcoming early feedback, embracing failure as part of the discovery process, and viewing feedback not as criticism but as a signal for the next meaningful step forward. • A structured and analytical mindset, with strong attention to the scientific soundness of results. • The ability to thrive in fast-moving environments without clearly defined playbooks. • A preference for modular, reproducible systems over ad-hoc experimentation. • Rigour in both code and evaluation, especially when assessing LLM behaviour through proxy metrics and synthetic data feedback loops. Why Join Us This is a rare opportunity to contribute directly to the next generation of training infrastructure for advanced AI systems. The challenges are complex, the tooling is bleeding-edge, and the impact is tangible. You will be surrounded by researchers and engineers who care deeply about both product and science, and who are committed to solving hard problems with clear thinking and high standards. V7 champions equality and inclusion because diverse teams build better products. Don't check every box? Apply anyway — we value what makes you unique and will support you through the process, just let our Talent team know how they can help. This offer from "V7" has been enriched by Jobgether.com and got a 86% flex score. Apply tot his job

Apply Now

Experienced Remote Chat Support Specialist for Dynamic Customer Service Team – Entry-Level Opportunity with Flexible Hours and Professional Growth

Remote Full-time

AI Engineer (Synthetic Data Pipelines)

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Career with American Airlines:Flight Attendant | Hiring

Mortgage Compliance Officer

Experienced Remote Chat Support Specialist for Dynamic Customer Service Team – Entry-Level Opportunity with Flexible Hours and Professional Growth

Senior Agile Coach

Virtual Tutor (Fall 2025)

Experienced Full Stack Data Analyst – Big Data Analytics and Business Intelligence

Earn Cash Surveying New Health Products – Part-Time - Flexible hours with remote participation (Hiring Immediately)

Experienced Entry-Level Customer Support Associate – Live Chat (Remote / No Experience) at blithequark

Sr. Security Account Manager

Delta Airlines Online Careers Remote Jobs At Home

AI Engineer (Synthetic Data Pipelines)

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Career with American Airlines:Flight Attendant | Hiring

Mortgage Compliance Officer

Experienced Remote Chat Support Specialist for Dynamic Customer Service Team – Entry-Level Opportunity with Flexible Hours and Professional Growth

Senior Agile Coach

Virtual Tutor (Fall 2025)

**Experienced Full Stack Data Analyst – Big Data Analytics and Business Intelligence**

Earn Cash Surveying New Health Products – Part-Time - Flexible hours with remote participation (Hiring Immediately)

**Experienced Entry-Level Customer Support Associate – Live Chat (Remote / No Experience) at blithequark**

Sr. Security Account Manager

Delta Airlines Online Careers Remote Jobs At Home

Experienced Full Stack Data Analyst – Big Data Analytics and Business Intelligence

Experienced Entry-Level Customer Support Associate – Live Chat (Remote / No Experience) at blithequark