AI Engineer (RAG Specialist)

Remote, Full-time
We are looking for a skilled AI Engineer specializing in Retrieval-Augmented Generation (RAG) to join our team. Your primary focus will be bridging the gap between static LLMs and dynamic, proprietary data. You won't just be "calling an API"; you will be architecting the entire data lifecycle, from ingestion and chunking strategies to advanced retrieval and response synthesis. The ideal candidate understands that the secret to a great RAG system isn't just the LLM, but the quality of the retrieval and the nuances of the vector database.

US Citizenship Required

Key Responsibilities
• Pipeline Architecture: Design and deploy end-to-end RAG pipelines using frameworks like LangChain, LlamaIndex, or Haystack.
• Data Engineering: Develop robust ETL processes to ingest unstructured data (PDFs, docs, web scrapes) into high-performance vector stores.
• Retrieval Optimization: Implement and tune advanced retrieval techniques, including Hybrid Search (keyword + semantic), Re-ranking (Cross-Encoders), and Parent-Document Retrieval.
• Vector Database Management: Manage and scale vector databases such as Pinecone, Weaviate, Milvus, or Chroma.
• Evaluation & Benchmarking: Establish rigorous evaluation frameworks (e.g., RAGAS, TruLens) to measure faithfulness, relevancy, and hit rates.
• Performance Tuning: Optimize embedding models and prompt engineering to reduce latency and "hallucinations."

Technical Qualifications
• Language Proficiency: Advanced Python (preferred) or TypeScript.
• LLM Expertise: Hands-on experience with OpenAI GPT-4, Anthropic Claude, or open-source models like Llama 3 via Ollama or vLLM.
• Vector Expertise: Deep understanding of embeddings, similarity metrics (Cosine, Euclidean), and indexing strategies (HNSW, IVF).
• NLP Fundamentals: Familiarity with tokenization, context windows, and attention mechanisms.
• Cloud/DevOps: Experience deploying AI applications on AWS, GCP, or Azure using Docker/Kubernetes.

Preferred Skills
• Experience with Agentic RAG (Multi-step reasoning and tool-use).
• Knowledge of Graph Databases (Neo4j) for GraphRAG implementations.
• Contributions to open-source AI projects.
• Background in traditional Information Retrieval (Elasticsearch/Solr).
