Senior Python / LLM Engineer Needed – Router MVP & Predictive Model Loader

Remote Full-time
✅ In-Scope Work (Remaining)

Milestone 1: Router MVP Implementation

Deliverables
• Embedding pipeline (query + document embeddings)
• Vector storage using FAISS or Chroma
• Projection module for query feature extraction
• Configurable scoring & model-selection strategies
• Router MVP with pluggable LLM backends
• Router validation tests (routing correctness & mis-routing analysis)

Acceptance Criteria
• End-to-end routing demonstrated
• Deterministic and explainable routing decisions
• Pytest unit tests included
• All code committed to repository

⸻

Milestone 2: Predictive Loader & Integration

Deliverables
• Predictive model loader (LLM-based classifier)
• Warm-start caching & preload logic
• Cache management strategy
• Full integration with Router MVP
• FastAPI backend exposing routing endpoints
• Structured JSON logging
• End-to-end testing + documented stress testing
• Final validation & testing report

Acceptance Criteria
• Predictive loading working correctly
• Fully integrated end-to-end system
• Stress-testing methodology clearly documented
• Final report delivered

⸻

Technical Requirements
• Strong Python backend experience
• FastAPI
• LangChain or equivalent LLM orchestration framework
• FAISS or Chroma vector stores
• Dockerized services
• Structured JSON logging
• Pytest for unit & integration testing

⸻

Ideal Candidate
• 3+ years of Python backend experience
• Proven experience with LLMs, embeddings, and routing systems
• Hands-on with vector databases and retrieval pipelines
• Comfortable writing clean, testable, production-ready code
• Experience with performance testing and system validation
• Strong communication and documentation skills

⸻

Deliverables & Collaboration
• All work delivered via Git repository
• Clean, readable, well-tested code
• Clear documentation for setup, testing, and usage
• Milestone-based payments

⸻

To Apply

Please include:
1. Relevant experience with LLM routing, embeddings, or RAG systems
2. GitHub or code samples (if available)
3. Brief explanation of how you would approach Router MVP + predictive loading
