AI Software Engineer – Python, LLM Integrations & Scalable Systems

Remote Full-time
We are seeking a hands-on AI Software Engineer to design, build, and deploy intelligent backend systems that power conversational AI, automation, and data-driven decision engines. You’ll collaborate with data scientists, ML engineers, and product teams to integrate LLM-based models (OpenAI, Anthropic, Meta Llama, etc.) into scalable microservices and internal tools. Key Responsibilities • Design and develop Python-based backend systems supporting AI/LLM workflows, APIs, and data pipelines. • Build scalable microservices and vector-database integrations (e.g., Milvus, Pinecone, FAISS) for retrieval-augmented generation (RAG) pipelines. • Integrate and orchestrate LLMs using APIs (OpenAI, Anthropic, Hugging Face, vLLM, Triton, or similar). • Work closely with data engineering to optimize data ingestion, preprocessing, and embeddings pipelines. • Implement asynchronous and distributed processing (Celery, Kafka, or Ray). • Deploy and monitor services on Docker/Kubernetes with CI/CD pipelines (GitHub Actions, Jenkins, or GitLab CI). • Maintain documentation, testing, and model performance metrics. • Collaborate with DevOps and security to ensure safe and reliable AI deployments. Required Skills & Experience • 3+ years experience in backend or full-stack development with Python (FastAPI, Flask, or Django). • Proven experience integrating AI/ML or NLP systems (LLMs, embeddings, transformers, etc.). • Strong understanding of RESTful and async APIs, data serialization, and model inference optimization. • Familiarity with vector databases (Milvus, Pinecone, FAISS, Weaviate) and document chunking/embedding techniques. • Experience with SQL and NoSQL databases (PostgreSQL, MongoDB, Redis). • Hands-on with Docker, Kubernetes, and cloud environments (AWS / GCP / Azure). • Knowledge of MLOps workflows (model packaging, inference serving, versioning). • Experience with Git, CI/CD, and automated testing. Nice to Have • Familiarity with AI voice technologies (Riva, ElevenLabs, VAPI SDK, or similar). • Experience with LangChain, LlamaIndex, or Haystack for RAG pipelines. • Exposure to NVIDIA Triton / TensorRT-LLM / vLLM for high-performance inference. • Understanding of prompt engineering, retrieval evaluation, and fine-tuning pipelines. • Experience contributing to open-source AI frameworks. Why Join Us • Build real AI products — from voice agents to LLM-powered automation systems — not just prototypes. • Work with a high-performance engineering team using NVIDIA hardware and cutting-edge open-source tools. • 100% remote flexibility, cross-functional collaboration, and ownership of critical AI systems. Apply tot his job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

**Experienced Remote Customer Support Representative – Deliver Exceptional Service Experience to blithequark Customers**

Remote Full-time

Senior Epidemiologist, Real-World Evidence (FSP Sponsor Dedicated) 6 Locations

Remote Full-time

“Soar From Home – Delta Airlines Remote Part-Ti...

Remote Full-time

Adam Hergenrother Companies – Real Estate Operations Coordinator – Westfield, NJ

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Remote Opportunity at blithequark**

Remote Full-time

Data Entry Clerk (Work At Home) Part Time

Remote Full-time

Field Chief Security Officer

Remote Full-time

**Experienced Part-Time Data Entry Claims Intake Processor – Remote Opportunity with arenaflex**

Remote Full-time

AI Pilot Vibe Coding Assistant (Freelance)

Remote Full-time

**Experienced Customer Service Professional – Delivering Exceptional Work-from-Home Experiences**

Remote Full-time
← Back to Home