Backend Engineer - AI Runtime

Remote Full-time
About Us
We are a stealth-mode startup building a new AI runtime. Our mission is to make advanced language models deployable, customizable, and secure across diverse environments.
Role
We are seeking a Backend Engineer (Node.js/NestJS) to extend our platform using our existing codebase. You'll build the proxy backend that interacts with our custom inference runtime and extend our existing dashboards.
This role requires strong backend engineering skills, the ability to integrate existing systems, and comfort working closely with C++ engineers who are building low-level runtime features using CUDA.

Responsibilities

Proxy Backend for Inference Runtime
- Build and maintain a Node.js-based proxy backend that:
  - Accepts inference requests from the frontend.
  - Schedules and serializes prompts.
  - Manages QKV cache load/unload (API hooks from the C++ runtime).
  - Provides APIs to manage LoRA adapters.
- Integrate with authentication, RBAC, and logging already provided by the existing stack.
- Expose metrics and logs for monitoring inference usage and performance.
Dashboards
- Extend the existing dashboard: dataset upload, training job view, model management, inference usage, request history, and adapter selection.
- Reuse auth, billing, and user management code (Auth0, Stripe).
- Add necessary backend endpoints to support new UI flows.
Core Stack & Infrastructure

- Develop using NestJS as the main backend framework.
- Work with PostgreSQL, Redis, MongoDB, and HashiCorp Vault for persistence, caching, and secrets.
- Use Socket.IO for real-time updates (job status, inference progress).
- Ensure secure integration with Stripe (billing) and Auth0 (identity).
- Collaborate with DevOps on deployment pipelines.

Requirements
- Deep knowledge of the JavaScript and TypeScript languages.
- Strong experience with Node.js and the NestJS framework.
- Proficiency in PostgreSQL and Redis for persistence and caching.
- Hands-on experience with Socket.IO or other WebSocket libraries.
- Experience with secure configuration and secrets management (HashiCorp Vault preferred).
- Experience with JWKS.
- Comfortable working with microservices and integrating with existing codebases.
- Strong debugging and systems thinking; able to reason about scheduling, state management, and concurrency.

Nice to Have
- Experience integrating with AI runtimes (gRPC/REST backends for inference).
- Experience with RAG and MCP.
- Experience with authentication/authorization frameworks (Auth0, JWT, RBAC).
- Familiarity with Stripe API or similar billing systems.
- Contributions to backend open-source projects.
- Experience with WebRTC.

Why Join
- Extend a proven SaaS foundation into a new AI runtime platform.
- Work directly with a C++ systems team building custom inference features.
- Build real products (dashboards + runtime APIs) used by vendors and customers.
- Competitive compensation, equity potential.
Please use this link to apply: https://www.baasi.com/career/apply/3164212
