Deep Learning Software Engineer, LLM Performance

Remote Full-time
Job Description:
• Performance optimization, analysis, and tuning of LLM, VLM and GenAI models for DL inference, serving and deployment in NVIDIA/OSS LLM frameworks
• Scale performance of LLM models across different architectures and types of NVIDIA accelerators
• Scale performance for max throughput, minimum latency and throughput under latency constraints
• Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton
• Work with cross-collaborative teams across generative AI, automotive, image understanding, and speech understanding to develop innovative solutions

Requirements:
• Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Engineering, Computer Science, EECS, AI)
• 2+ years of relevant software development experience
• Excellent Python/C/C++ programming, software design and software engineering skills
• Experience with a DL framework like PyTorch, JAX, TensorFlow
• Prior experience with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation
• Prior experience with performance modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application
• Architectural knowledge of CPU and GPU
• GPU programming experience (CUDA or OpenCL)

Benefits:
• Equity
• Benefits
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

**Experienced Customer Service Representative – Southwest Airlines Remote Customer Support**

Remote Full-time

NextGen Medical Affairs Manager, THV

Remote Full-time

Experienced Data Entry Professional – Remote Full-Time Opportunity for Detail-Oriented Individuals to Join arenaflex Team

Remote Full-time

Customer Service Agent - Call Center

Remote Full-time

Experienced Remote Chat Coordinator - Nocturnal Customer Engagement & Support | $25-$35/hr | Flexible Night Shifts | blithequark

Remote Full-time

**Experienced Remote Data Entry Specialist – Administrative Assistant Opportunity at arenaflex**

Remote Full-time

Senior Medical Writer Oncology & ADC Focus (Remote/Fractional)

Remote Full-time

Commercial Sales Support Specialist - East Coast

Remote Full-time

Travel CT Technologist - $2,740 per week

Remote Full-time

Experienced Remote Data Entry and Inbound Sales Representative for Wayfair – Utilize Your Exceptional Communication Skills to Drive Sales and Customer Satisfaction in a Dynamic and Supportive Environment

Remote Full-time
← Back to Home