REMOTE::Senior Software Engineer LLM Evaluation :: AI-generated @ US ,Western Europe

Remote Full-time
Title: Senior Software Engineer LLM Evaluation
Duration: Long term ( depends on candidates performance)

Work Type: Remote ( hybrid or onsite depending on candidate s location)

Multiple openings

Key skills: Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go

Project Overview:

As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like?
• Working on AI model training initiatives by curating code examples, building solutions, and correcting code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
• Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable.
• Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
• Build agents that can verify the quality of the code and identify error patterns.
• Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
• Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills:
• Several years of software engineering experience (+5 years), including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
• Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.
• Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
• Excellent oral and written communication skills for clear, structured evaluation rationales.

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

Remote Work from Home Data Entry Computer Job - Part Time Market Research Assistant with Opportunities for Career Growth and Professional Development

Remote Full-time

**Experienced Entry-Level Chat Support Specialist – Remote Customer Service Representative**

Remote Full-time

Experienced Remote Review Blog Writer for Careers Remote - Flexible Hours, Competitive Pay

Remote Full-time

**Senior Account Executive – Digital Solutions and Client Growth**

Remote Full-time

Amazon Work At Home (Data Entry) Jobs No Experience ? Hiring Now ? Hire Me Remotely

Remote Full-time

[Remote] Infra architect with azure migration

Remote Full-time

NERA - Associate Analyst/Analyst - International Arbitration (WDC)

Remote Full-time

Regional Sales Manager

Remote Full-time

Key Account Management Professional Healthcare (f/m/d) (fixed-term)

Remote Full-time

**Experienced Part-Time Remote Data Entry Specialist – Market Research and Data Insights**

Remote Full-time
← Back to Home