ML/NLP Engineer Needed – Low-Resource Language AI & Speech Project

Remote Full-time
Hello,

We are launching a language technology project for Chimini, a low-resource Bantu language, and are seeking an ML/NLP engineer to help us design and implement the foundational phase of the project.

Long-Term Vision

Our long-term goal is to build:

A structured Chimini text + audio corpus

A scalable API layer for integration into our own applications

Eventually, speech-to-text and text-to-speech capability in Chimini

Chimini is historically related to Swahili, but we do not yet know how structurally similar they are. Pronunciation may differ significantly, which may impact model transfer for speech systems.

We currently have:

Written texts

Audio recordings

Access to native speakers for transcription and validation

Phase 1 (3–6 Months)

The objective of Phase 1 is to build a strong ML-ready foundation, including:

Designing a scalable database structure for text and audio

Preparing and structuring data for NLP workflows

Building a clean corpus pipeline (segmentation, transcription storage, metadata)

Advising on whether Chimini–Swahili linguistic comparison should be conducted before leveraging transfer learning

Evaluating potential approaches:

Fine-tuning multilingual models

Embedding-based retrieval systems

LLM + RAG architectures

Longer-term speech model strategy

We want the system designed from the beginning to support future ML training and experimentation.

Responsibilities

Define ML/NLP strategy for a low-resource language

Recommend architecture for scalable corpus and training workflows

Implement foundational data pipelines

Advise on transfer learning feasibility from Swahili or multilingual models

Provide phased roadmap (short-term vs long-term)

Ideal Experience:

NLP for low-resource or multilingual languages

Speech systems (ASR/TTS)

Fine-tuning transformer models

Embeddings and vector databases

Designing ML pipelines for scalable experimentation

We will handle data collection, transcription, and language validation.

Please include:

Relevant ML/NLP experience

Proposed high-level technical approach

Estimated timeline for Phase 1

Availability

We are looking for someone who can help architect this correctly from the start, with long-term ML scalability in mind.

Best regards,

Apply tot his job

Apply To this Job
Apply Now

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote Full-time

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote Full-time

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote Full-time

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote Full-time

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote Full-time

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote Full-time

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote Full-time

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote Full-time

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote Full-time

USPS Office Helper

Remote Full-time

**Experienced Full Stack Data Entry Specialist – Remote Homebased Data Entry Solutions for arenaflex**

Remote Full-time

[Remote] Administrative Assistant 2

Remote Full-time

**Experienced Full Stack Data Analyst – Web & Cloud Application Development**

Remote Full-time

Virtual Admin Jobs – Flexible, Home-Based Work

Remote Full-time

Inside Channel Manager

Remote Full-time

**Experienced Work from Home Data Entry Clerk – Full & Part Time Opportunity**

Remote Full-time

Licensing & Registration Manager (Remote, USA)

Remote Full-time

Experienced Data Analyst II – arenaflex Data Ventures and Business Intelligence Development

Remote Full-time

**Fresh Foods Customer Service Representative – blithequark Store**

Remote Full-time

**Experienced Remote Customer Service Representative - American Airlines $25/Hour - Join Our Global Team and Explore Endless Opportunities**

Remote Full-time
← Back to Home