[Remote] AI Trainer Jobs in Canada
Note: This is a remote job open to candidates in the USA.

Rex.zone is seeking AI Trainers to improve large language models through various evaluation methods and data labeling. The role involves performing RLHF evaluations, executing prompt evaluations, and ensuring the quality of training data.

Responsibilities
- Perform RLHF evaluations (pairwise ranking, rubric-based scoring) and write clear rationales
- Execute prompt evaluation for instruction-following, factuality, and safety
- Label and validate datasets for NLP and content safety labeling
- Run QA evaluation checks (consistency, agreement, systematic error discovery)
- Document edge cases and build error taxonomies to drive model performance improvement
- Collaborate on rubrics, gold sets, calibration, and regression testing for model updates

Skills
- Experience with structured evaluation and guideline-driven judgment
- Strong writing and documentation for rationales and edge-case notes
- Familiarity with RLHF, LLM evaluation, and prompt evaluation workflows
- Comfort with data labeling, QA evaluation, and training data quality processes
- Bonus: multilingual evaluation, NER/classification tasks, or multimodal evaluation

Company Overview
RemoExperts (by Abaka AI) is a global platform where skilled professionals and tutors contribute to AI training across video, image, audio, text, code, math, and more. It was founded in 2025 and is headquartered in Palo Alto, California, US, with a workforce of 201-500 employees. Its website is https://www.remoexperts.com.
Apply To This Job