RLHF Specialist

Odixcity Consulting

📍 Mozambique, Mozambique, Mozambique

fullTime Business Operations Specialists

Job Description

Job Title: RLHF Specialist
Location: Remote (Worldwide)
Job Summary: An RLHF Specialist is responsible for improving and aligning AI models using Reinforcement Learning from Human Feedback (RLHF) methodologies. This role focuses on designing, implementing, and optimizing feedback pipelines that enhance model performance, safety, factual accuracy, and alignment with human values.
Responsibilities:
·        Generate high-quality preference data by comparing multiple model responses and ranking them based on criteria such as helpfulness, honesty, and harmlessness (HHH).
·        Design complex, multi-turn prompts to stress-test model behavior and expose weaknesses in reasoning or safety.
·        Write detailed “chain-of-thought” explanations and ra...
        

Apply for this Position

Job Details

Location Mozambique, Mozambique

Job Type fullTime

Category Business Operations Specialists

Posted March 02, 2026

Deadline April 11, 2026