Labelbox is seeking an Applied Research Engineer to develop cutting-edge systems for creating and leveraging high-quality human-in-the-loop data. The role involves designing and implementing advanced systems that integrate human feedback into AI training processes, using methods such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO). The engineer will also work on techniques to measure and improve the quality of human-generated data, and will develop AI-assisted tools to enhance the data labeling process.
Responsibilities:
Requirements:
Labelbox offers:
Labelbox is a company building critical infrastructure for developing AI models. They offer an integrated platform with advanced annotation tools, workflow automation, and quality control systems. Labelbox also provides a specialized data labeling service leveraging subject matter experts and an expert marketplace connecting AI teams with skilled annotators. Focused on data-centric approaches, Labelbox serves leading research labs and enterprises, shaping the future of artificial intelligence by enabling high-quality training data at scale.