AI Training Data Researcher with RLHF

Canada

1 month ago

$25 - $40 (USD) per hour

Job summary

We're a stealth-mode AI training data startup launching Q. We're building longitudinal, causal, grounded human reasoning data for RLHF and Direct Preference Optimization (DPO)—training data designed specifically for AI alignment.

We are looking for an AI Training Data Researcher to support our ongoing research efforts in the field of artificial intelligence. The ideal candidate will have a deep understanding of modern training pipelines and be familiar with Georgia Tech's LEAF framework or similar lived experience research.





Similar jobs

  • I'm hiring a full-time (40 hours/week) assistant to help me execute and scale finance-focused RLHF / LLM evaluation contracts (model grading, rubric design, golden responses QA and reviewer feedback) · This work blends finance (valuation, accounting, markets) with AI evaluation (co ...

    $20 - $50 (USD) per hour Full time

    1 week ago

  • Work in company

    Artificial Intelligence Researcher

    Design, prototype, and deploy AI models to automate software tasks and deliver intelligent API services. · Conduct applied research with LLMs, GNNs, and Transformers in software, cybersecurity, or network API contexts. · ...

    Montreal, Quebec

    1 month ago

  • Work in company Remote job

    Software Engineer, AI

    We are looking for engineers with a strong command of Python to help train large language models (LLMs) to write production-grade code across a wide range of programming languages. · ...

    2 weeks ago

  • Work in company Remote job

    Senior AI Evaluation Lead

    Own the design and execution of contextual AI evaluations for real agent workflows, translating human judgment into clear, trusted evals that reflect production performance. · Specify · Work with partners to define what great means for one workflow · Measure · Design expert-grounde ...

    $15 - $22 (USD) per hour

    1 week ago

  • We're developing an AI-powered app for real-time meeting transcription and analysis with dual modes: End-user (standalone) and Corporate (enterprise teams). It uses AssemblyAI for transcription/speaker ID, RAG for contextual retrieval from past meetings, · Sentiment with enhanced ...

    $11,000 - $0 (USD) budget

    3 weeks ago

  • Work in company Remote job

    On-Premise Automated Clinical AI Pipeline

    Job Summary · We are seeking a Machine Learning Expert to build an end-to-end AI system to automatically convert clinical audio in Arabic into accurate transcripts and structured SOAP notes. The system must operate in a highly automated fashion with minimal human intervention, wh ...

    $20 - $59 (USD) per hour

    1 week ago

  • Work in company Remote job

    Sales Senior Director

    We are hiring a Sales Senior Director who will be a senior executive leader responsible for global enterprise revenue growth, market expansion, and strategic customer partnerships for LXT's AI training data and generative AI services. · Bachelor's degree in Business, Technology, or ...

    1 month ago

  • Work in company Remote job

    Technical assistant

    We need to hire a technical assistant internally to handle some tasks and to create internal tools for optimizing workflow. · ...

    $6 - $50 (USD) per hour

    1 month ago

  • Work in company Remote job

    Senior AI/ML Engineer for LLM Output Assessment

    We have a 14-month corpus of human-AI dialogue that includes full extended thinking block visibility. We need an expert reality check: Is what we're seeing genuinely unusual, or are we pattern-matching on noise? · We're paying for calibrated expert judgment. · Review curated convers ...

    $300 - $0 (USD) budget

    3 weeks ago

  • Work in company

    Researcher - Reinforcement Learning

    Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher. · ...

    Edmonton, Alberta, Canada

    1 week ago

  • Work in company

    Machine Learning Resident

    This is a paid residency in machine learning that will last for six months with potential hire afterwards. · Design models to capture health data across various types · Apply reinforcement learning methods for data collection processes · ...

    Edmonton

    4 days ago

  • Work in company

    Senior Lead Research Scientist, Agentic AI

    We're seeking a Senior Lead Research Scientist to push the frontier of autonomous, tool‑using AI and ensure that innovations make it into production. You'll split your time between novel research (benchmarks, learning algorithms, publications, and thought leadership) and building ...

    Toronto, Ontario, Canada

    4 days ago

  • Work in company

    Chemistry Specialist

    Mercor connects elite creative and technical talent with leading AI research labs. As a Chemistry AI Evaluator, you will write and refine prompts to guide model behavior in chemistry contexts. · PhD in Chemistry or a closely related field · Deep expertise in Organic & Biological ...

    Montreal

    6 days ago

  • Work in company

    Senior Financial Analyst | Up to 105/hr Hourly

    Mercor connects elite creative and technical talent with leading AI research labs. · Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey. · ...

    Montreal

    1 month ago

  • Work in company

    Physics PhD

    Mercor connects elite creative and technical talent with leading AI research labs. · Write and refine prompts to guide model behavior in physics contexts. · Evaluate LLM-generated responses to physics-related queries for conceptual accuracy and reasoning quality. · ...

    Montreal

    6 days ago

  • Work in company

    Conversational AI Specialist

    Mercor connects elite creative and technical talent with leading AI research labs. · Evaluate LLM-generated responses for effectiveness in answering user queries. · Conduct fact-checking using trusted public sources and external tools. · Bachelor's degree · Native speaker or ILR ...

    Montreal

    1 month ago

  • Work in company

    Electrical Engineering Consultant

    Mercor connects elite creative and technical talent with leading AI research labs. As an Engineering AI Evaluator, you will write and refine prompts to guide model behavior in engineering scenarios. · PhD in Engineering or a closely related field · Must-Have: Deep expertise i ...

    Montreal

    1 month ago

  • Work in company

    Linguist

    Evaluate LLM-generated responses for effectiveness in answering user queries. · Conduct fact-checking using trusted public sources and external tools. · Generate high-quality human evaluation data by ...

    Montreal

    1 month ago

  • Work in company

    Conversational AI Evaluator

    Mercor connects creative and technical talent with AI research labs. Evaluators are sought to assess responses generated by LLMs. · ...

    Montreal

    3 weeks ago

  • Work in company

    Quality Assurance Specialist

    Evaluate LLM-generated responses for effectiveness in answering user queries. Conduct fact-checking using trusted public sources and external tools. Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies. · Eval ...

    Montreal

    1 month ago