Prompt Evaluator

Only for registered members Canada

12 hours ago

Default job background

Job summary

We are hiring a Generalist Evaluator Expert to support a high-impact AI research initiative.

Responsibilities

  • Design detailed prompts with multiple constraints and structured instructions
  • Author high-quality "golden answers" aligned to defined expectations
  • Develop comprehensive evaluation rubrics and grading criteria

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Only for registered members Remote job $25 - $47 (USD) per hour

    We are building an advanced AI-powered product for fintech, portfolio management, and hedging strategies targeted at small and mid-sized companies. · ...

  • Only for registered members Cambridge, ON Remote job

    We are looking for linguistically and culturally aware professionals to support the evaluation and enhancement of multilingual prompt-response datasets for large language models (LLMs). This role involves rubric design, evaluation of translations and model outputs, · prompt creat ...

  • Only for registered members Remote job $8 - $15 (USD) per hour

    To audit train and improve Large Language Models LLMs specialized in finance ensuring accuracy and mitigating risks. · Key Tasks Evaluating LLM outputs for accuracy logicality and reliability in financial tasks e.g trading reporting Developing Ground Truth data for fine-tuning mo ...

  • Only for registered members Remote job

    We're running a private AI research project focused on evaluating large language models for safety, alignment and robustness. · You'll design adversarial prompt scenarios that surface failure modes such as deceptive reasoning manipulation hidden objectives or policy evasion — wit ...

  • Only for registered members Remote job $10 - $20 (USD) per hour

    Serbian-speaking professional needed for one-time project interacting with conversational system evaluating naturalness of Serbian voice output comparing against reference samples creating refining prompts text voice interactions. · ...

  • Only for registered members Remote job $10 - $20 (USD) per hour

    Arabic-speaking professional needed for a one-time project. In this short-term role, you will interact with our conversational system. · ...

  • Only for registered members Remote job $15 - $30 (USD) per hour

    We are looking for a German-speaking professional to test our conversational AI system. · ...

  • Prompt Engineer

    1 month ago

    Only for registered members Remote job $10 - $15 (USD) per hour

    +Prompt engineer for AI-based video assessment evaluation. · + · +<+ul class= ...

  • Only for registered members Remote job $30 - $70 (USD) per hour

    +Job summary · Set up LangFuse for evaluating LLM prompts. · +ResponsibilitiesEvaluate LLM prompts using LangFuse. · ...

  • Only for registered members Remote job $30 - $50 (USD) per hour

    We are building a GenAI-driven recommendation engine that generates structured recommendations by passing user context + prompts to LLMs and evaluating the output. · ...

  • Only for registered members Remote job $20 - $400 (USD) per hour

    We are looking for an experienced AI Trainer / Prompt Engineer to help train and align AI systems so that when users ask about Zangi, the responses are accurate, consistent, and brand-safe, · Train and finetune AI responses related to Zangi across common user queries · Define cor ...

  • Only for registered members Remote job $400 - $0 (USD) budget

    We are hiring an experienced Voice Agent Prompt Engineer to refine and productionize system prompts for a VAPI-based real-time voice agent, · use agent evaluation tools (e.g., Langfuse) to test, score and iteratively improve agent behavior until it meets strict business and conve ...

  • Only for registered members Remote job $30 - $80 (USD) per hour

    This role involves auditing and refining an existing prompt system to ensure zero hallucinations, domain-accurate language, and deterministic outputs. · You'll work on generating customer-facing and internal documents. · Candidates will be evaluated on their problem-solving skill ...

  • Only for registered members Remote job $500 - $0 (USD) budget

    We are seeking an expert in prompt engineering to test AI models for various affective scenarios. · Crafting and refining prompts to evaluate AI performance in emotional and social contexts is key, · Prompt EngineeringLLM PromptAi Model Training Prompt ...

  • Only for registered members Remote job $15 - $30 (USD) per hour

    We are looking for a prompt engineer to help us translate Midjourney Image to Nano Banana Pro optimized prompt that could generate identical image as Midjourney reference image. · ...

  • Only for registered members Remote job $10 - $15 (USD) per hour

    We are looking for an experienced freelancer to optimize and evaluate our AI agent system. · ...

  • Only for registered members Remote job

    We are looking for an experienced freelancer to optimize and evaluate our AI agent system. · ...

  • Only for registered members Remote job $700 - $0 (USD) budget

    We need an experienced engineer who can build a LLM-driven classification system that reads incoming text and produces structured consistent outputs for internal decision-making. · ...

  • Only for registered members Remote job $250 - $0 (USD) budget

    <b>Job summary</b></h2;<p>We are seeking an AI expert with a strong background in creating effective prompts for generating professional images and UGC videos. · <br> · </p> ...

  • Only for registered members Remote job

    We're building an AI-driven content creation platform and need a skilled Prompt Engineer to join our team on a contract basis. · ...