Prompt Evaluator
12 hours ago

Job summary
We are hiring a Generalist Evaluator Expert to support a high-impact AI research initiative.Responsibilities
- Design detailed prompts with multiple constraints and structured instructions
- Author high-quality "golden answers" aligned to defined expectations
- Develop comprehensive evaluation rubrics and grading criteria
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Prompt engineering and evaluation specialist
2 weeks ago
We are building an advanced AI-powered product for fintech, portfolio management, and hedging strategies targeted at small and mid-sized companies. · ...
GenAI - Prompt Evaluation - Rubric Definition
1 month ago
We are looking for linguistically and culturally aware professionals to support the evaluation and enhancement of multilingual prompt-response datasets for large language models (LLMs). This role involves rubric design, evaluation of translations and model outputs, · prompt creat ...
Finance LLM Reviewer/Evaluator prompt review
1 week ago
To audit train and improve Large Language Models LLMs specialized in finance ensuring accuracy and mitigating risks. · Key Tasks Evaluating LLM outputs for accuracy logicality and reliability in financial tasks e.g trading reporting Developing Ground Truth data for fine-tuning mo ...
AI Safety Researcher – Adversarial Prompt Design
1 month ago
We're running a private AI research project focused on evaluating large language models for safety, alignment and robustness. · You'll design adversarial prompt scenarios that surface failure modes such as deceptive reasoning manipulation hidden objectives or policy evasion — wit ...
Serbian Speaker for Testing Conversational AI
1 month ago
Serbian-speaking professional needed for one-time project interacting with conversational system evaluating naturalness of Serbian voice output comparing against reference samples creating refining prompts text voice interactions. · ...
Arabic Speaker for Testing Conversational AI
1 month ago
Arabic-speaking professional needed for a one-time project. In this short-term role, you will interact with our conversational system. · ...
German Speaker for Testing Conversational AI
1 month ago
We are looking for a German-speaking professional to test our conversational AI system. · ...
Prompt Engineer
1 month ago
+Prompt engineer for AI-based video assessment evaluation. · + · +<+ul class= ...
Set up LangFuse and help with LLM evals
1 month ago
+Job summary · Set up LangFuse for evaluating LLM prompts. · +ResponsibilitiesEvaluate LLM prompts using LangFuse. · ...
LLM Prompt Engineering
3 weeks ago
We are building a GenAI-driven recommendation engine that generates structured recommendations by passing user context + prompts to LLMs and evaluating the output. · ...
AI Prompt Engineer
4 weeks ago
We are looking for an experienced AI Trainer / Prompt Engineer to help train and align AI systems so that when users ask about Zangi, the responses are accurate, consistent, and brand-safe, · Train and finetune AI responses related to Zangi across common user queries · Define cor ...
Voice Agent Prompt Engineer
3 weeks ago
We are hiring an experienced Voice Agent Prompt Engineer to refine and productionize system prompts for a VAPI-based real-time voice agent, · use agent evaluation tools (e.g., Langfuse) to test, score and iteratively improve agent behavior until it meets strict business and conve ...
Senior AI Prompt Engineer for Construction AI
3 weeks ago
This role involves auditing and refining an existing prompt system to ensure zero hallucinations, domain-accurate language, and deterministic outputs. · You'll work on generating customer-facing and internal documents. · Candidates will be evaluated on their problem-solving skill ...
Prompt Engineering Expert for AI Model Testing
3 weeks ago
We are seeking an expert in prompt engineering to test AI models for various affective scenarios. · Crafting and refining prompts to evaluate AI performance in emotional and social contexts is key, · Prompt EngineeringLLM PromptAi Model Training Prompt ...
We are looking for a prompt engineer to help us translate Midjourney Image to Nano Banana Pro optimized prompt that could generate identical image as Midjourney reference image. · ...
AI Automation Expert needed
6 days ago
We are looking for an experienced freelancer to optimize and evaluate our AI agent system. · ...
AI Automation Expert needed
3 weeks ago
We are looking for an experienced freelancer to optimize and evaluate our AI agent system. · ...
We need an experienced engineer who can build a LLM-driven classification system that reads incoming text and produces structured consistent outputs for internal decision-making. · ...
<b>Job summary</b></h2;<p>We are seeking an AI expert with a strong background in creating effective prompts for generating professional images and UGC videos. · <br> · </p> ...
Prompt Engineer for AI-Powered Content Platform
1 month ago
We're building an AI-driven content creation platform and need a skilled Prompt Engineer to join our team on a contract basis. · ...