Lead applied research initiatives focused on AI evaluation, reliability, and robustness, defining success metrics tied to customer impact and production readiness.
Design and validate methods to measure and mitigate AI reliability risks, including uncertainty estimation, hallucination detection, and identification of model failure modes.
Partner cross-functionally with engineering, data science, and product teams to integrate research outcomes into customer-facing AI systems and workflows.
Own research projects end to end, from problem framing and hypothesis development through experimentation, prototyping, and synthesis of results.
Influence technical direction across teams by surfacing insights, proposing scalable solutions, and aligning stakeholders on priorities and tradeoffs.
Mentor researchers and engineers through technical guidance, feedback, and collaborative leadership on shared initiatives.
Contribute to Upwork's external research footprint through publications, presentations, and engagement with the broader AI research community.
Proven experience leading applied AI research that balances scientific rigor with real-world deployment constraints and business impact.
A strong record of research contribution through publications, internal innovation, or demonstrable influence on production AI systems.
Deep proficiency with Python and modern deep learning frameworks such as PyTorch, with hands-on experience evaluating and improving large-scale models.
An adaptive approach to integrating AI tools into research and development workflows to accelerate experimentation, improve evaluation quality, and share best practices with others.
A collaborative, growth-oriented mindset with the ability to mentor peers, communicate complex ideas clearly, and thrive in a fast-evolving, bottom-up environment.
- Only for registered members Toronto, Ontario, Canada
The AI Foundations team leads core research and development across the training, evaluation, · and deployment of AI systems that power Uma. · Lead applied research initiatives focused on AI evaluation, · reliability, and robustness. · ...
- Only for registered members Toronto, Ontario, Canada
The AI Foundations team leads core research and development across the training, evaluation, and deployment of AI systems that power Uma, Upwork's flagship AI model. · ...
- Only for registered members Toronto Full time
We are seeking a Sr Lead AI Research Scientist focused on AI Evaluation and Reliability, who will drive highimpact research initiatives that improve the trustworthiness robustness and realworld performance of AI systems operating at marketplace scale. · ...
- Centene Corporation Toronto Remote job with geographical restriction Full time $25 - $38 (CAD) per hour
We are seeking a highly organized and detail-oriented Virtual Assistant to provide remote administrative, customer service, and data entry support to our Claims Coordination Team. This is a fully remote position. All necessary equipment will be provided. Candidates must have acce ...
-
Manager, Corporate Reliability
4 weeks ago
Only for registered members Toronto, ONWe are a leading manufacturer of premium quality tissue products · . We pride ourselves in our commitment to providing the best quality products and exceptional service to our consumers and customers. · Lead the development, deployment, and ongoing maintenance of the Key Elements ...
- Only for registered members Toronto, ON Remote job
We are hiring a Lead/Senior Product Manager to own the Analytics, Reporting, and Conversational BI product surfaces in Agentic Studio—the measurement and trust layer that helps enterprises build, operate, and continuously improve agentic customer experiences across chat, mobile, ...
-
Machine Learning Engineer
1 month ago
Only for registered members Toronto, ON+Job summary · We are a clinician-led healthcare organization building an applied AI system to support an established clinical workflow. · +Implement machine learning models based on defined architecture and requirements · Develop data pipelines that support transient data proces ...
-
Roofing Sales Representative
2 weeks ago
Only for registered members Toronto, ONThe Roof Whisperer is a growing roofing and exterior services company serving homeowners across the Greater Toronto Area. We are looking for a motivated Roofing Sales Representative to join our team and help homeowners make confident, informed decisions about their roofing needs. ...
-
Bitcoin Mining: Senior ASIC Test
1 day ago
Only for registered members Toronto, ONJob summaryWe're building Bitkey, a simple and safe self-custody bitcoin wallet that will put customers in control, · as well as hardware and software that will help decentralize bitcoin mining and enable new and innovative use cases for bitcoin mining. · ...
-
Maximo Solutions Architect 0487-3017
2 weeks ago
Only for registered members Toronto, ONA Maximo Solutions Architect is needed to work with Business users to review processes and practices. · A Maximo Solutions Architect is needed to work with Business users to review processes and practices. · Toronto Water (TW) is looking for an IBM Maximo Solution Architect. · ...
-
Data Engineer, Financial Engineering
1 month ago
Only for registered members Toronto, ON Remote jobWe are looking for a mid level Data Engineer to join Spotify's Finance Engineering organization. · ...
-
Senior AI
1 month ago
Only for registered members Toronto, ONWe are seeking a Senior AI & Agent Systems Engineer to design, build, and operate production-grade AI systems embedded within enterprise software platforms. · 7+ years of professional software engineering experience · Strong proficiency in Java (non-negotiable) · Strong proficien ...
-
Quality Assurance Technology Enablement Analyst
1 month ago
Only for registered members Toronto, ONAir Canada seeks a QA Technology Enablement Analyst to deliver non-functional testing solutions and ensure robust risk assessment and diagnostics using enterprise and open-source tools. · ...
-
In-Service Engineering
3 weeks ago
Only for registered members Toronto, ONProvide Technical support and drive aircraft recovery action plans. Champion aircraft reliability as well as identify safety efficiency and cost improvement opportunities. · A graduate of an accredited Transport Canada approved Aircraft Maintenance Engineer program is required. · ...
-
AI Training
1 month ago
Only for registered members Toronto, ON Remote job+We're looking for AI Trainer – Lawyers to help train and evaluate cutting-edge AI models using real legal expertise. · ...
-
Head of Product
1 month ago
Only for registered members Toronto, ONBe the first product leader and ship what defines the category at Boam AI. · We are a well-funded, fast-growing B2B startup building managed data agents that transform messy signals into structured intelligence on SMBs and enterprises worldwide. · ...
-
AI Trainer
1 month ago
Only for registered members Toronto, ON Remote jobJob summary · Prolific busca a personas con experiencia avanzada en árabe para entrenar y evaluar modelos de IA. · Se requiere experiencia verificable como hablante avanzado en árabe. · ...
-
AI Trainer
1 month ago
Only for registered members Toronto, ON Remote job+ AI Trainer - Advanced Japanese Fluency · About Prolific · Prolific is not just another player in the AI space – we are building the biggest pool of quality human data in the world. · Over 35,000 AI developers, researchers and organizations use Prolific to gather data from paid ...
-
AI Trainer
1 month ago
Only for registered members Toronto, ON Remote jobWe're looking for Advanced Korean Speakers to help train and evaluate cutting-edge AI models. · About Prolific · Prolific is not just another player in the AI space – we are building the biggest pool of quality human data in the world. · We're looking for Advanced Korean Speakers ...
-
AI Trainer
1 month ago
Only for registered members Toronto, ON Remote jobWe're looking for Advanced Mandarin Speakers to help train and evaluate cutting-edge AI models.Prolific is not just another player in the AI space – we are building the biggest pool of quality human data in the world. · You must be prepared to complete paid tasks that require one ...
-
Chief Technology Officer
4 weeks ago
Only for registered members Toronto, ONThe Chief Technology Officer (CTO) provides visionary leadership and strategic direction for the hospital's digital infrastructure,cybersecurity,and telecommunications systems. · ...
Sr AI Research Scientist, AI Evaluation and Reliability - Toronto - Upwork
Description
Overview
Upwork Inc.'s (Nasdaq: UPWK) family of companies connects businesses with global, AI-enabled talent across every contingent work type including freelance, fractional, and payrolled. This portfolio includes the Upwork Marketplace, which connects businesses with on-demand access to highly skilled talent across the globe, and Lifted, which provides a purpose-built solution for enterprise organizations to source, contract, manage, and pay talent across the full spectrum of contingent work. From Fortune 100 enterprises to entrepreneurs, businesses rely on Upwork Inc. to find and hire expert talent, leverage AI-powered work solutions, and drive business transformation. With access to professionals spanning more than 10,000 skills across AI & machine learning, software development, sales & marketing, customer support, finance & accounting, and more, the Upwork family of companies enables businesses of all sizes to scale, innovate, and transform their workforces for the age of AI and beyond.
Since its founding, Upwork Inc. has facilitated more than $30 billion in total transactions and services as it fulfills its purpose to create opportunity in every era of work. Learn more about the Upwork Marketplace at and follow us on LinkedIn, Facebook, Instagram, TikTok, and X; and learn more about Lifted at Go-Lifted and follow on LinkedIn.
Sr. Lead AI Research Scientist, AI Evaluation and Reliability
The AI Foundations team leads core research and development across the training, evaluation, and deployment of AI systems that power Uma, Upwork's flagship AI model, and other customer-facing generative AI capabilities. As a Sr. Lead AI Research Scientist focused on AI Evaluation and Reliability, you will drive high-impact research initiatives that improve the trustworthiness, robustness, and real-world performance of AI systems operating at marketplace scale.
At the Sr. Lead level, this role combines deep technical expertise with cross-functional leadership. You will identify and lead research efforts that address systemic reliability challenges, partner closely with engineering and product teams to translate research into production outcomes, and help shape how Upwork evaluates AI performance in real work scenarios. Your work will support AI systems embedded in retrieval-based workflows, agentic architectures, and human plus AI collaboration patterns, while contributing to Upwork's broader AI research strategy and external presence.
Responsibilities
Qualifications
This position will initially be employed through a partner to ensure a seamless hiring process while we establish the hub. Once the hub is established, there may be opportunities to transition to employment with Upwork depending on business needs and other requirements. While employed by the partner, you'll work as part of Upwork's team, with access to our resources, culture, and growth opportunities.
Upwork is an Equal Opportunity Employer committed to recruiting and retaining a diverse and inclusive workforce. We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, or other legally protected characteristics under federal, state, or local law.
Please note that a criminal background check may be required once a conditional job offer is made. Qualified applicants with arrest or conviction records will be considered in accordance with applicable law, including the California Fair Chance Act and local Fair Chance ordinances. The Company is committed to conducting an individualized assessment and giving all individuals a fair opportunity to provide relevant information or context before making any final employment decision.
To learn more about how Upwork processes and protects your personal information as part of the application process, please review our Global Job Applicant Privacy Notice
#J-18808-Ljbffr
-
Sr AI Research Scientist, AI Evaluation and Reliability
Only for registered members Toronto, Ontario, Canada
-
Sr AI Research Scientist, AI Evaluation and Reliability
Only for registered members Toronto, Ontario, Canada
-
Sr AI Research Scientist, AI Evaluation and Reliability
Full time Only for registered members Toronto
-
Virtual Assistant – Remote Customer Service & Data Entry
Full time Centene Corporation- Toronto
-
Manager, Corporate Reliability
Only for registered members Toronto, ON
-
Lead / Senior Product Manager Analytics, Evals & Conversational BI (Agentic Studio)
Only for registered members Toronto, ON
-
Machine Learning Engineer
Only for registered members Toronto, ON
-
Roofing Sales Representative
Only for registered members Toronto, ON
-
Bitcoin Mining: Senior ASIC Test
Only for registered members Toronto, ON
-
Maximo Solutions Architect 0487-3017
Only for registered members Toronto, ON
-
Data Engineer, Financial Engineering
Only for registered members Toronto, ON
-
Senior AI
Only for registered members Toronto, ON
-
Quality Assurance Technology Enablement Analyst
Only for registered members Toronto, ON
-
In-Service Engineering
Only for registered members Toronto, ON
-
AI Training
Only for registered members Toronto, ON
-
Head of Product
Only for registered members Toronto, ON
-
AI Trainer
Only for registered members Toronto, ON
-
AI Trainer
Only for registered members Toronto, ON
-
AI Trainer
Only for registered members Toronto, ON
-
AI Trainer
Only for registered members Toronto, ON
-
Chief Technology Officer
Only for registered members Toronto, ON