AI Agent Architect Needed: GAIA Benchmark Optimization (31.5% → 86%)

Only for registered members Canada

1 month ago

$2,000 (USD) budget
We need an experienced AI agent engineer to diagnose and fix architectural issues causing our CustomGPT Manus clone to underperform on the GAIA benchmark.

Current Performance:

  • Overall: 31.5% (target: 86%)
  • Level 1: 37.7% (target: 86.5%)
  • Level 2: 33.7% (target: 70.1%)
  • Lev ...

Similar jobs

  • Work in company Remote job

    Performance & Optimization Specialist (Profile/Benchmark + Speed Up C# Code)

    We are looking for an experienced performance engineer to make existing code significantly faster and more efficient. · Proven experience shipping performance improvements (not just “clean code”) · Strong profiling/diagnostics skill: you can explain why it's slow · ...

    $400 (USD) budget

    1 month ago

  • Work in company Remote job

    Complete a Software Coding Challenge

    This challenge involves optimizing a kernel implementation in KernelBuilder.build_kernel. Your goal is to optimize it as much as possible. · Python · ...

    $500 (USD) budget

    4 weeks ago

  • Work in company Remote job

    Technical Creative Writing Benchmark Developer for LLMs

    We are seeking a skilled Technical Creative Writing Benchmark Developer to help us benchmark large language models (LLMs) for 30 hours per week. · Mandatory skills: · Creative Writing · Content Writing · Search Engine Optimization · Writing ...

    $12 - $55 (USD) per hour

    1 month ago

  • Work in company

    Principal Consultant, IT Project Benchmark Services

    The Principal Consultant, IT Project Benchmark Services (ITPB) is a senior-level advisor and delivery leader responsible for directing complex benchmarking, performance analytics, and delivery optimization engagements. · The leader will develop methodologies, oversee client engag ...

    Toronto

    3 weeks ago

  • Work in company Remote job

    LLM Evaluation, Benchmarking

    We're looking for an LLM Evaluation, Benchmarking & Experimentation Engineer to rigorously test our proprietary LLM API and build the infrastructure for systematic model improvement. · ...

    2 weeks ago

  • Work in company Remote job

    Strategy Management Office Set-Up and Benchmark

    We are seeking to engage a consultant to define the optimal structure and operating model for a Strategy function within a large-scale real estate development company. ...

    $20 - $125 (USD) per hour

    1 week ago

  • The International Public Fixed Income Team Lead oversees passive fixed income investment strategies across international markets. · Ensures optimal sectoral geographic maturity exposure while managing risks. · ...

    Toronto

    3 weeks ago

  • Work in company Remote job

    Linear Programming Expert for Shiny Solver

    We are seeking an expert in linear programming to enhance a Solver tool built in Shiny. · ...

    $150 (USD) budget

    1 month ago

  • Work in company Remote job

    Consultant Needed for Commercial PV System Performance Benchmarking – MENA

    We are seeking an experienced consultant to assist with performance benchmarking of commercial photovoltaic (PV) systems in the MENA region. · ...

    $100 (USD) budget

    1 month ago

  • Work in company

    Benchmark Consulting Manager

    The Benchmark Consulting Manager is a data-driven leader responsible for managing and delivering enterprise benchmarking engagements. · Responsibilities: Lead benchmarking engagements end-to-end, including scoping, project management, data collection, and analysis. · ...

    Toronto

    3 weeks ago

  • Work in company Remote job

    Expert On-Device LLM Engineer needed for STT Chunking

    Overview: · We are looking for an experienced AI/ML Engineer specializing in On-Device inference (Edge AI) to optimize the AI pipeline of our React Native tablet application. · Currently, our app uses a Speech-to-Text (STT) system that feeds transcriptions into a local LLM (gemma ...

    $600 (USD) budget

    6 days ago

  • Work in company Remote job

    Procurement Cost and Efficiency Benchmarks

    I am developing a B2B procurement analytics product and require benchmark data to support financial analysis, focusing on procurement cost and efficiency improvements. · ...

    1 month ago

  • We are seeking an experienced performance analytics professional to independently verify a historical performance comparison between a rule-based systematic trading strategy and the S&P 500. · Demonstrated experience in investment performance analysis for systematic strategies, h ...

    1 month ago

  • Work in company

    Senior Software Engineer, AI Inference Systems

    We are seeking highly skilled software engineers to build AI inference systems. You'll architect and implement high-performance stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across environments. · You'll collaborate with teams to push ...

    Toronto $170,000 - $275,000 (CAD)

    1 month ago

  • Work in company

    Senior Software Engineer, AI Inference Systems

    We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry ben ...

    Toronto $170,000 - $275,000 (CAD)

    2 days ago

  • Work in company Remote job

    Technical SEO Strategist as Freelancer

    REQUIREMENT BRIEF · 1. Project Scope: · Improve Core Web Vitals · Page speed optimization · JS/CSS optimization · Lazy loading strategy · Image optimization · Reduce render-blocking scripts · 2. Technical/Non-Technical Requirements: · ...

    $15 - $50 (USD) per hour

    22 hours ago

  • Work in company Remote job

    ASO & Paid Ads Expert for IPTV App Launch (iOS & Android)

    I am launching an IPTV app on iOS and Android and need support with ASO, paid user acquisition, and monetization. · ...

    3 weeks ago

  • Work in company Remote job

    Senior Pine Script Quant for strategy audit, research and optimization

    I am hiring a senior-level Pine Script quant to audit, research and optimize an advanced crypto strategy script for live deployment. This is a fast-iteration engagement focused on structural robustness, statistical integrity and production-grade implementation. · The script shows ...

    2 days ago

  • Work in company Remote job

    Expert Google Ads Auditor Needed for UK Automotive PPC Audit

    We are a UK-based used/luxury car dealership seeking a Google Ads expert to perform a full account audit and provide a detailed optimization plan. Our campaigns currently run on a modest budget, and we need an experienced specialist to identify gaps, inefficiencies, and opportuni ...

    $10 (USD) budget

    3 days ago

  • Work in company Remote job

    SEO Analysis for B2B Website

    We are seeking an experienced SEO specialist to conduct a comprehensive SEO analysis for our B2B website. Highlight technical SEO issues, on-page optimization opportunities, page speed improvements, internal linking gaps, ...

    $10 (USD) budget

    1 month ago