Machine Learning Specialist for Inference Systems - Toronto - beBeePerformer

    beBeePerformer
    beBeePerformer Toronto

    3 weeks ago

    $1,500,000 - $3,000,000 (USD) per year

    Job title: LLM Inference Performance & Evals Engineer

    Description

    AI Inference Performance Engineer

    This role offers an opportunity to join a pioneering team dedicated to bringing up state-of-the-art models and to numerically validating and accelerating new model ideas on wafer-scale hardware.

    • Prototype architectural tweaks, build performance-eval pipelines, and turn hard numbers into changes that land in production.

    Key Responsibilities



    • drafts pull-requests.

    • to experience the full stack of software/hardware innovation.

    • Keep pace with the latest open- and closed-source models; run them first on wafer scale
      to expose new optimization opportunities.

    Skill Requirements

    This position requires solid grounding in Transformer math (attention scaling, KV-cache, quantisation), or clear evidence that you learn this material rapidly.


    Prior experience building high-performance ML or systems software, and strong debugging skills across performance, numerical accuracy, and runtime integration.

  • Only for registered members Toronto, Ontario

    We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. · Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and o ...

  • Only for registered members Toronto $170,000 - $275,000 (CAD)

    We are seeking highly skilled software engineers to build AI inference systems. You'll architect and implement high-performance stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across environments. · You'll collaborate with teams to push ...

  • Only for registered members Toronto, ON Remote job

    We're hiring a Senior Full-Stack Engineer to own the evolution of our user-facing platform — the model catalog, documentation, and tools that make Featherless the best place to find and use AI models. · ...

  • Only for registered members Toronto, Ontario

    The Research Scientist Intern is a position within the AI & Compute Foundation organization. The team's mission is to explore, develop, and help bring into production software and hardware technologies for data-center-scale AI. · ...

  • AI Systems

    1 month ago

    Only for registered members Toronto, Ontario Remote job

    We believe efficiency is what makes AI possible: it's how we expand access and ensure innovation benefits the many, not the few. · We're looking for builders and creative thinkers ready to shape the next era of intelligence. · The Role · You'll work directly with our founders to de ...

  • Only for registered members Toronto, ON Remote job

    We're looking for talented individuals who will grow robotic deliveries from surprising novelty to efficient ubiquity. · ...

  • Only for registered members Toronto, Ontario

    We are looking for highly skilled engineers with a focus on C/C++, low level systems, performance and power optimization to join our team full-time. · Innovate on the inference optimization pipeline through algorithmic and system optimization · Own end to end system characterizat ...

  • Lead Scientist

    3 weeks ago

    Only for registered members Toronto, ON

    We're looking for an Applied ML Lead Scientist who blends predictive modelling and complex analysis with AI strategy and scalable delivery. You will own high-impact models (forecasting, optimization, causal inference) and shape the patterns, platforms, and guardrails ...

  • Only for registered members Toronto, ON

    We're looking for an Applied Scientist to work at the intersection of applied research and production AI, with a strong focus on knowledge-centric AI systems, graph-based learning, and advanced Retrieval-Augmented Generation (RAG) architectures. · ...

  • Only for registered members Toronto, ON Remote job

    · Join the team to transform how healthcare is delivered for chronic and specialty diseases worldwide. · Advance early detection. · Mobilize and define the strategy. · ...

  • Only for registered members Toronto, Ontario, Canada

    Cerebras Systems builds the world's largest AI chip. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip. · Build performance models to estimate state-of-the-art ML model performance. · Optimize kernel micro code and compiler algorithms ...

  • Only for registered members Toronto, Ontario, Canada

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, · ...

  • Only for registered members Toronto

    We are seeking individuals passionate about tackling challenges and are driven by execution. · ...

  • Only for registered members Toronto, Ontario

    We are seeking highly motivated and skilled systems engineers to join our team and help develop an AI platform that offers efficient infrastructure for inference and training of large-scale models. · Taking part in the development of NVIDIA's AI platform for training, fi ...

  • Only for registered members Toronto, ON Remote job

    Nelson Education Ltd. is seeking a Senior Back-end Software Developer to instrument, deploy, and improve multiple public-facing web apps in their cloud-based Kubernetes clusters or Cloudflare edge workers. · Remote-first · Flexible working time · Compensation aligned with exp ...

  • Only for registered members Canada, Toronto

    We are now looking for a Senior Machine Learning Applications and Compiler Engineer. NVIDIA is seeking engineers to develop algorithms and optimizations for our inference and compiler stack. This is your chance to be part of something outstandingly innovative. We are looking for a ...

  • Only for registered members Toronto, ON

    A software development manager leads a team in building measurement services for Amazon Ads. · Owns the technical vision and direction for measurement services, identifying and mitigating risks, constructing project schedules, and preparing status reports. · ...

  • Only for registered members Canada, Toronto

    We are seeking highly motivated and skilled systems engineers to join our team and help develop an AI platform that offers efficient infrastructure for inference and training of large-scale models. · Taking part in the development of NVIDIA's AI platform for training, fi ...

  • QE, Automation

    1 month ago

    Only for registered members Toronto, ON

    We are seeking an experienced Automation Engineer for our team. · The ideal candidate will have 5-7 years of experience in software testing and test automation, with expertise in object-oriented programming using core Java. · ...

  • Only for registered members Toronto, Ontario Remote job

    We are seeking a highly skilled Machine Learning Engineer to join our advanced model development team. This role focuses on pre-training, continued training, and post-training of models. · ...

  • Only for registered members Toronto, ON

    This onsite contract role supports Client's wide-ranging business information requirements by implementing and supporting business intelligence software solutions. · Develops and supports customized query applications which analyze key business data · Develops and supports guided ...
