Full Stack LLM Engineer - Toronto - Cerebras

    Cerebras
    Cerebras Toronto

    5 days ago

    Description

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users to effortlessly run large‑scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

    Cerebras' current customers include top model labs, global enterprises, and cutting‑edge AI‑native startups. OpenAI recently announced a multi‑year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high‑speed inference.

    Thanks to the groundbreaking wafer‑scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU‑based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real‑time iteration and increasing intelligence via additional agentic computation.

    About the Role


    We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state‑of‑the‑art open‑source models (like LLaMA, Qwen, etc) or customer‑provided proprietary models on our Cerebras CSX systems. Success in this role requires a system‑minded generalist who thrives in fast‑paced bring‑up environments and is comfortable working across the entire Cerebras software stack.

    Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

    Responsibilities

    • Contribute to the end‑to‑end bring up of ML models on Cerebras CSX systems.
    • Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
    • Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.
    • Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

    Skills & Qualifications

    • Bachelor's, Master's, or PhD in Computer Science, Engineering, or a related field.
    • Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.
    • Strong debugging skills across performance, numerical accuracy, and runtime integration.
    • Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).
    • Proficiency in C/C++ programming and experience with low‑level optimization.
    • Proven experience in compiler development, particularly with LLVM and/or MLIR.
    • Strong background in optimization techniques, particularly those involving NP‑hard problems.

    What We Offer

    • Competitive salary and benefits package.
    • Opportunities for professional growth and career advancement.
    • A dynamic and innovative work environment.
    • The chance to work on cutting‑edge technologies and make a significant impact on the future of AI.

    This offer is contingent upon Cerebras successfully obtaining an export license from the U.S. Department of Commerce's Bureau of Industry and Security authorizing the release to you of certain software source code and/or technology that is subject to the Export Administration Regulations. However, we can make no assurances with respect to the final disposition of an export license application.

    Why Join Cerebras

    • Build a breakthrough AI platform beyond the constraints of the GPU.
    • Publish and open source their cutting‑edge AI research.
    • Work on one of the fastest AI supercomputers in the world.
    • Enjoy job stability with startup vitality.
    • Our simple, non‑corporate work culture that respects individual beliefs.

    Read our blog: Five Reasons to Join Cerebras in 2026.


    Apply today and become part of the forefront of groundbreaking advancements in AI

    Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.


    #J-18808-Ljbffr

  • Work in company

    LLM Engineer

    Only for registered members

    We are actively ramping up for · Agentic AI · and LLM Engineer positions as our clients expand into autonomous AI systems, agent-based workflows and large-scale model integration. · ...

    Toronto, Ontario

    1 month ago

  • Work in company

    Full Stack LLM Engineer

    Only for registered members

    · Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver ind ...

    Toronto, Ontario, Canada

    12 hours ago

  • Work in company

    Full Stack LLM Engineer

    Only for registered members

    About the Role/We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX s ...

    Toronto

    3 weeks ago

  • Work in company

    Full Stack LLM Engineer

    Only for registered members

    + Full Stack LLM EngineerCerebras Systems builds the world's largest AI chip, · Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, · This approach allows Cerebras to deliver industry-leading training and inference speeds · and emp ...

    Toronto Full time

    2 weeks ago

  • Work in company Remote job

    Sr. LLM Engineer ( 100 Remote )

    Only for registered members

    We are seeking GenAI Developer with LLM expertise to join our AI Tools team focused on building advanced integrations and connectors across leading Generative AI platforms. · ...

    Toronto, Ontario

    2 weeks ago

  • Work in company

    LLM Serving Engineer

    Only for registered members

    Company · Qualcomm Technologies, Inc. · Job Area · Engineering Group, Engineering Group > Machine Learning Engineering · General Summary · LLM Serving Engineer (Cloud AI Engineering) · Qualcomm is utilizing its traditional strengths in digital wireless technologies to play a cent ...

    Markham $158,400 - $237,600 (USD)

    4 days ago

  • Work in company

    LLM Serving Engineer

    Only for registered members

    We are hiring LLM Serving Engineers at multiple levels to join our dynamic, collaborative team. · This role spans the full product lifecycle—from cutting-edge research and development to commercial deployment—and demands strategic thinking, · strong execution, and excellent commu ...

    Markham $158,400 - $237,600 (USD)

    3 weeks ago

  • Work in company

    Sr. LLM Engineer ( 100 Remote )

    Only for registered members

    We are seeking GenAI Developer with LLM expertise to join our AI Tools team. · Design and develop GenAI-powered integrations and connectors for enterprise use cases. · Build and enhance platform-specific connectors for tools such as Microsoft Copilot. · Implement MCP (Model Conte ...

    Toronto

    2 weeks ago

  • Work in company Remote job

    Senior Machine Learning Engineer, LLM Compressor and Quantization

    Only for registered members

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model quantization and sparsification,Contri ...

    Toronto, Ontario

    1 month ago

  • Work in company

    Generative AI Engineer – Python, LLM, Agentic Workflows

    Astra North Infoteck Inc.

    · Experience Required: 8-10 · Role Overview · The Senior AI Engineer will lead the design, development, and deployment of enterprise scale Generative AI solutions. This role involves ownership of system architecture, agentic workflow design, end to end GenAI pipelines, and high ...

    Toronto

    1 day ago

  • At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise.As leading developers maintainers of the vLLM project and inventors of state-of-the-art techniques for model quantization and sparsification ...

    Toronto $170,770 - $281,770 (USD)

    1 month ago

  • At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · Contribute to the design, development, and testing of various inference optimization algorithms in the vLLM LLM-compressor project · Crea ...

    Toronto

    1 month ago

  • This is an exciting position for a talented Machine Learning Engineer to join a dynamic team designing intelligent systems that fuel core platforms.We thank all candidates in advance. · ...

    Greater Toronto Area

    1 month ago

  • This is an exciting position for a talented Machine Learning Engineer to join a dynamic team designing intelligent systems that fuel core platforms.Work on LLM-powered systems and production-grade ML pipelines contributing to a greener future. · ...

    Toronto

    1 month ago

  • Work in company

    GenAI ML Engineer

    Only for registered members

    Toronto, Ontario M5V 3L9 Posted February 20th, 2026 · Looking for more job opportunities? Click here · Job Type: Full Time · Job Category: IT · Job Description · GenAI ML Engineer · Toronto, ON - Onsite · """Total Experience: 6-8 years · Required Skill Sets: · We are seeking a ...

    Toronto, ON

    1 week ago

  • Work in company

    GenAI ML Engineer-3

    Only for registered members

    Toronto, Ontario M5V 3L9 Posted February 25th, 2026 · Looking for more job opportunities? Click here · Job Type: Full Time · Job Category: IT · Job Description · GenAI ML Engineer · Toronto, ON - Onsite · """Total Experience: 6-8 years · Required Skill Sets: · We are seeking a ...

    Toronto, ON

    1 week ago

  • Work in company

    GenAI ML Engineer-4

    Only for registered members

    Toronto, Ontario M5V 3L9 Posted February 26th, 2026 · Looking for more job opportunities? Click here · Job Type: Full Time · Job Category: IT · Job Description · GenAI ML Engineer · Toronto, ON - Onsite · """Total Experience: 6-8 years · Required Skill Sets: · We are seeking a ...

    Toronto, ON

    6 days ago

  • Work in company

    GenAI ML Engineer-2

    Only for registered members

    Toronto, Ontario M5V 3L9 Posted February 24th, 2026 · Looking for more job opportunities? Click here · Job Type: Full Time · Job Category: IT · Job Description · GenAI ML Engineer · Toronto, ON - Onsite · """Total Experience: 6-8 years · Required Skill Sets: · We are seeking a ...

    Toronto, ON

    1 week ago

  • Work in company

    GenAI ML Engineer-1

    Only for registered members

    Toronto, Ontario M5V 3L9 Posted February 21st, 2026 · Looking for more job opportunities? Click here · Job Type: Full Time · Job Category: IT · Job Description · GenAI ML Engineer · Toronto, ON - Onsite · """Total Experience: 6-8 years · Required Skill Sets: · We are seeking a ...

    Toronto, ON

    1 week ago

  • Work in company

    GenAI ML Engineer-5

    Only for registered members

    Toronto, Ontario M5V 3L9 Posted February 27th, 2026 · Looking for more job opportunities? Click here · Job Type: Full Time · Job Category: IT · Job Description · GenAI ML Engineer · Toronto, ON - Onsite · """Total Experience: 6-8 years · Required Skill Sets: · We are seeking a ...

    Toronto, ON

    5 days ago

  • Work in company

    Data/AI Engineer

    Only for registered members

    Guidepoint seeks an experienced Data/AI Engineer as an integral member of the Toronto-based AI team. The Toronto Technology Hub serves as the base of our Data/AI/ML team, dedicated to building a modern data infrastructure for advanced analytics and the development of responsible ...

    Toronto, ON

    1 month ago

Jobs
>
Toronto