Full Stack LLM Engineer - Toronto - Cerebras

Cerebras Toronto

5 days ago

Description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users to effortlessly run large‑scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include top model labs, global enterprises, and cutting‑edge AI‑native startups. OpenAI recently announced a multi‑year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra‑high‑speed inference.

Thanks to the groundbreaking wafer‑scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU‑based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real‑time iteration and increasing intelligence via additional agentic computation.

About the Role

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible for rapidly bringing up state‑of‑the‑art open‑source models (such as LLaMA and Qwen) or customer‑provided proprietary models on our Cerebras CSX systems. Success in this role requires a system‑minded generalist who thrives in fast‑paced bring‑up environments and is comfortable working across the entire Cerebras software stack.

Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end‑to‑end bring up of ML models on Cerebras CSX systems.
Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.
Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Skills & Qualifications

Bachelor's, Master's, or PhD in Computer Science, Engineering, or a related field.
Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.
Strong debugging skills across performance, numerical accuracy, and runtime integration.
Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).
Proficiency in C/C++ programming and experience with low‑level optimization.
Proven experience in compiler development, particularly with LLVM and/or MLIR.
Strong background in optimization techniques, particularly those involving NP‑hard problems.

What We Offer

Competitive salary and benefits package.
Opportunities for professional growth and career advancement.
A dynamic and innovative work environment.
The chance to work on cutting‑edge technologies and make a significant impact on the future of AI.

This offer is contingent upon Cerebras successfully obtaining an export license from the U.S. Department of Commerce's Bureau of Industry and Security authorizing the release to you of certain software source code and/or technology that is subject to the Export Administration Regulations. However, we can make no assurances with respect to the final disposition of an export license application.

Why Join Cerebras

Build a breakthrough AI platform beyond the constraints of the GPU.
Publish and open source your cutting‑edge AI research.
Work on one of the fastest AI supercomputers in the world.
Enjoy job stability with startup vitality.
Be part of a simple, non‑corporate work culture that respects individual beliefs.

Read our blog: Five Reasons to Join Cerebras in 2026.

Apply today and join the forefront of groundbreaking advancements in AI.

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
