Full Stack LLM Engineer - Toronto - Cerebras Systems Inc.

1 week ago

Full time

Description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About the Role

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.
Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.
Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Skills & Qualifications

Bachelor's, Master's, or PhD in Computer Science, Engineering, or a related field.
Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.
Strong debugging skills across performance, numerical accuracy, and runtime integration.
Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).
Proficiency in C/C++ programming and experience with low-level optimization.
Proven experience in compiler development, particularly with LLVM and/or MLIR.
Strong background in optimization techniques, particularly those involving NP-hard problems.

What We Offer

Competitive salary and benefits package.
Opportunities for professional growth and career advancement.
A dynamic and innovative work environment.
The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we've reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.
Publish and open source their cutting-edge AI research.
Work on one of the fastest AI supercomputers in the world.
Enjoy job stability with startup vitality.
Our simple, non-corporate work culture that respects individual beliefs.

Equal Employment Opportunity

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.

#J-18808-Ljbffr

LLM Engineer
1 month ago

Only for registered members Toronto, Ontario

We are actively ramping up for · Agentic AI · and LLM Engineer positions as our clients expand into autonomous AI systems, agent-based workflows and large-scale model integration. · ...
Full Stack LLM Engineer
1 day ago

Only for registered members Toronto, Ontario, Canada

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. · Contr ...
Full Stack LLM Engineer
17 hours ago

Only for registered members Toronto Full time

+ Full Stack LLM EngineerCerebras Systems builds the world's largest AI chip, · Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, · This approach allows Cerebras to deliver industry-leading training and inference speeds · and emp ...
Full Stack LLM Engineer
2 days ago

Only for registered members Toronto

About the Role/We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX s ...
LLM Serving Engineer
5 days ago

Only for registered members Markham $158,400 - $237,600 (USD)

We are hiring LLM Serving Engineers at multiple levels to join our dynamic, collaborative team. · This role spans the full product lifecycle—from cutting-edge research and development to commercial deployment—and demands strategic thinking, · strong execution, and excellent commu ...
Senior Machine Learning Engineer, LLM Compressor and Quantization
1 week ago

Only for registered members Toronto, Ontario Remote job

We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model quantization and sparsification,Contri ...
Senior Machine Learning Engineer, LLM Compressor and Quantization
1 week ago

Only for registered members Toronto $170,770 - $281,770 (USD)

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise.As leading developers maintainers of the vLLM project and inventors of state-of-the-art techniques for model quantization and sparsification ...
Senior Machine Learning Engineer, LLM Compressor and Quantization
1 month ago

Only for registered members Toronto

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · Contribute to the design, development, and testing of various inference optimization algorithms in the vLLM LLM-compressor project · Crea ...
URGENT - Machine Learning Engineer (Python, LLM, SQL), PERMANENT Hybrid
1 week ago

Only for registered members Greater Toronto Area

This is an exciting position for a talented Machine Learning Engineer to join a dynamic team designing intelligent systems that fuel core platforms.We thank all candidates in advance. · ...
URGENT - Machine Learning Engineer (Python, LLM, SQL), PERMANENT Hybrid
1 week ago

Only for registered members Toronto

This is an exciting position for a talented Machine Learning Engineer to join a dynamic team designing intelligent systems that fuel core platforms.Work on LLM-powered systems and production-grade ML pipelines contributing to a greener future. · ...
Data/AI Engineer
1 month ago

Only for registered members Toronto, ON

Guidepoint seeks an experienced Data/AI Engineer as an integral member of the Toronto-based AI team. The Toronto Technology Hub serves as the base of our Data/AI/ML team, dedicated to building a modern data infrastructure for advanced analytics and the development of responsible ...
AI Engineer
1 month ago

Only for registered members Toronto, ON

Job summary · A Guidepoint seeks an experienced Senior AI Engineer as an integral member of the Toronto-based AI team. The position demands exceptional leadership and technical prowess to drive the development of next-generation research enablement platforms and AI-driven data pr ...
Gen AI Developer
4 weeks ago

Only for registered members Toronto, ON

AI Developer specialized in Agentic AI, OpenAI development, and RAG (Retrieval-Augmented Generation) · 4–8+ years in software engineering / ML engineering; 1–3+ years in LLM/agent/RAG systems. · Strong in Python (or TypeScript), async programming, API design, distributed systems. ...
Machine Learning Engineer
1 month ago

Only for registered members Toronto, ON

+Job summary: · We are seeking a talented and experienced Machine Learning Engineer to join our growing AI team at Thrive. · +Design, develop, and evaluate LLM agents and agentic frameworks. · Research and implement multi-agent architectures. · Train fine-tune machine learning mo ...
Senior Generative AI Consultant-1
4 weeks ago

Only for registered members Toronto, ON

+8 years in AI/ML/Data/Engineering · +Strong hands-on experience with LLMs (OpenAI, Claude, Gemini) · +Experience with RAG, vector databases, · and GenAI frameworks (LangChain) ...
GenAI ML Engineer
1 day ago

Only for registered members Toronto, ON

Tata Consultancy Services (TCS) is an equal opportunity employer that embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity. · ...
Senior Backend Engineer, Entities
1 month ago

Only for registered members Toronto, ON

We empower personal injury lawyers and victims to get the justice they deserve using technology and AI. · About EvenUp · EvenUp is on a mission to close the justice gap using technology and AI. · We empower personal injury lawyers and victims to obtain the justice they deserve. ...
Applied Scientist
1 week ago

Only for registered members Toronto, ON

We're looking for an Applied Scientist to work at the intersection of applied research and production AI, with a strong focus on knowledge-centric AI systems, graph-based learning, and advanced Retrieval-Augmented Generation (RAG) architectures. · ...
Lead / Senior Product Manager Analytics, Evals & Conversational BI (Agentic Studio)
1 month ago

Only for registered members Toronto, ON Remote job

We are hiring a Lead/Senior Product Manager to own the Analytics, Reporting, and Conversational BI product surfaces in Agentic Studio—the measurement and trust layer that helps enterprises build, operate, and continuously improve agentic customer experiences across chat, mobile, ...
Senior Fullstack Software Engineer A.I
3 weeks ago

Only for registered members Toronto, ON

We are looking for a Senior Fullstack Software Engineer A.I to join our AI & Knowledge team responsible for building our conversational AI assistant (Alfie). The ideal candidate will be responsible for designing building and scaling innovative LLM based solutions that improves ou ...
Senior Software Engineer
2 weeks ago

Only for registered members Toronto, ON Remote job

We're looking for a Senior Software Engineer to join Kong's Office of the CTO team and work on special, high-impact projects that influence the direction of our platform and engineering strategy. · ...

LLM Engineer
Only for registered members Toronto, Ontario
Full Stack LLM Engineer
Only for registered members Toronto, Ontario, Canada
Full Stack LLM Engineer
Full time Only for registered members Toronto
Full Stack LLM Engineer
Only for registered members Toronto
LLM Serving Engineer
Only for registered members Markham
Senior Machine Learning Engineer, LLM Compressor and Quantization
Only for registered members Toronto, Ontario
Senior Machine Learning Engineer, LLM Compressor and Quantization
Only for registered members Toronto
Senior Machine Learning Engineer, LLM Compressor and Quantization
Only for registered members Toronto
URGENT - Machine Learning Engineer (Python, LLM, SQL), PERMANENT Hybrid
Only for registered members Greater Toronto Area
URGENT - Machine Learning Engineer (Python, LLM, SQL), PERMANENT Hybrid
Only for registered members Toronto
Data/AI Engineer
Only for registered members Toronto, ON
AI Engineer
Only for registered members Toronto, ON
Gen AI Developer
Only for registered members Toronto, ON
Machine Learning Engineer
Only for registered members Toronto, ON
Senior Generative AI Consultant-1
Only for registered members Toronto, ON
GenAI ML Engineer
Only for registered members Toronto, ON
Senior Backend Engineer, Entities
Only for registered members Toronto, ON
Applied Scientist
Only for registered members Toronto, ON
Lead / Senior Product Manager Analytics, Evals & Conversational BI (Agentic Studio)
Only for registered members Toronto, ON
Senior Fullstack Software Engineer A.I
Only for registered members Toronto, ON
Senior Software Engineer
Only for registered members Toronto, ON