Full Stack LLM Engineer - Toronto - Cerebras Systems Inc.

    Cerebras Systems Inc.
    Cerebras Systems Inc. Toronto

    1 week ago

    Full time
    Description

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

    Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

    About the Role


    We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

    Responsibilities

    • Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.
    • Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
    • Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.
    • Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

    Skills & Qualifications

    • Bachelor's, Master's, or PhD in Computer Science, Engineering, or a related field.
    • Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.
    • Strong debugging skills across performance, numerical accuracy, and runtime integration.
    • Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).
    • Proficiency in C/C++ programming and experience with low-level optimization.
    • Proven experience in compiler development, particularly with LLVM and/or MLIR.
    • Strong background in optimization techniques, particularly those involving NP-hard problems.

    What We Offer

    • Competitive salary and benefits package.
    • Opportunities for professional growth and career advancement.
    • A dynamic and innovative work environment.
    • The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

    Why Join Cerebras


    People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we've reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

    • Build a breakthrough AI platform beyond the constraints of the GPU.
    • Publish and open source their cutting-edge AI research.
    • Work on one of the fastest AI supercomputers in the world.
    • Enjoy job stability with startup vitality.
    • Our simple, non-corporate work culture that respects individual beliefs.

    Equal Employment Opportunity


    Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.

    #J-18808-Ljbffr

  • LLM Engineer

    1 month ago

    Only for registered members Toronto, Ontario

    We are actively ramping up for · Agentic AI · and LLM Engineer positions as our clients expand into autonomous AI systems, agent-based workflows and large-scale model integration. · ...

  • Only for registered members Toronto, Ontario, Canada

    We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. · Contr ...

  • Only for registered members Toronto Full time

    + Full Stack LLM EngineerCerebras Systems builds the world's largest AI chip, · Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, · This approach allows Cerebras to deliver industry-leading training and inference speeds · and emp ...

  • Only for registered members Toronto

    About the Role/We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX s ...

  • Only for registered members Markham $158,400 - $237,600 (USD)

    We are hiring LLM Serving Engineers at multiple levels to join our dynamic, collaborative team. · This role spans the full product lifecycle—from cutting-edge research and development to commercial deployment—and demands strategic thinking, · strong execution, and excellent commu ...

  • Only for registered members Toronto, Ontario Remote job

    We believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model quantization and sparsification,Contri ...

  • Only for registered members Toronto $170,770 - $281,770 (USD)

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise.As leading developers maintainers of the vLLM project and inventors of state-of-the-art techniques for model quantization and sparsification ...

  • Only for registered members Toronto

    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. · Contribute to the design, development, and testing of various inference optimization algorithms in the vLLM LLM-compressor project · Crea ...

  • Only for registered members Greater Toronto Area

    This is an exciting position for a talented Machine Learning Engineer to join a dynamic team designing intelligent systems that fuel core platforms.We thank all candidates in advance. · ...

  • Only for registered members Toronto

    This is an exciting position for a talented Machine Learning Engineer to join a dynamic team designing intelligent systems that fuel core platforms.Work on LLM-powered systems and production-grade ML pipelines contributing to a greener future. · ...

  • Data/AI Engineer

    1 month ago

    Only for registered members Toronto, ON

    Guidepoint seeks an experienced Data/AI Engineer as an integral member of the Toronto-based AI team. The Toronto Technology Hub serves as the base of our Data/AI/ML team, dedicated to building a modern data infrastructure for advanced analytics and the development of responsible ...

  • AI Engineer

    1 month ago

    Only for registered members Toronto, ON

    Job summary · A Guidepoint seeks an experienced Senior AI Engineer as an integral member of the Toronto-based AI team. The position demands exceptional leadership and technical prowess to drive the development of next-generation research enablement platforms and AI-driven data pr ...

  • Gen AI Developer

    4 weeks ago

    Only for registered members Toronto, ON

    AI Developer specialized in Agentic AI, OpenAI development, and RAG (Retrieval-Augmented Generation) · 4–8+ years in software engineering / ML engineering; 1–3+ years in LLM/agent/RAG systems. · Strong in Python (or TypeScript), async programming, API design, distributed systems. ...

  • Only for registered members Toronto, ON

    +Job summary: · We are seeking a talented and experienced Machine Learning Engineer to join our growing AI team at Thrive. · +Design, develop, and evaluate LLM agents and agentic frameworks. · Research and implement multi-agent architectures. · Train fine-tune machine learning mo ...

  • Only for registered members Toronto, ON

    +8 years in AI/ML/Data/Engineering · +Strong hands-on experience with LLMs (OpenAI, Claude, Gemini) · +Experience with RAG, vector databases, · and GenAI frameworks (LangChain) ...

  • Only for registered members Toronto, ON

    Tata Consultancy Services (TCS) is an equal opportunity employer that embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity. · ...

  • Only for registered members Toronto, ON

    We empower personal injury lawyers and victims to get the justice they deserve using technology and AI. · About EvenUp · EvenUp is on a mission to close the justice gap using technology and AI. · We empower personal injury lawyers and victims to obtain the justice they deserve. ...

  • Only for registered members Toronto, ON

    We're looking for an Applied Scientist to work at the intersection of applied research and production AI, with a strong focus on knowledge-centric AI systems, graph-based learning, and advanced Retrieval-Augmented Generation (RAG) architectures. · ...

  • Only for registered members Toronto, ON Remote job

    We are hiring a Lead/Senior Product Manager to own the Analytics, Reporting, and Conversational BI product surfaces in Agentic Studio—the measurement and trust layer that helps enterprises build, operate, and continuously improve agentic customer experiences across chat, mobile, ...

  • Only for registered members Toronto, ON

    We are looking for a Senior Fullstack Software Engineer A.I to join our AI & Knowledge team responsible for building our conversational AI assistant (Alfie). The ideal candidate will be responsible for designing building and scaling innovative LLM based solutions that improves ou ...

  • Only for registered members Toronto, ON Remote job

    We're looking for a Senior Software Engineer to join Kong's Office of the CTO team and work on special, high-impact projects that influence the direction of our platform and engineering strategy. · ...

Jobs
>
Toronto