Deployment Engineer, AI Inference - Toronto - Cerebras Systems Inc.

Cerebras Systems Inc. Toronto

1 day ago

Description

AI Inference Deployment Engineer

Cerebras Systems builds AI compute power using its Wafer-Scale Engine (WSE). This architecture enables industry-leading training and inference speeds.

Deploy AI replicas across datacenters
Maximize capacity allocation with constraint-solver algorithms

AI Inference Engineer
1 month ago

Only for registered members Toronto, Ontario

We are looking for highly skilled engineers with a focus on C/C++, low level systems, performance and power optimization to join our team full-time. · Innovate on the inference optimization pipeline through algorithmic and system optimization · Own end to end system characterizat ...
AI Inference Engineer
1 month ago

Only for registered members Toronto

We are building the Inference infrastructure for Edge, where we are unlocking data center ai inference capability closer to where the data lives. · We are looking for highly skilled engineers with a focus on C/C++, low level systems, performance and power optimization to join our ...
Audio Inference Engineer, Model Efficiency
1 month ago

Only for registered members Toronto, Ontario

We re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation semantic search RAG and agents. · We obsess over what we build Each one of us is responsible for contributing to increasing ...
Site Reliability Engineer, Inference Infrastructure
1 day ago

Only for registered members Toronto

+ Job summary: Who are we? · Our mission is to scale intelligence to serve humanity. · We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. W ...
Staff Software Engineer, Inference Infrastructure
1 month ago

Only for registered members Toronto, Ontario

+We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · + · +Developing, deploying, and operating the AI platform delivering Cohere's large l ...
Site Reliability Engineer, Inference Infrastructure
1 month ago

Only for registered members Toronto, Ontario

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Build self-service systems that automate managing, deploying and operating services. · ...
Site Reliability Engineer, Inference Infrastructure
1 month ago

Only for registered members Toronto

We're looking for a Site Reliability Engineer to join our Model Serving team. The team develops, deploys, and operates AI platforms delivering large language models through easy-to-use API endpoints. · You will build self-service systems that automate managing, deploying, and ope ...
Full-Stack Software Engineer, Inference
1 day ago

Only for registered members Toronto

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
Full-Stack Software Engineer, Inference
1 month ago

Only for registered members Toronto, Ontario

Cohere is training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, · and agents. · We believe that our work is instrumental to the widespread adoption of AI. · We ...
Full-Stack Software Engineer, Inference
1 month ago

Only for registered members Toronto

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, · semantic search, · RAG, · and agents.Cohere customers self-serve our API without any intervention. · ...
Sr. Inference ML Runtime Engineer
1 day ago

Only for registered members Toronto

+Job summary · Cerebras Systems builds the world's largest AI chip, · +Drive and provide technical guidance to a team of software engineers working on complex machine learning integration projects. · Design and implement ML features (e.g., structured outputs, biased sampling, pre ...
Sr. Deployment Engineer, AI Inference
3 days ago

Only for registered members Toronto

Cerebras Systems builds the world's largest AI chip which is 56 times larger than GPUs. · Deploy AI inference replicas and cluster software across multiple datacenters · Operate across heterogeneous datacenter environments undergoing rapid 10x growth · Maximize capacity allocat ...
Senior Software Engineer, AI Inference Systems
4 weeks ago

Only for registered members Toronto, Ontario

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. · Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and o ...
Staff Software Engineer, Inference Infrastructure
1 month ago

Only for registered members Toronto

+We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, · and agents.We believe that our work is instrumental to the widespread adoption of AI. · +In this ...
Site Reliability Engineer, Inference Infrastructure
1 month ago

Only for registered members Toronto Full time

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. · ...
Site Reliability Engineer, Inference Infrastructure
1 month ago

Only for registered members Toronto

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Build self-service systems that automate managing, deploying and operating services. · ...
Full-Stack Software Engineer, Inference
1 month ago

Only for registered members Toronto

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, · and agents.We believe that our work is instrumental to the widespread adoption of AI. · Improve th ...
Full-Stack Software Engineer, Inference
1 month ago

Only for registered members Toronto Full time

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
Senior Software Engineer, AI Inference Systems
4 weeks ago

Only for registered members Toronto $170,000 - $275,000 (CAD)

We are seeking highly skilled software engineers to build AI inference systems. You'll architect and implement high-performance stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across environments. · You'll collaborate with teams to push ...
Neural Rendering Research Inference Engineer – Advanced Graphics Programs
1 month ago

Only for registered members Markham Full time

We are seeking an exceptional Neural Rendering Research Inference Engineer, Advanced Graphics Program who has deep technical expertise in translating neural network models and algorithms to efficient inference solutions. · ...
Neural Rendering Research Inference Engineer – Advanced Graphics Programs
1 month ago

Only for registered members Markham

We are seeking an exceptional Neural Rendering Research Inference Engineer, Advanced Graphics Program who has deep technical expertise in translating neural network models and algorithms to efficient inference solutions. · AMD is looking for a strategic research inference enginee ...

AI Inference Engineer
Only for registered members Toronto, Ontario
AI Inference Engineer
Only for registered members Toronto
Audio Inference Engineer, Model Efficiency
Only for registered members Toronto, Ontario
Site Reliability Engineer, Inference Infrastructure
Only for registered members Toronto
Staff Software Engineer, Inference Infrastructure
Only for registered members Toronto, Ontario
Site Reliability Engineer, Inference Infrastructure
Only for registered members Toronto, Ontario
Site Reliability Engineer, Inference Infrastructure
Only for registered members Toronto
Full-Stack Software Engineer, Inference
Only for registered members Toronto
Full-Stack Software Engineer, Inference
Only for registered members Toronto, Ontario
Full-Stack Software Engineer, Inference
Only for registered members Toronto
Sr. Inference ML Runtime Engineer
Only for registered members Toronto
Sr. Deployment Engineer, AI Inference
Only for registered members Toronto
Senior Software Engineer, AI Inference Systems
Only for registered members Toronto, Ontario
Staff Software Engineer, Inference Infrastructure
Only for registered members Toronto
Site Reliability Engineer, Inference Infrastructure
Full time Only for registered members Toronto
Site Reliability Engineer, Inference Infrastructure
Only for registered members Toronto
Full-Stack Software Engineer, Inference
Only for registered members Toronto
Full-Stack Software Engineer, Inference
Full time Only for registered members Toronto
Senior Software Engineer, AI Inference Systems
Only for registered members Toronto
Neural Rendering Research Inference Engineer – Advanced Graphics Programs
Full time Only for registered members Markham
Neural Rendering Research Inference Engineer – Advanced Graphics Programs
Only for registered members Markham