Work in company

AI Inference Engineer

Only for registered members

We are looking for highly skilled engineers with a focus on C/C++, low level systems, performance and power optimization to join our team full-time. · Innovate on the inference optimization pipeline through algorithmic and system optimization · Own end to end system characterizat ...

Toronto, Ontario

1 month ago

Work in company

AI Inference Engineer

Only for registered members

We are building the Inference infrastructure for Edge, where we are unlocking data center ai inference capability closer to where the data lives. · We are looking for highly skilled engineers with a focus on C/C++, low level systems, performance and power optimization to join our ...

Toronto

1 month ago

Work in company

Audio Inference Engineer, Model Efficiency

Only for registered members

We re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation semantic search RAG and agents. · We obsess over what we build Each one of us is responsible for contributing to increasing ...

Toronto, Ontario

1 month ago

Work in company

Site Reliability Engineer, Inference Infrastructure

Only for registered members

We're looking for a Site Reliability Engineer to join our Model Serving team. The team develops, deploys, and operates AI platforms delivering large language models through easy-to-use API endpoints. · You will build self-service systems that automate managing, deploying, and ope ...

Toronto

1 month ago

Work in company

Staff Software Engineer, Inference Infrastructure

Only for registered members

+We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · + · +Developing, deploying, and operating the AI platform delivering Cohere's large l ...

Toronto, Ontario

1 month ago

Work in company

Site Reliability Engineer, Inference Infrastructure

Only for registered members

Who are we? · Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that ou ...

Toronto

1 week ago

Work in company

Site Reliability Engineer, Inference Infrastructure

Only for registered members

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Build self-service systems that automate managing, deploying and operating services. · ...

Toronto, Ontario

1 month ago

Work in company

Full-Stack Software Engineer, Inference

Only for registered members

Cohere is training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, · and agents. · We believe that our work is instrumental to the widespread adoption of AI. · We ...

Toronto, Ontario

1 month ago

Work in company

Sr. Deployment Engineer, AI Inference

Only for registered members

Cerebras Systems builds the world's largest AI chip which is 56 times larger than GPUs. · Deploy AI inference replicas and cluster software across multiple datacenters · Operate across heterogeneous datacenter environments undergoing rapid 10x growth · Maximize capacity allocat ...

Toronto

3 weeks ago

Work in company

Staff Software Engineer, Inference Infrastructure

Only for registered members

+We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, · and agents.We believe that our work is instrumental to the widespread adoption of AI. · +In this ...

Toronto

1 month ago

Work in company

Full-Stack Software Engineer, Inference

Only for registered members

Who are we? · Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that ou ...

Toronto

6 days ago

Work in company

Site Reliability Engineer, Inference Infrastructure

Only for registered members

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Build self-service systems that automate managing, deploying and operating services. · ...

Toronto

1 month ago

Work in company

Sr. Inference ML Runtime Engineer

Only for registered members

+Job summary · Cerebras Systems builds the world's largest AI chip, · +Drive and provide technical guidance to a team of software engineers working on complex machine learning integration projects. · Design and implement ML features (e.g., structured outputs, biased sampling, pre ...

Toronto

3 weeks ago

Work in company

Senior Software Engineer, AI Inference Systems

Only for registered members

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry ben ...

Toronto, Ontario

1 week ago

Work in company

Senior Software Engineer, AI Inference Systems

Only for registered members

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. · Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and o ...

Toronto, Ontario

1 month ago

Work in company

Senior Software Engineer, AI Inference Systems

Only for registered members

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry ben ...

Canada, Toronto $170,000 - $270,000 (CAD) per year

2 days ago

Work in company

Site Reliability Engineer, Inference Infrastructure

Only for registered members

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. · ...

Toronto Full time

1 month ago

Work in company

Full-Stack Software Engineer, Inference

Only for registered members

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, · and agents.We believe that our work is instrumental to the widespread adoption of AI. · Improve th ...

Toronto

1 month ago

Work in company

Full-Stack Software Engineer, Inference

Only for registered members

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

Toronto Full time

1 month ago

Work in company

Senior Software Engineer, AI Inference Systems

Only for registered members

We are seeking highly skilled software engineers to build AI inference systems. You'll architect and implement high-performance stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across environments. · You'll collaborate with teams to push ...

Toronto $170,000 - $275,000 (CAD)

1 month ago

Full-Stack Software Engineer, Inference - Toronto

Job description

Similar jobs

AI Inference Engineer

AI Inference Engineer

Audio Inference Engineer, Model Efficiency

Site Reliability Engineer, Inference Infrastructure

Staff Software Engineer, Inference Infrastructure

Site Reliability Engineer, Inference Infrastructure

Site Reliability Engineer, Inference Infrastructure

Full-Stack Software Engineer, Inference

Sr. Deployment Engineer, AI Inference

Staff Software Engineer, Inference Infrastructure

Full-Stack Software Engineer, Inference

Site Reliability Engineer, Inference Infrastructure

Sr. Inference ML Runtime Engineer

Senior Software Engineer, AI Inference Systems

Senior Software Engineer, AI Inference Systems

Senior Software Engineer, AI Inference Systems

Site Reliability Engineer, Inference Infrastructure

Full-Stack Software Engineer, Inference

Full-Stack Software Engineer, Inference

Senior Software Engineer, AI Inference Systems

Directory

for Recruiters

Information