-
Research DevOps Engineer GEMINI Systems
2 weeks ago
Toronto, ON
We are seeking an experienced DevOps Engineer to join our team and champion the evolution of our HPC infrastructure. · This role is pivotal in transforming our configuration management into a robust, scalable GitOps architecture. ...
-
GCP Cloud – DevOps Engineer
1 month ago
Toronto, Ontario · Remote job
We are seeking a Senior GCP DevOps – HPC Engineer to support a large-scale Pharmaceuticals / Life Sciences initiative. · Lead the migration of on-premises SLURM-based HPC clusters to Google Cloud Platform. · Design, implement, and manage scalable and secure HPC infrastructure on ...
-
Site Reliability Engineer, AI/ML Infrastructure
4 weeks ago
Toronto
We're looking for a Senior Site Reliability Engineer to help us run one of the most exciting GPU clusters around. · We'll be hands-on with the full lifecycle of HPC infrastructure: planning, building, testing, deploying, and keeping everything running smoothly. · You'll also he ...
-
Research DevOps Engineer GEMINI Systems
2 weeks ago
Toronto, Ontario
GEMINI Systems is at the forefront of medical research and innovation. We are seeking an experienced DevOps Engineer to join our team and champion the evolution of our HPC infrastructure. · We operate a 100% Linux environment and are deeply committed to automating our infrastruct ...
-
Site Reliability Engineer, AI/ML Infrastructure
53 minutes ago
Toronto
We're looking for a Senior Site Reliability Engineer to help us run one of the most exciting GPU clusters around: our Toronto datacenter, packed with NVIDIA H100 and A100 GPUs, over 20 PB of Ceph storage, terabit networking, and hundreds of servers. · Manage and optimize HPC cluster ope ...
-
Product Manager HPCWorks
1 month ago
Toronto, Ontario
We are shaping the future of compute-intensive engineering, science, and AI. Our mission is to make HPC more accessible, efficient, and intelligent for users and administrators across the world's leading industries. We are hiring a Product Manager to lead our intuitive portal for ...
-
Senior GPU Solutions Consultant
4 weeks ago
Toronto, Ontario
We're hiring a Senior Sales Executive with deep experience selling servers, GPU systems, cloud/compute infrastructure, or data-center hardware into complex accounts. · ...
-
Senior GPU Solutions Consultant
1 month ago
Toronto, Ontario
Sell into one of the fastest-growing infrastructure markets in the world: GPU compute. Work directly with founders and engineering. Influence GTM and hardware roadmap. · ...
-
Product Manager
1 month ago
Toronto, Ontario
We are seeking a Product Manager to evolve HPCWorks as next-generation workloads integrate AI and quantum computing. HPCWorks enables the world's most demanding compute workloads across industries including semiconductor design, automotive, aerospace, life sciences, and research ...
-
Toronto, Ontario
This position involves building and operating mission-critical platform infrastructure using DevOps practices. You'll design, scale, and automate platforms to improve productivity. · ...
-
Network Engineer, AI/ML Infrastructure
1 month ago
Toronto
We're seeking an experienced Network Engineer to design, build, and optimize the high-performance networking infrastructure powering our AI/ML operations in Toronto. You'll work at the cutting edge of network technology, managing InfiniBand and ultra-high-speed Ethernet fabrics that ...
-
Network Engineer, AI/ML Infrastructure
53 minutes ago
Toronto
We're seeking an experienced Network Engineer to design, build, and optimize the high-performance networking infrastructure powering our AI/ML operations in Toronto. You'll work at the cutting edge of network technology, managing InfiniBand and ultra-high-speed Ethernet fabrics th ...
-
Freelance AI
1 day ago
Toronto, Ontario · Remote job
We are seeking a technical content writer skilled in AI hardware computing infrastructure who can create content translating complex topics into clear narratives for both technical and business audiences, supporting enterprise use cases across finance, private equity, hedge funds, healt ...
-
Network Engineer, AI/ML Infrastructure
1 month ago
Toronto, Ontario
We're seeking an experienced Network Engineer to design, build, and optimize the high-performance networking infrastructure powering our AI/ML operations in Toronto. · We'll work at the cutting edge of network technology, managing InfiniBand and ultra-high-speed Ethernet fabrics th ...
-
Toronto, Ontario
This is one of the best opportunities out there for a passionate Linux infrastructure enthusiast: working on the newest and best tech around, with a chance to make your mark on a growing organisation. · ...
-
Site Reliability Engineer
1 month ago
Toronto, Ontario
We are seeking a Site Reliability Engineer (SRE) with experience spanning cloud and data center environments to drive infrastructure reliability, observability, and scalability. · 3-5 years in a Site Reliability Engineering (SRE) or DevOps role. · Strong software development back ...
-
Lead Software Developer
4 weeks ago
Toronto, ON
The Faculty of Arts & Science is the heart of Canada's leading university and one of the most comprehensive and diverse academic divisions in the world. The strength of Arts & Science derives from our combined teaching and research excellence in the humanities, sciences and socia ...
-
Senior ML Systems Engineer, Frameworks
1 week ago
Toronto, Ontario
We're looking for a senior engineer to help build, maintain, and evolve the training framework that powers our frontier-scale language models. This role sits at the intersection of large-scale training, distributed systems, and HPC infrastructure. · Build and own the training framewo ...
-
Senior Solutions Specialist
1 month ago
Toronto, Ontario
We are seeking a strategic and driven AI Solutions Specialist to join our national sales team. · ...
-
Senior Memory Controller Verification Engineer
3 weeks ago
Toronto, Ontario
NVIDIA is seeking a hardworking and motivated Senior Verification Engineer for the Tegra SoC Memory Subsystem IP verification team. · Develop verification infrastructure (testbenches, BFMs, checkers, monitors). · Craft and implement verification test plans. · ...
-
Chip Firmware Development, Intern
1 month ago
Toronto, Ontario
We are hiring an Intern for our Software team at Lightmatter. We are a photonic computer company redefining what computers and human beings are capable of by building the engines that will power discoveries and drive progress in a sustainable way. · ...
Senior SRE: AI/ML HPC Infrastructure - Toronto - Boson AI
Description
A leading technology company in Toronto is seeking a Senior Site Reliability Engineer to manage and optimize their high-performance computing (HPC) cluster.
The ideal candidate will have over 5 years of experience in SRE or HPC operations, proficiency in Linux, and expertise in Kubernetes and automation.
This role involves deploying infrastructure-as-code solutions and supporting research teams. A competitive salary ranging from $150,000 to $250,000 per year is offered along with opportunities for continuous learning.
-
Platform Engineer – Elite Quant - Up to 240,000 CAD Starting base + Exceptional benefits/bonus package Fund – Toronto
Toronto, Ontario
-
DevOps Engineer (Kubernetes SME) - Up to 250k CAD + Industry Leading Bonus - Elite FinTech Firm
Toronto, Ontario