Python Systems Engineer for Local Air-Gapped LLM Inference

Only for registered members Canada

3 weeks ago

$30 - $55 (USD) per hour

Job Description · I'm building a purely local, offline AI system that runs on a dedicated workstation (NVIDIA RTX 4090).The system is designed to work without any external APIs or cloud dependencies and focuses on deterministic, structured outputs, not chat-style interactions. · ...

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Work in company

LLM Inference Deployment Engineer

Only for registered members

· EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class ...

U.S., Canada, Germany, Norway $100,000 - $150,000 (USD) per year

4 days ago

Work in company Remote job

Massive-Scale LLM Inference

Only for registered members

Engineer massive-scale inference provider to undercut current market price. Serve 600,000 concurrent users on decentralized fleet of GPUs. Optimize throughput to achieve cost basis below $0.02 per million tokens. · Create highly optimized container for GPU hardware detection. · C ...

$500 - $0 (USD) budget

1 month ago

Work in company Remote job

Local LLM inference + RAG system

Only for registered members

I'm looking to hire a consultant to help build a local LLM inference + RAG system that can ingest and reason over large document sets — primarily aircraft sales contracts and theological texts. · ...

1 month ago

Work in company

AI Systems

Only for registered members

We think that AI systems should be flexible, personalized and accessible to everyone. · Design and build our LLM inference stack from zero to one. · Develop and optimize inference using modern frameworks. · Collaborate closely with founders and model developers to analyze bottlen ...

Toronto

1 month ago

Work in company

LLMOps Engineer

Only for registered members

About the Company · A next-generation AI startup with Silicon Valley roots is hiring a LLMOps Engineer in Toronto to design, deploy, and optimize large-scale LLM infrastructure powering AI-native semiconductor design tools. With $33M+ in funding and rapid growth, the company is s ...

Toronto

3 days ago

Work in company Remote job

AI Engineer – Vision + LLM Pipeline

Only for registered members

We are looking for a senior AI/backend engineer to work on a production-grade vision + LLM inference pipeline. · This role is focused on system design and implementation, · not experimentation or prompt-only work.Python (FastAPI or similar backend frameworks) · ,Vision-enabled LL ...

2 weeks ago

Work in company

Staff LLMOps Engineer

Only for registered members

We are building the next generation, enterprise product suite to empower semiconductor design engineers to achieve a 10x productivity boost with proprietary AI/ML models and modern cloud technologies. · Design and implement production-ready LLM deployment pipelines on AWS and Kub ...

Toronto

1 month ago

Work in company Remote job

AI/LLM Engineer for Clinical Text Analysis

Only for registered members

We are looking for an experienced AI / LLM Engineer to support the design and implementation of a secure on-premise Large Language Model (LLM) pipeline for clinical text analysis. ...

1 month ago

Work in company Remote job

AI Engineer – Vision + LLM Pipeline

Only for registered members

We are looking for a senior AI / backend engineer to work on a production-grade vision + LLM inference pipeline. This role is focused on system design and implementation, not experimentation or prompt-only work. · The system combines: · Vision-enabled LLMs · Optional secondary mo ...

13 hours ago

Work in company Remote job

Korean-English Bilingual AI Engineer — Healthcare LLM

Only for registered members

We're building an LLM-powered assistant (agent) that can answer Korean healthcare-domain questions accurately and safely. · ...

$700 - $0 (USD) budget

1 month ago

Work in company Remote job

Senior AI Engineer

Only for registered members

We are building a serious AI product focused on transforming real-world business conversations into structured intelligence insights and automation. · AI pipelines that analyze recorded conversations speech text structured insights · LLM-based systems for summarization classifi ...

4 weeks ago

Work in company Remote job

AI Engineer

Only for registered members

We're looking for an AI engineer who can architect and implement production-grade LLM systems—someone equally comfortable fine-tuning models on custom datasets and orchestrating inference APIs that serve at scale. · This is a hybrid role that sits at the intersection of ML engine ...

$19 - $40 (USD) per hour

1 month ago

Work in company

Research Co-Op intern

Only for registered members

As a Research Intern at Dell Technologies Office of CTO (OCTO), you will be part of a world class research team delving deep into software and hardware aspects of next generation Generative AI techniques, · working on developing highly performance systems that push the state-of-a ...

Ottawa $39 - $42 (CAD)

3 weeks ago

Work in company

Senior Software Engineer

Only for registered members

We're hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, · authentication and inference to all models · subscription management and subscription entitlement (e.g. context-length, concurrency limits) · and providing the necessary API sur ...

Toronto

1 month ago

Work in company

Research Co-Op intern

Only for registered members

You will be part of a world class research team delving deep into software and hardware aspects of next generation Generative AI techniques working on developing highly performance systems that push the state-of-art. · For this specific project you will use your deep expertise in ...

Ottawa $39.02 - $42.54 (CAD) Internship

3 weeks ago

Work in company Remote job

AI Systems

Only for registered members

We believe efficiency is what makes AI possible - it's how we expand access and ensure innovation benefits the many, not the few. · We're looking for builders and creative thinkers ready to shape the next era of intelligence.The Role · You'll work directly with our founders to de ...

Toronto, Ontario

1 month ago

Work in company Remote job

Senior AI/LLM Engineer Job Description

Only for registered members

We are building an AI platform that blends long context LLMs and tool calling agents into a seamless developer and end user experience. · Design and implement systems that make large models faster, smarter, and more context aware. · Evaluate LoRA adapters for domain specific beha ...

$20 - $45 (USD) per hour

3 weeks ago

Work in company Remote job

Senior ML

Only for registered members

We are building LINA, an AI-powered communication assistant designed to help neurodivergent people interpret tone, intent. · LINA is moving into a local / edge-deployed architecture (enterprise and wearable use cases), with strict latency and privacy requirements.This is not a ch ...

$30 - $60 (USD) per hour

1 month ago

Work in company

AI Architect

Only for registered members

We are seeking an experienced AI Architect to design next-generation enterprise AI systems with a strong focus on Agentic AI. · Define enterprise AI architecture incorporating Agentic RAG. · Establish architectural guardrails for tool schemas. · ...

Toronto

1 week ago

Work in company Remote job

Azure Environment Setup for Open-Source LLM Model

Only for registered members

We are seeking an experienced · Azure specialist to set up an environment · for hosting the open-source LLM model.Cloud infrastructure · Deploying LLM models · ...

1 month ago

Python Systems Engineer for Local Air-Gapped LLM Inference

Job description

Similar jobs

LLM Inference Deployment Engineer

Massive-Scale LLM Inference

Local LLM inference + RAG system

AI Systems

LLMOps Engineer

AI Engineer – Vision + LLM Pipeline

Staff LLMOps Engineer

AI/LLM Engineer for Clinical Text Analysis

AI Engineer – Vision + LLM Pipeline

Korean-English Bilingual AI Engineer — Healthcare LLM

Senior AI Engineer

AI Engineer

Research Co-Op intern

Senior Software Engineer

Research Co-Op intern

AI Systems

Senior AI/LLM Engineer Job Description

Senior ML

AI Architect

Azure Environment Setup for Open-Source LLM Model

Directory

for Recruiters

Information