Python Systems Engineer for Local Air-Gapped LLM Inference

Only for registered members Canada

3 weeks ago

Default job background
$30 - $55 (USD) per hour
Job Description · I'm building a purely local, offline AI system that runs on a dedicated workstation (NVIDIA RTX 4090).The system is designed to work without any external APIs or cloud dependencies and focuses on deterministic, structured outputs, not chat-style interactions. · ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    LLM Inference Deployment Engineer

    Only for registered members

    · EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class ...

    U.S., Canada, Germany, Norway $100,000 - $150,000 (USD) per year

    4 days ago

  • Work in company Remote job

    Massive-Scale LLM Inference

    Only for registered members

    Engineer massive-scale inference provider to undercut current market price. Serve 600,000 concurrent users on decentralized fleet of GPUs. Optimize throughput to achieve cost basis below $0.02 per million tokens. · Create highly optimized container for GPU hardware detection. · C ...

    $500 - $0 (USD) budget

    1 month ago

  • Work in company Remote job

    Local LLM inference + RAG system

    Only for registered members

    I'm looking to hire a consultant to help build a local LLM inference + RAG system that can ingest and reason over large document sets — primarily aircraft sales contracts and theological texts. · ...

    1 month ago

  • Work in company

    AI Systems

    Only for registered members

    We think that AI systems should be flexible, personalized and accessible to everyone. · Design and build our LLM inference stack from zero to one. · Develop and optimize inference using modern frameworks. · Collaborate closely with founders and model developers to analyze bottlen ...

    Toronto

    1 month ago

  • Work in company

    LLMOps Engineer

    Only for registered members

    About the Company · A next-generation AI startup with Silicon Valley roots is hiring a LLMOps Engineer in Toronto to design, deploy, and optimize large-scale LLM infrastructure powering AI-native semiconductor design tools. With $33M+ in funding and rapid growth, the company is s ...

    Toronto

    3 days ago

  • Work in company Remote job

    AI Engineer – Vision + LLM Pipeline

    Only for registered members

    We are looking for a senior AI/backend engineer to work on a production-grade vision + LLM inference pipeline. · This role is focused on system design and implementation, · not experimentation or prompt-only work.Python (FastAPI or similar backend frameworks) · ,Vision-enabled LL ...

    2 weeks ago

  • Work in company

    Staff LLMOps Engineer

    Only for registered members

    We are building the next generation, enterprise product suite to empower semiconductor design engineers to achieve a 10x productivity boost with proprietary AI/ML models and modern cloud technologies. · Design and implement production-ready LLM deployment pipelines on AWS and Kub ...

    Toronto

    1 month ago

  • Work in company Remote job

    AI/LLM Engineer for Clinical Text Analysis

    Only for registered members

    We are looking for an experienced AI / LLM Engineer to support the design and implementation of a secure on-premise Large Language Model (LLM) pipeline for clinical text analysis. ...

    1 month ago

  • Work in company Remote job

    AI Engineer – Vision + LLM Pipeline

    Only for registered members

    We are looking for a senior AI / backend engineer to work on a production-grade vision + LLM inference pipeline. This role is focused on system design and implementation, not experimentation or prompt-only work. · The system combines: · Vision-enabled LLMs · Optional secondary mo ...

    13 hours ago

  • Work in company Remote job

    Korean-English Bilingual AI Engineer — Healthcare LLM

    Only for registered members

    We're building an LLM-powered assistant (agent) that can answer Korean healthcare-domain questions accurately and safely. · ...

    $700 - $0 (USD) budget

    1 month ago

  • Work in company Remote job

    Senior AI Engineer

    Only for registered members

    We are building a serious AI product focused on transforming real-world business conversations into structured intelligence insights and automation. · AI pipelines that analyze recorded conversations speech text structured insights · LLM-based systems for summarization classifi ...

    4 weeks ago

  • Work in company Remote job

    AI Engineer

    Only for registered members

    We're looking for an AI engineer who can architect and implement production-grade LLM systems—someone equally comfortable fine-tuning models on custom datasets and orchestrating inference APIs that serve at scale. · This is a hybrid role that sits at the intersection of ML engine ...

    $19 - $40 (USD) per hour

    1 month ago

  • Work in company

    Research Co-Op intern

    Only for registered members

    As a Research Intern at Dell Technologies Office of CTO (OCTO), you will be part of a world class research team delving deep into software and hardware aspects of next generation Generative AI techniques, · working on developing highly performance systems that push the state-of-a ...

    Ottawa $39 - $42 (CAD)

    3 weeks ago

  • Work in company

    Senior Software Engineer

    Only for registered members

    We're hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, · authentication and inference to all models · subscription management and subscription entitlement (e.g. context-length, concurrency limits) · and providing the necessary API sur ...

    Toronto

    1 month ago

  • Work in company

    Research Co-Op intern

    Only for registered members

    You will be part of a world class research team delving deep into software and hardware aspects of next generation Generative AI techniques working on developing highly performance systems that push the state-of-art. · For this specific project you will use your deep expertise in ...

    Ottawa $39.02 - $42.54 (CAD) Internship

    3 weeks ago

  • Work in company Remote job

    AI Systems

    Only for registered members

    We believe efficiency is what makes AI possible - it's how we expand access and ensure innovation benefits the many, not the few. · We're looking for builders and creative thinkers ready to shape the next era of intelligence.The Role · You'll work directly with our founders to de ...

    Toronto, Ontario

    1 month ago

  • Work in company Remote job

    Senior AI/LLM Engineer Job Description

    Only for registered members

    We are building an AI platform that blends long context LLMs and tool calling agents into a seamless developer and end user experience. · Design and implement systems that make large models faster, smarter, and more context aware. · Evaluate LoRA adapters for domain specific beha ...

    $20 - $45 (USD) per hour

    3 weeks ago

  • Work in company Remote job

    Senior ML

    Only for registered members

    We are building LINA, an AI-powered communication assistant designed to help neurodivergent people interpret tone, intent. · LINA is moving into a local / edge-deployed architecture (enterprise and wearable use cases), with strict latency and privacy requirements.This is not a ch ...

    $30 - $60 (USD) per hour

    1 month ago

  • Work in company

    AI Architect

    Only for registered members

    We are seeking an experienced AI Architect to design next-generation enterprise AI systems with a strong focus on Agentic AI. · Define enterprise AI architecture incorporating Agentic RAG. · Establish architectural guardrails for tool schemas. · ...

    Toronto

    1 week ago

  • Work in company Remote job

    Azure Environment Setup for Open-Source LLM Model

    Only for registered members

    We are seeking an experienced · Azure specialist to set up an environment · for hosting the open-source LLM model.Cloud infrastructure · Deploying LLM models · ...

    1 month ago