Python Systems Engineer for Local Air-Gapped LLM Inference
3 weeks ago

Job description
Similar jobs
EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class ...
4 days ago
Engineer a massive-scale inference provider to undercut the current market price. Serve 600,000 concurrent users on a decentralized fleet of GPUs. Optimize throughput to achieve a cost basis below $0.02 per million tokens. · Create a highly optimized container for GPU hardware detection. · C ...
1 month ago
I'm looking to hire a consultant to help build a local LLM inference + RAG system that can ingest and reason over large document sets — primarily aircraft sales contracts and theological texts. · ...
1 month ago
We think that AI systems should be flexible, personalized and accessible to everyone. · Design and build our LLM inference stack from zero to one. · Develop and optimize inference using modern frameworks. · Collaborate closely with founders and model developers to analyze bottlen ...
1 month ago
About the Company · A next-generation AI startup with Silicon Valley roots is hiring an LLMOps Engineer in Toronto to design, deploy, and optimize large-scale LLM infrastructure powering AI-native semiconductor design tools. With $33M+ in funding and rapid growth, the company is s ...
3 days ago
We are looking for a senior AI/backend engineer to work on a production-grade vision + LLM inference pipeline. · This role is focused on system design and implementation, not experimentation or prompt-only work. · Python (FastAPI or similar backend frameworks) · Vision-enabled LL ...
2 weeks ago
We are building the next-generation enterprise product suite to empower semiconductor design engineers to achieve a 10x productivity boost with proprietary AI/ML models and modern cloud technologies. · Design and implement production-ready LLM deployment pipelines on AWS and Kub ...
1 month ago
We are looking for an experienced AI / LLM Engineer to support the design and implementation of a secure on-premise Large Language Model (LLM) pipeline for clinical text analysis. ...
1 month ago
We are looking for a senior AI / backend engineer to work on a production-grade vision + LLM inference pipeline. This role is focused on system design and implementation, not experimentation or prompt-only work. · The system combines: · Vision-enabled LLMs · Optional secondary mo ...
13 hours ago
Korean-English Bilingual AI Engineer — Healthcare LLM
We're building an LLM-powered assistant (agent) that can answer Korean healthcare-domain questions accurately and safely. · ...
1 month ago
We are building a serious AI product focused on transforming real-world business conversations into structured intelligence insights and automation. · AI pipelines that analyze recorded conversations (speech → text → structured insights) · LLM-based systems for summarization classifi ...
4 weeks ago
We're looking for an AI engineer who can architect and implement production-grade LLM systems—someone equally comfortable fine-tuning models on custom datasets and orchestrating inference APIs that serve at scale. · This is a hybrid role that sits at the intersection of ML engine ...
1 month ago
As a Research Intern at Dell Technologies Office of CTO (OCTO), you will be part of a world-class research team delving deep into software and hardware aspects of next-generation Generative AI techniques, working on developing high-performance systems that push the state-of-a ...
3 weeks ago
We're hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, · authentication and inference to all models · subscription management and subscription entitlement (e.g. context-length, concurrency limits) · and providing the necessary API sur ...
1 month ago
You will be part of a world-class research team delving deep into software and hardware aspects of next-generation Generative AI techniques, working on developing high-performance systems that push the state of the art. · For this specific project you will use your deep expertise in ...
3 weeks ago
We believe efficiency is what makes AI possible - it's how we expand access and ensure innovation benefits the many, not the few. · We're looking for builders and creative thinkers ready to shape the next era of intelligence. · The Role · You'll work directly with our founders to de ...
1 month ago
We are building an AI platform that blends long-context LLMs and tool-calling agents into a seamless developer and end-user experience. · Design and implement systems that make large models faster, smarter, and more context-aware. · Evaluate LoRA adapters for domain-specific beha ...
3 weeks ago
We are building LINA, an AI-powered communication assistant designed to help neurodivergent people interpret tone and intent. · LINA is moving into a local / edge-deployed architecture (enterprise and wearable use cases), with strict latency and privacy requirements. · This is not a ch ...
1 month ago
We are seeking an experienced AI Architect to design next-generation enterprise AI systems with a strong focus on Agentic AI. · Define enterprise AI architecture incorporating Agentic RAG. · Establish architectural guardrails for tool schemas. · ...
1 week ago
Azure Environment Setup for Open-Source LLM Model
We are seeking an experienced Azure specialist to set up an environment for hosting the open-source LLM model. · Cloud infrastructure · Deploying LLM models · ...
1 month ago