Linux System Administrator - Burlington, Canada - BitFirms Inc

BitFirms Inc
BitFirms Inc
Verified Company
Burlington, Canada

3 weeks ago

Sophia Lee

Posted by:

Sophia Lee

beBee Recruiter


Description
'''Overview:


We are an innovative startup specializing in high-performance computing solutions for a diverse range of tasks, I've personally been doing this for 8 years with the company beginning last year.

We're colocated in a tier 3 data center in Mississauga with 6 racks of hardware consisting of hundreds of NVIDIA GPUs of all flavours, we decide where to deliver our computational hardware to best optimize utilization & return.

We are seeking a highly skilled HPC Systems Administrator to manage and optimize our GPU-intensive computing environments, ensuring maximum performance, reliability, uptime and efficiency.


Key Responsibilities:

-
GPU Management & Optimization: Oversee the software operation of hundreds of NVIDIA GPUs, ensuring they are optimized for high-performance tasks. Monitor performance, manage workloads, and troubleshoot any issues to maintain peak efficiency.
-
Linux Systems Administration: Administer a large infrastructure of Ubuntu and Debian servers dedicated to HPC. Perform installations, configurations, updates, and maintenance tasks to ensure the stability and security of the computing environment.
-
Automation & Scripting: Develop and implement scripts and automation tools to streamline operations. Utilize Python, Bash, or other scripting languages to automate deployment, monitoring, and management tasks.
-
Research & Development: Stay abreast of the latest developments in HPC and GPU technologies. Test and evaluate new tools, software, and methodologies to enhance our computing capabilities.
-
Project Management: Lead and manage projects aimed at expanding and enhancing our HPC resources.


Requirements:

-
Expertise in NVIDIA GPU Computing: Deep understanding of NVIDIA GPU architectures and experience managing GPU-accelerated computing environments.
-
Proficiency in Linux (Ubuntu/Debian): Extensive experience with Linux system administration, specifically in Ubuntu or Debian environments. Familiarity with Linux networking, ports and security.
-
Scripting & Automation Skills: Strong scripting skills in Python, Bash, or similar, with a focus on automation and systems management.
-
Problem-Solving & Analytical Skills: Excellent analytical abilities and a problem-solving mindset, capable of addressing complex technical challenges in an HPC context.
-
Communication & Teamwork: Strong communication skills and the ability to work collaboratively within a small team.
-
Education & Experience: Self trained or a degree in Computer Science, Engineering, or a related field, ideally with several years of experience in managing or using gpu systems for any high-performance compute tasks (including mining).


Benefits:


  • Opportunity to work with cuttingedge HPC technologies and make a significant impact in various industries.
  • Flexible working hours, ability to work remote in the future and a commitment to worklife balance.
  • A cool startup environment utilizing the latest cutting edge hardware that encourages creativity and professional growth.

Join Our Team:


If you are passionate about HPC, possess a deep understanding of NVIDIA GPUs, and are skilled in managing Linux-based computing environments, we invite you to apply.

Join us at the forefront of computational innovation and play a key role in driving the success of our high-performance computing services.


Job Types:
Full-time, Freelance


Salary:
$48,594.89-$86,992.96 per year


Benefits:


  • Dental care
  • Extended health care
  • Paid time off
  • Work from home

Flexible Language Requirement:

  • French not required

Schedule:

  • Monday to Friday

Supplemental pay types:

  • Overtime pay

Experience:

- system administration: 2 years (preferred)

Work Location:
Hybrid remote in Burlington, ON L7L 6X6

More jobs from BitFirms Inc