Site Reliability Engineer (BB-0A4B2)

Found in: Talent CA

Job Description


Site Reliability Engineering (SRE) team members are responsible for keeping all production systems running smoothly. The SRE team is a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our environments. We specialize in systems covering operating systems, storage, networking, Linux, VMware, Windows, databases, monitoring, and performance. SRE team members are expected to be competent in many areas, and are expected to become subject matter experts in select areas.


  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide operational support and engineering for multiple large distributed software applications
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation



  • Bachelor’s degree in Engineering, Computer Science or equivalent experience.
  • Ability to program with one or more high level languages, such as Python, Go, Java, C/C++
  • A proactive approach to spotting problems, areas of improvement, and bottlenecks
  • Ability to adapt to working with a wide array of technologies and languages
  • Excellent verbal communication skills and ability to communicate technical subjects to a broad range of stakeholders
  • Knowledge of systems architecture, requirements development, integration, systems design, performance tuning, technology qualification, and site-reliability engineering

Strong knowledge of several of the following:

  • VMware Enterprise, Windows Server, Linux
  • Experience with networking, switches, routers, firewall configuration, and troubleshooting
  • Experience with distributed storage technologies like NFS, SAN, etc
  • AWS networking and security concepts


  • Previous success in technical engineering
  • Coding experience beyond simple scripts
  • Database administration: PostgreSQL or MS SQL

Additional Information

All applicants meeting minimum qualifications will be required to complete a 30 minutes online assessment as part of your candidate application

calendar_today11 hours ago


location_on Montreal, Canada

work Intelerad

I expressly authorise the Terms and Conditions

Similar jobs