    System Reliability Engineer - Toronto, Canada - CGI

    CGI
    CGI Toronto, Canada

    Found in: Talent CA C2 - 5 days ago

    Full time
    Description

    Position Description:

    We are Canada's largest independent information technology services firm, and after 40 years, we're still growing. Innovation, technology, and service delivery are our focus. Our goal is to ensure our clients remain ahead of the competition. We provide a full spectrum of managed services, from IT and business process outsourcing to systems integration and consulting, that are transforming our clients' operations and helping them succeed.

    Do you enjoy working with a highly motivated and talented team to deliver mission-critical developer tooling? We are currently expanding our System Reliability Engineering team, which helps one of our key clients deploy, manage, troubleshoot, and enhance their developer tooling platform, servicing over developers.

    As a System Reliability Engineer, you will be responsible for designing, implementing, and supporting a variety of developer productivity tools, including Ansible Tower, GitLab, Artifactory, and SonarQube. The technology stack used to manage the platform includes Ansible, Terraform, Python, Prometheus, Splunk, and ELK.

    You will build automation solutions to provision and validate infrastructure and help debug and resolve problems. You will help to improve operational performance by focusing on user experience, effectively assessing and managing risk, and minimizing the impact of failures.

    Responsibilities

    •Keeping all components of the developer productivity platform up and running

    •Working closely with internal partners and platform users to ensure that all services meet security, SLA, and performance requirements

    •Writing, updating, and using documentation, including runbooks and playbooks

    •Automating infrastructure deployment, testing, application failover, failure mitigation, user self-service functions, and more

    •Debugging complex problems across the entire stack

    •Participating in various meetings with the Operations and Delivery teams

    •Leading daily and weekly meetings to discuss the overall health of the systems

    •Leading Root Cause Analysis calls

    •Proposing and implementing monitoring improvements, optimizations, and automation opportunities

    •Taking part in PI (Program Increment) Planning sessions

    Key Skills and Attributes

    •5 years of experience with software engineering, software development, or system operations

    •Experience working with Linux, including writing shell scripts and understanding Linux internals and performance tuning

    •Strong understanding of networking principles

    •Experience debugging large scale complex systems in production

    •Experience in building, implementing, and supporting highly available production systems

    •Experience automating infrastructure and deployments using Terraform, Ansible, and Python or equivalent technologies

    •Understanding of DevOps engineering, CI/CD, and software deployment

    •Working knowledge of developer tooling such as Artifactory, GitLab, SonarQube, and Ansible Tower

    •Experience with various monitoring and observability tools

    •Experience deploying and managing workloads on one of the major public cloud platforms or on private clouds such as OpenStack

    •Experience deploying and managing workloads on one of the major container management platforms like Kubernetes, OpenShift, PCF or Rancher

    •A curiosity about how complex socio-technical systems operate and what happens during failure

    It's not expected that any single candidate would have experience across all these areas – we are looking for someone who is strong in a few areas and has interest and curiosity in others.

    Skills:

  • DevOps Engineering
  • GitHub
  • OpenShift
  • Linux

  • Manulife Insurance Malaysia

    Senior Site Reliability Engineer

    Found in: Jooble CA O C2 - 1 day ago


    Manulife Insurance Malaysia Toronto, ON, Canada

    Senior Site Reliability Engineer · Apply · Locations: Waterloo, Ontario; Toronto, global headquarters (200 Bloor) · Time type: Full time · Posted: Yesterday · Job requisition ID: JR · We are a financial services provider ...

  • Tata Consultancy Services

    Reliability Engineer

    Found in: Appcast CA C2 A - 2 days ago


    Tata Consultancy Services Toronto, Canada

    About TCS: · TCS operates on a global scale, with a diverse talent base of more than 600,000 associates representing 153 nationalities across 55 countries. TCS has been recognized as a Global Top Employer by the Top Employers Institute - one of only eight companies worldwide to h ...

  • CSG Talent

    Reliability Engineer

    Found in: beBee S2 CA - 2 days ago


    CSG Talent Ontario, Canada Full time

    Join a leading mining company in Canada as a Reliability Engineer. This is an excellent opportunity to grow your career in the maintenance department of a large mining company with global assets. · This is a residential role and it comes with a very attractive salary and a great re ...

  • Cedent Consulting Inc

    Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 days ago


    Cedent Consulting Inc Toronto, ON, Canada

    Site Reliability Engineer (Mississauga, ON; Toronto, ON) · Title: Site Reliability Engineer · Terms of Hire: Full Time. · Salary: $ Open / yr + Benefits. · Job Description: · Seeking highly qualified Site Reliability Engineers and Architects with experience developing and ...

  • Akamai

    Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 days ago


    Akamai Toronto, ON, Canada

    Site Reliability Engineer II · Do you have a passion for cutting edge technologies and tackling system problems? Are you a self-starting professional who thrives in a dynamic environment? Join our Site Reliability team. Our Team builds and delivers highly secure network security ...

  • Tata Consultancy Services

    Reliability Engineer Job

    Found in: Jooble CA O L C2 - 2 days ago


    Tata Consultancy Services Toronto, ON, Canada

    TCS has been recognized as a Global Top Employer by the Top Employers Institute - one of only eight companies worldwide to have achieved this status. Our organizational structure is domain-led and designed to offer businesses a single window into industry-specific solutions. Our ...

  • Cedent Consulting Inc

    Site Reliability Engineer

    Found in: Jooble CA O L C2 - 2 days ago


    Cedent Consulting Inc Toronto, ON, Canada Full time

    Site Reliability Engineer (Mississauga, ON; Title: Site Reliability Engineer · Terms of Hire: Full Time. · Seeking highly qualified Site Reliability Engineers and Architects with experience developing and building high-performing, scalable, enterprise applications. Our engin ...

  • CB Canada

    Site Reliability Engineer

    Found in: Talent CA 2 C2 - 2 days ago


    CB Canada Toronto, Canada

    Site Reliability Engineer · On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. · Site Reliability Engineer – Job Description · Azure cloud · Jira and Confluence · CI/CD · Experience with automating (provisioning, configuration m ...

  • Umicore Belgium

    Reliability Engineer Job

    Found in: Jooble CA O L C2 - 1 day ago


    Umicore Belgium Ontario, Canada

    Maintenance · Powering the cars of the future. We are the leading circular materials technology company fulfilling its mission to create materials for a better life. Umicore is preparing to build a battery materials production plant in Loyalist Township, Ontario - the first of i ...

  • Matillion

    Site Reliability Engineer

    Found in: Jooble CA O L C2 - 2 days ago


    Matillion Toronto, ON, Canada

    Matillion is The Data Productivity Cloud · We are on a mission to power the data productivity of our customers and the world, by helping teams get data business ready, faster. Our technology allows customers to load, transform, sync, and orchestrate their data. · We are looking ...

  • eTeam

    Site Reliability Engineer

    Found in: Talent CA C2 - 4 days ago


    eTeam Toronto, Canada

    Remote work · Duration: 4 months. Preference is for candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. · Job description · Role description: · Defining and measuring reliability goals: SLIs, SLOs, a ...

  • Autodesk

    Site Reliability Engineer

    Found in: Talent CA C2 - 4 days ago


    Autodesk Toronto, Canada Full time

    Position Overview · Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this position, you will help build trusted services of APS (Autodesk Pl ...

  • Autodesk

    Site Reliability Engineer

    Found in: Talent CA C2 - 2 days ago


    Autodesk Toronto, Canada Full time

    Position Overview · We have an exciting new opportunity for a Site Reliability Engineer within the Autodesk Fusion 360 Data and Process Management team. The successful candidate will be first responder, performance analyst, system architect, capacity planner, and monitoring expe ...

  • KBC Technologies Group

    Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 days ago


    KBC Technologies Group Toronto, ON, Canada

    KBC Technologies Job Description · KBC Technologies is an 'ISO Certified' Global IT Solutions, Services and Outsourcing Company with its major focus in the IT Secondment and Managed Services domain to support our clients in expanding their business operations on a global basis. KBC ha ...

  • KBC Technologies Group

    Site Reliability Engineer

    Found in: Jooble CA O L C2 - 15 hours ago


    KBC Technologies Group Toronto, ON, Canada

    KBC Technologies is an 'ISO Certified' Global IT Solutions, Services, and Outsourcing Company with its major focus in IT Secondment and Managed Services domain to support our clients in expanding their business operations on a global basis. KBC has its registered offices in UAE, ...

  • Interop Labs

    Site Reliability Engineer

    Found in: Talent CA C2 - 3 weeks ago


    Interop Labs Toronto, Canada Full time

    Axelar delivers secure cross-chain communication for Web3. As a universal overlay network, Axelar supports general message passing and composability of programs via a proof-of-stake transport layer. Developer tools and APIs make it easy for both protocol and application developer ...

  • Atlantis IT group

    Site Reliability Engineer

    Found in: beBee S2 CA - 5 days ago


    Atlantis IT group Toronto, Canada Full time

    Role: SRE · Location: Toronto, ON · Duration: Full time · Skills and Responsibilities: · Owner of the Production Environment: Has independent veto power on changes. Is business aligned and understands business outcomes. · Experience owning change management, release management and Produ ...

  • Autodesk, Inc.

    Site Reliability Performance Engineer

    Found in: Jooble CA O C2 - 1 day ago


    Autodesk, Inc. Ontario, Canada

    Site Reliability Engineer · Locations: Toronto, ON, CAN · Time type: Full time · Posted: Yesterday · Job requisition ID: 24WD77326 · Position Overview · Autodesk, the leading Design and Make Software ...

  • Akamai

    Senior Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 days ago


    Akamai Toronto, ON, Canada

    Join our Origin Service SRE Team · Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges? Our team is responsible for monitoring and measuring the reliability of our suite of origin service produ ...

  • Matillion

    Site Reliability Performance Engineer

    Found in: Jooble CA O C2 - 2 days ago


    Matillion Toronto, ON, Canada

    Matillion is The Data Productivity Cloud. · We are on a mission to power the data productivity of our customers and the world, by helping teams get data business ready, faster. Our technology allows customers to load, transform, sync and orchestrate their data. We are looking fo ...