
    Site Reliability Engineer - Mississauga, Canada - OSL Retail Services

    OSL Retail Services
    OSL Retail Services Mississauga, Canada

    Found in: Talent CA C2 - 6 days ago

    Description

    Overview

    It's an exciting time to be at OSL Retail Services, working for a people-focused company that's at the top of its game. The momentum we've generated in recent years with our commitments to client customers, innovation, business results, and an entrepreneurial spirit has created energy, enthusiasm, and engagement among our employees that is pushing us to new heights. And we're on the lookout for talented people who share our vision and values and want to join us in this journey. At OSL, our culture is our foundation. Passionate employees, great customer service and long-term relationships are all built upon that foundation. We value people, passion, honesty, respect, and integrity.

    About the role:

    As the Site Reliability Engineer, you will be instrumental in managing and maintaining the infrastructure for multiple partner products, including brands such as Walmart, Samsung Canada, Ted Baker, Brooks Brothers, and Lucky Brand. Your expertise will ensure the highest levels of reliability and performance, supporting our commitment to delivering exceptional service to our clients and customers. This hybrid role will be based out of our Mississauga, Ontario location.

    What you'll do:

  • Manage the OSL Omni Channel Production environment: a hybrid cloud environment (public/private) interconnected with several partners and with OSL-owned and field equipment, supporting multiple brands across the OSL portfolio.
  • Build out production monitoring: design and deploy infrastructure monitoring for all services, including API endpoints and external web applications and services.
  • Create active reporting and alerting: create dashboards to identify trends and bottlenecks, and develop alerting/escalation strategies to manage incidents effectively.
  • Implement security controls: ensure that the environment is secure and adheres to security best practices.
  • Implement operational controls and processes: ensure the mechanisms, safety controls, fuse breakers, and due diligence are in place to responsibly introduce changes and react to planned or unplanned events.

    What you've done:

  • 5+ years demonstrated experience in the field.
  • Strong knowledge of networking protocols and services (TCP/IP, DNS, DHCP, VPN).
  • Experience with cloud platforms such as AWS, Azure, or GCP is a plus.
  • Expert or near-expert skills with Linux, Windows, networking, storage, and virtualization
  • Experience with server provisioning and configuration management, utilizing tools such as Terraform, Ansible or Chef, delivering Infrastructure as Code
  • Experience with database administration of MSSQL and PostgreSQL
  • Observability, monitoring and alerting with tools like Prometheus and Datadog
  • Strong engineering background with experience in automation, configuration management, scripting, and security best practices
  • Familiarity with Infrastructure as Code (IaC) principles, automated builds, monitoring, and scaling
  • Experience with system and application monitoring tools such as Nagios, Graphite, Prometheus, Grafana, ELK, CollectD, StatsD, Datadog
  • Proficiency in Windows and Linux systems administration, including scripting, troubleshooting, and upgrading
  • Significant experience with automation and configuration management in a production environment (Puppet, Chef, Ansible)
  • Ability to take ownership of technical delivery and collaborate effectively with business partners
  • The ability to assume leadership responsibilities for troubleshooting and managing incidents in the environment is crucial. It requires weighing desired outcomes against risk and urgency. This responsibility may also involve working with OSL's wide network of partner service providers.
  • Strong scripting and coding capabilities, sufficient to integrate / instrument OSL's technology stack
  • Strong mentoring and advocacy skills for good design and engineering values
  • Excellent communication and stakeholder management skills, with the ability to convey complex technical concepts to non-technical audiences
  • Experience maintaining a 24x7 SaaS environment covering multiple time zones
  • Operational support experience with after-hours on-call responsibilities
  • ITIL Knowledge (Incident, Change, and Problem Management) and tools

    Working Conditions:

  • Flexibility to work various schedules, including evenings and weekends as required.

    What's in it for you:

  • Competitive base salary $80-120K plus bonuses and other perks
  • Vacation plus additional flex days
  • Comprehensive benefits
  • Training and development opportunities to grow your career with one of Canada's Best Managed Companies
  • A supportive workplace culture and work environment

  • Thermo Fisher Scientific

    Reliability Engineer

    Found in: Talent CA C2 - 2 weeks ago


    Thermo Fisher Scientific Mississauga, Canada Full time

    Job Description · As part of the Thermo Fisher Scientific team, you'll discover meaningful work that makes a positive impact on a global scale. Join our colleagues in bringing our Mission to life every single day to enable our customers to make the world healthier, cleaner and sa ...

  • Thermo Fisher Scientific

    Reliability Engineering

    Found in: beBee S2 CA - 2 weeks ago


    Thermo Fisher Scientific Mississauga, Canada TEMPORARY

    Job Description · This Co-Op position is a minimum of 12 months and will run from May 2024 through April 2025 · Summary: · The main focus of this position is to provide support for the Engineering department. · Essential Functions: · Researches, develops and implements processes ...

  • Abbott Laboratories

    Director, Site Reliability Engineering

    Found in: beBee S2 CA - 2 weeks ago


    Abbott Laboratories Mississauga, Canada OTHER

    About Abbott · Abbott is a global healthcare leader, creating breakthrough science to improve people's health. We're always looking towards the future, anticipating changes in medical science and technology. · Working at Abbott · At Abbott, you can do work that matters, grow, a ...

  • TWD

    Reliability Engineer

    Found in: Talent CA C2 - 1 day ago


    TWD Burlington, Canada Full time

    Reliability Engineer · TWD is an engineering, procurement, and construction management consulting company providing project development, execution and specialty engineering services for the oil and gas industry with expertise ranging from refinery, pipelines, terminalling and ble ...

  • Atlantic

    Mechanical Reliability Engineer

    Found in: Talent CA C2 - 5 days ago


    Atlantic Toronto, Canada Full time

    Posting Details · Job Details · Description · Reporting to the Maintenance Manager, the mechanical reliability engineer provides technical support to the production and facilities operations and to initiate, prioritize, and execute engineering and maintenance work to improve s ...

  • eTeam

    Site Reliability Engineer

    Found in: Talent CA C2 - 1 day ago


    eTeam Toronto, Canada

    Remote work · Duration: 4 months · Preference is for candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. · Job description: · Role Desc: · Defining and measuring reliability goals: SLIs, SLOs, a ...

  • Autodesk

    Site Reliability Engineer

    Found in: Talent CA C2 - 2 days ago


    Autodesk Toronto, Canada Full time

    Position Overview · Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this position, you will help build trusted services of APS (Autodesk Pl ...

  • IQVIA

    Site Reliability Engineer

    Found in: Talent CA C2 - 1 day ago


    IQVIA Vaughan, Canada Full time

    IQVIA's Digital Enablement Center of Excellence powers exceptional brand experiences, delivering innovative solutions based on a customer-first, insights-driven, and integrated omnichannel vision. We provide authenticated, enhanced data and analytics, innovative fit-for-purpose h ...

  • CGI

    System Reliability Engineer

    Found in: Talent CA C2 - 2 days ago


    CGI Toronto, Canada Full time

    Position Description: · We are Canada's largest independent information technology services firm, and after 40 years, we're still growing. Innovation, technology, and service delivery are our focus. Our goal is to ensure our clients remain ahead of the competition. We provide a f ...

  • Royal Bank of Canada

    Senior Site Reliability Engineer

    Found in: beBee S2 CA - 2 weeks ago


    Royal Bank of Canada Brampton, Canada OTHER

    · Job Posting · Job Summary · Job Description · WHAT IS THE OPPORTUNITY? · The Personal and Commercial Banking (P&CB) arm of RBC is in the preparation stages of launching a groundbreaking program to deliver the best customer experiences by equipping our advisors with the very la ...

  • Interop Labs

    Site Reliability Engineer

    Found in: Talent CA C2 - 3 weeks ago


    Interop Labs Toronto, Canada Full time

    Axelar delivers secure cross-chain communication for Web3. As a universal overlay network, Axelar supports general message passing and composability of programs via a proof-of-stake transport layer. Developer tools and APIs make it easy for both protocol and application developer ...

  • Atlantis IT group

    Site Reliability Engineer

    Found in: beBee S2 CA - 3 days ago


    Atlantis IT group Toronto, Canada Full time

    Role: SRE · Location: Toronto, ON · Duration: Fulltime · Skills and Responsibilities: · Owner of the Production Environment: Has independent veto power on changes. Is business aligned and understands business outcomes. · Experience owning change management, release management and Produ ...

  • BMO

    Director Site Reliability Engineering

    Found in: Talent CA 2 C2 - 6 days ago


    BMO Toronto, Canada

    Application Deadline: · 04/29/2024 · Address: · 33 Dundas Street West · This role is Hybrid (1-2 days per week in the office) · The Director - Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and business partners to cont ...

  • BMO

    Director Site Reliability Engineering

    Found in: Talent CA C2 - 6 days ago


    BMO Toronto, Canada Full time

    Application Deadline: · 04/29/2024 Address: · 33 Dundas Street West Job Family Group: · Technology This role is Hybrid (1-2 days per week in the office) · The Director – Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and ...

  • BenchSci

    Senior Site Reliability Engineer

    Found in: Talent CA C2 - 3 days ago


    BenchSci Toronto, Canada Full time (Remote)

    We are looking for a Senior Site Reliability Engineer to join our growing Core Services group, the Site Reliability Engineering team. Reporting to the Senior Engineering Manager, you'll apply your technical and domain expertise to solve complex technical and business challenges; ...

  • Royal Bank of Canada

    Lead Site Reliability Engineer

    Found in: Talent CA C2 - 4 days ago


    Royal Bank of Canada Toronto, Canada Full time

    Job Description · What Is The Opportunity? · We are looking to expand our Digital team at RBC. If you are looking for an exciting, high-growth opportunity with a leading financial institution that is accelerating cloud native area this could be the job for you. Are you looking ...

  • BMO

    Director Site Reliability Engineering

    Found in: beBee S2 CA - 2 weeks ago


    BMO Toronto, Canada Full time

    Application Deadline: · 04/29/2024 · Address: · 33 Dundas Street West · Job Family Group: · Technology · This role is Hybrid (1-2 days per week in the office) · The Director – Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and bu ...

  • Royal Bank of Canada

    Lead Site Reliability Engineer

    Found in: Talent CA C2 - 3 weeks ago


    Royal Bank of Canada TORONTO, Canada Full time

    Job Summary · Job Description · What Is The Opportunity? · We are looking to expand our Digital team at RBC. If you are looking for an exciting, high-growth opportunity with a leading financial institution that is accelerating cloud native area this could be the job for you. Are ...

  • Tucows

    Sr. Database Reliability Engineer

    Found in: beBee S2 CA - 2 weeks ago


    Tucows Toronto, Canada Full time

    Sr Database Reliability Engineer · Wavelo is a SaaS business on a mission to make telecoms a breeze. · We provide flexible software that modernizes how communication service providers (CSPs) do business, helping them drive more value, focus on customer experience, and scale their ...

  • RBC - Royal Bank

    Lead Site Reliability Engineer

    Found in: beBee S2 CA - 2 weeks ago


    RBC - Royal Bank Toronto, Canada Full time

    Job Summary · Job Description · What Is The Opportunity? · We are looking to expand our Digital team at RBC. If you are looking for an exciting, high-growth opportunity with a leading financial institution that is accelerating cloud native area this could be the job for you. Are ...