- Manage OSL Omni Channel Production environment: Hybrid cloud environment (. Public/Private) interconnected to several partners and OSL-owned and field equipment supporting multiple brands across the OSL portfolio.
- Build out production monitoring: design and deploy infrastructure monitoring for all services, including API endpoints and external web applications and services.
- Create active reporting and alerting: create dashboards to identify trends and bottlenecks, developing alerting/escalation strategies to manage incidents effectively.
- Implement security controls: ensuring that the environment is secure and adheres to security best practices.
- Implement operational controls and processes : ensuring the mechanisms, safety controls, fuse breakers, and due diligence are in place to responsibly introduce changes and react to planned or unplanned events.
- 5+ years demonstrated experience in the field.
- Strong knowledge of networking protocols and services (TCP/IP, DNS, DHCP, VPN, .
- Experience with cloud platforms such as AWS, Azure, or GCP is a plus.
- Expert or near-expert skills with Linux, Windows, networking, storage, and virtualization
- Experience with server provisioning and configuration management, utilizing tools such as Terraform, Ansible or Chef, delivering Infrastructure as Code
- Experience with database administration of MSSQL and PostgreSQL
- Observability, monitoring and alerting with tools like Prometheus and Datadog
- Strong engineering background with experience in automation, configuration management, scripting, and security best practices
- Familiarity with Infrastructure as Code (IaC) principles, automated builds, monitoring, and scaling
- Experience with system and application monitoring tools such as Nagios, Graphite, Prometheus, Grafana, ELK, CollectD, StatsD, DataDog
- Proficiency in Windows and Linux systems administration, including scripting, troubleshooting, and upgrading
- Significant experience with automation and configuration management in a production environment (Puppet, Chef, Ansible)
- Ability to take ownership of technical delivery and collaborate effectively with business partners
- The ability to assume leadership responsibilities for troubleshooting and managing incidents in the environment is crucial. It requires weighing desired outcomes against risk and urgency. This responsibility may also involve working with OSL's wide network of partner service providers.
- Strong scripting and coding capabilities, sufficient to integrate / instrument OSL's technology stack
- Strong mentoring and advocacy skills for good design and engineering values
- Excellent communication and stakeholder management skills, with the ability to convey complex technical concepts to non-technical audiences
- Experience maintaining a 24x7 SaaS environment covering multiple time zones
- Operational support experience with after-hours on-call responsibilities
- ITIL Knowledge (Incident, Change, and Problem Management) and tools
- Flexibility to work various schedules, including evenings and weekends as required.
- Competitive base salary $80-120K plus bonuses and other perks
- Vacation plus additional flex days
- Comprehensive benefits
- Training and development opportunities to grow your career with one of Canada's Best Managed Companies
- A supportive workplace culture and work environment
-
Reliability Engineer
Found in: Talent CA C2 - 2 weeks ago
Thermo Fisher Scientific Mississauga, Canada Full timeJob Description · As part of the Thermo Fisher Scientific team, you'll discover meaningful work that makes a positive impact on a global scale. Join our colleagues in bringing our Mission to life every single day to enable our customers to make the world healthier, cleaner and sa ...
-
Reliability Engineering
Found in: beBee S2 CA - 2 weeks ago
Thermo Fisher Scientific Mississauga, Canada TEMPORARYJob Description · This Co-Op position is a minimum of 12 months and will run from May 2024 through April 2025 · Summary: · The main focus of this position is to provide support for the Engineering department. · Essential Functions: · Researches, develops and implements processes ...
-
Director, Site Reliability Engineering
Found in: beBee S2 CA - 2 weeks ago
Abbott Laboratories Mississauga, Canada OTHERAbout Abbott · Abbott is a global healthcare leader, creating breakthrough science to improve people's health. We're always looking towards the future, anticipating changes in medical science and technology. · Working at Abbott · At Abbott, you can do work that matters, grow, a ...
-
Reliability Engineer
Found in: Talent CA C2 - 1 day ago
TWD Burlington, Canada Full timeReliability Engineer · TWD is an engineering, procurement, and construction management consulting company providing project development, execution and specialty engineering services for the oil and gas industry with expertise ranging from refinery, pipelines, terminalling and ble ...
-
Mechanical Reliability Engineer
Found in: Talent CA C2 - 5 days ago
Atlantic Toronto, Canada Full timePosting Details · Job Details · Description · Reporting to the Maintenance Manager, the mechanical reliability engineer provides technical support to the production and facilities operations and to initiate, prioritize, and execute engineering and maintenance work to improve s ...
-
Site Reliability Engineer
Found in: Talent CA C2 - 1 day ago
eTeam Toronto, CanadaRemote work · Duration - 4 months - Preference is to find candidates who are willing to be converted to full time employee . The conversion decision will be made based on performance. · Job description - ::: · Role Desc : · Defining and measuring reliability goals—SLIs, SLOs, a ...
-
Site Reliability Engineer
Found in: Talent CA C2 - 2 days ago
Autodesk Toronto, Canada Full timePosition Overview · Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. On this position, you will help build trusted services of APS (Autodesk Pl ...
-
Site Reliability Engineer
Found in: Talent CA C2 - 1 day ago
IQVIA Vaughan, Canada Full timeIQVIA's Digital Enablement Center of Excellence powers exceptional brand experiences, delivering innovative solutions based on a customer-first, insights-driven, and integrated omnichannel vision. We provide authenticated, enhanced data and analytics, innovative fit-for-purpose h ...
-
System Reliability Engineer
Found in: Talent CA C2 - 2 days ago
CGI Toronto, Canada Full timePosition Description: · We are Canada's largest independent information technology services firm, and after 40 years, we're still growing Innovation, technology, and service delivery are our focus. Our goal is to ensure our clients remain ahead of the competition. We provide a f ...
-
Senior Site Reliability Engineer
Found in: beBee S2 CA - 2 weeks ago
Royal Bank of Canada Brampton, Canada OTHER· Job Posting · Job Summary · Job Description · WHAT IS THE OPPORTUNITY? · The Personal and Commercial Banking (P&CB) arm of RBC is in the preparation stages of launching a groundbreaking program to deliver the best customer experiences by equipping our advisors with the very la ...
-
Site Reliability Engineer
Found in: Talent CA C2 - 3 weeks ago
Interop Labs Toronto, Canada Full timeAxelar delivers secure cross-chain communication for Web3. As a universal overlay network, Axelar supports general message passing and composability of programs via a proof-of-stake transport layer. Developer tools and APIs make it easy for both protocol and application developer ...
-
Site Reliability Engineer
Found in: beBee S2 CA - 3 days ago
Atlantis IT group Toronto, Canada Full timeRole: SRE · Location: Toronto, ONDuration: FulltimeSkills and Responsibilities: · Owner of the Production Environment: Has independent veto power on changes. Is business aligned and understands business outcomes. · Experience owning change management, release management and Produ ...
-
Director Site Reliability Engineering
Found in: Talent CA 2 C2 - 6 days ago
BMO Toronto, CanadaApplication Deadline: · 04/29/2024 · Address: · 33 Dundas Street West · This role is Hybrid (1-2 days per week in the office) · The Director - Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and business partners to cont ...
-
Director Site Reliability Engineering
Found in: Talent CA C2 - 6 days ago
BMO Toronto, Canada Full timeApplication Deadline: · 04/29/2024 Address: · 33 Dundas Street West Job Family Group: · Technology This role is Hybrid (1-2 days per week in the office) · The Director – Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and ...
-
Senior Site Reliability Engineer
Found in: Talent CA C2 - 3 days ago
BenchSci Toronto, Canada Full time (Remote)We are looking for a Senior Site Reliability Engineer to join our growing Core Services group, the Site Reliability Engineering team Reporting to the Senior Engineering Manager, you'll apply your technical and domain expertise to solve complex technical and business challenges; ...
-
Lead Site Reliability Engineer
Found in: Talent CA C2 - 4 days ago
0000050007 Royal Bank of Canada Toronto, Canada Full timeJob Description · What Is The Opportunity? · We are looking to expand our Digital team at RBC. If you are looking for an exciting, high-growth opportunity with a leading financial institution that is accelerating cloud native area this could be the job for you. Are you looking ...
-
Director Site Reliability Engineering
Found in: beBee S2 CA - 2 weeks ago
BMO Toronto, Canada Full timeApplication Deadline: · 04/29/2024Address: · 33 Dundas Street WestJob Family Group: · TechnologyThis role is Hybrid (1-2 days per week in the office) · The Director – Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and bu ...
-
Lead Site Reliability Engineer
Found in: Talent CA C2 - 3 weeks ago
Royal Bank of Canada TORONTO, Canada Full timeJob Summary · Job Description · What Is The Opportunity? · We are looking to expand our Digital team at RBC. If you are looking for an exciting, high-growth opportunity with a leading financial institution that is accelerating cloud native area this could be the job for you. Are ...
-
Sr. Database Reliability Engineer
Found in: beBee S2 CA - 2 weeks ago
Tucows Toronto, Canada Full timeSr Database Reliability Engineer · Wavelo is a SaaS business on a mission to make telecoms a breeze. · We provide flexible software that modernizes how communication service providers (CSPs) do business, helping them drive more value, focus on customer experience, and scale their ...
-
Lead Site Reliability Engineer
Found in: beBee S2 CA - 2 weeks ago
RBC - Royal Bank Toronto, Canada Full timeJob Summary · Job Description · What Is The Opportunity? · We are looking to expand our Digital team at RBC. If you are looking for an exciting, high-growth opportunity with a leading financial institution that is accelerating cloud native area this could be the job for you. Are ...
Site Reliability Engineer - Mississauga, Canada - OSL Retail Services
Description
Overview
It's an exciting time to be at OSL Retail Services, working for a people-focused company that's at the top of its game. The momentum we've generated in recent years with our commitments to client customers, innovation, business results, and an entrepreneurial spirit has created energy, enthusiasm, and engagement among our employees that is pushing us to new heights. And we're on the lookout for talented people who share our vision and values and want to join us in this journey. At OSL, our culture is our foundation. Passionate employees, great customer service and long-term relationships are all built upon that foundation. We value people, passion, honesty, respect, and integrity.
About the role:
As the Site Reliability Engineer , you will be instrumental in managing and maintaining the infrastructure for multiple partner products, which includes brands such as Walmart, Samsung Canada, Ted Baker, Brooks Brothers, and Lucky Brand. Your expertise will ensure the highest levels of reliability and performance, supporting our commitment to delivering exceptional service to our clients and customers. This hybrid role will be based out of our Mississauga, Ontario location.
What you'll do:
What you've done:
Working Conditions:
What's in it for you: