Site Reliability Engineering - Montreal, Canada - Cisco

Cisco Montreal, Canada

2 weeks ago

Description

Who We Are

As a part of Cisco, Accedian is a leader in performance analytics and end-user experience solutions for service providers and mid-to-large-size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive multi": multi-layer, multi-domain, and multi-vendor networks. Accedian's open and scalable platform removes roadblocks to innovation, enabling cloud-native analytics and empowering customers to launch new assured services based on 5G, SD-WAN, and edge technologies.

Who You Are

You are an expert in deployment and network operations, skilled in using scripts and automation tools to enhance software processes. With a passion for scripting and automation, you contribute to effective software strategies, oversee maintenance, and optimize systems. Proficient with Kubernetes and Docker Swarm, you seek new ways to monitor deployment health and performance. Your proactive nature and dedication to tech excellence make you a valuable team member in operational efficiency and reliability.

Who You'll Work With

Our team prioritizes your growth in technical, business, and soft skills within a culture that values team strength and investment. We adopt a "You build it, you run it" approach, empowering team members to actively manage and improve our software. Committed to continuous learning, we support mastering new technologies and champion a culture of ambition and innovation in cloud computing.

What You'll Do

Our growing team is looking for a dedicated Site Reliability Engineering (SRE) professional to work with a small, innovative team of industry experts and help perfect our platform by improving our automation processes around deployment and operations.

You will take charge of enhancing the product life cycle, manage configuration, assist in deployment and scripting for management purposes, and collaborate within a cross-functional team. You will also spearhead initiatives and orchestrate the DevOps cycle. Your responsibilities will include:

Monitoring our cloud and customer on-premises infrastructure: Assessing its health to offer 24/7 service to our customers.

Detecting potential issues: Configuring monitoring to catch them before an outage occurs.

Participating in system troubleshooting: Recommending improvements to our platform and tools, to regular and systematic code testing, and to deployment.

Supporting our public cloud deployments: Researching, proposing, and participating in the implementation of security best practices for public cloud deployments and data management.

Prioritizing and escalating: Raising problems to Development, collaborating with our Operations lead and on-call engineer to investigate operational issues impacting users and identify root causes.

Driving automation development: Building configuration management tools and scripts to address operational incidents.

Improving our security posture: Enforcing policies for environment security and applying them to our DevOps tools.

This role includes periodic participation in an on-call rotation approximately once every six weeks.

Minimum Qualifications:

12 years of related experience as a Software Engineer, DevOps Engineer, Site Reliability Engineer or a role in a related field.

Experience administering cloud or virtualized environments using the UNIX/Linux command line and scripting.

IT support experience focused on handling and troubleshooting system-wide solutions.

Demonstrated experience deploying multi-service applications on cloud platforms such as AWS, Google Cloud, or Azure using a modern toolset.

Experience in developing continuous monitoring and automated alerting systems to ensure the stability and reliability of IT systems.

Preferred Qualifications:

Experience with configuration management tools such as Ansible, Salt, Puppet, Chef, or similar.

Bachelor's degree in a STEM-related discipline.

A deep understanding of Docker containerization and orchestration, with Kubernetes experience.

Knowledge of IP networking, VPNs, DNS, load balancing, and firewall management.

Familiarity with infrastructure management solutions, including experience with HashiCorp Terraform and HashiCorp Vault.

Experience in setting up and maintaining continuous integration and deployment pipelines.

Ability to write and speak French.

Job: Software Developer – Site Reliability Engineering

1 week ago

Stingray Montreal, Canada

Software Developer SRE - IT Department · Location: Montréal · At Stingray, creativity, collaboration, and innovative technology are the pillars of our DNA. Are you ready to rock your career by joining a fast-growing company, a team of people passionate about music ...
Job: Software Developer – Site Reliability Engineering

1 week ago

Stingray Montreal, Canada

Software Developer SRE - IT Department · Location: Montréal · At Stingray, creativity, collaboration, and innovative technology are the pillars of our DNA. Are you ready to rock your career by joining a fast-growing company, a team of people passionate about music ...
Job: Software Developer – Site Reliability Engineering

2 weeks ago

Stingray Montreal, Canada

IT Department · Location: Montréal · At Stingray, creativity, collaboration, and innovative technology are the pillars of our DNA. Are you ready to rock your career by joining a fast-growing company, a team of people passionate about music, in an environment ...
Site Reliability Engineering

2 days ago

Cisco Montreal, Canada

Who We Are · As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the m ...
Site Reliability Engineer

1 week ago

LanceSoft, Inc. Montreal, Canada

Job Description: · We are growing our team globally. It's a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by de ...
Site Reliability Engineering

2 days ago

Cisco Systems, Inc. Montreal, Canada

Site Reliability Engineering - Technical Leader · Job Type: Professional · Cloud and Data Center, Software Development · Who We Are · As a part of Cisco, Accedian is a leader in per ...
Site Reliability Engineer

5 days ago

LanceSoft, Inc. Montréal, QC, Canada

Job Description: We are growing our team globally. It's a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by develop ...
Site Reliability Engineer

5 days ago

Cisco Montréal, QC, Canada

As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive multi" - multi-l ...
Site Reliability Engineer

7 hours ago

Lyft Montreal, Canada

At Lyft, our mission is to improve people's lives with the world's best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering ...
Site Reliability Engineering

2 weeks ago

Cisco Montreal, Canada

Who We Are · As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive m ...
Site Reliability Engineer

5 days ago

Cisco Systems, Inc. Montréal, QC, Canada Full time

Cloud and Data Center, Software Development · As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end ...
Site Reliability Engineer

2 days ago

SAP Montreal, Canada Regular Full time

We help the world run better · Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, an ...
Site Reliability Engineer

1 day ago

Lyft Montreal, Canada

At Lyft, our mission is to improve people's lives with the world's best transportation. To create the best transportation experience for all, we start in our own community by creating an open, inclusive, and diverse organization where all team members are recognized for what they ...
Site Reliability Engineer

1 week ago

TMX Montreal, Canada

Venture outside the ordinary - TMX Careers The TMX group of companies includes leading global exchanges such as the Toronto Stock Exchange, Montreal Exchange, and numerous innovative organizations enhancing capital markets. United as a global team, we're connecting cross-function ...
Site Reliability Engineer

1 week ago

SAP Montreal, Canada

About Us We help the world run better. Our company culture is focused on helping our employees enable innovation by building breakthroughs together. Every day, we focus on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, ...
Site Reliability Engineer

2 days ago

SAP Montreal, Canada

We help the world run better · Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and ...
Site Reliability Engineering

1 week ago

Cisco Montreal, Canada

Who We Are As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive multi ...
Site Reliability Engineer

1 week ago

LanceSoft, Inc. Montreal, Canada

Job Title: Production Reliability & Support Expert (SRE) · Location: Montreal (Office attendance from Day 1 – Hybrid mode 3x per week) · Years of experience: 3 to 5 years · • Ensure Production Management is closely aligned/embedded in the Agile software development process and ou ...
Site Reliability Engineer

1 week ago

LanceSoft, Inc. Montreal, Canada

Job Description: We are growing our team globally. It's a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by develo ...
Site Reliability Engineer

1 week ago

LanceSoft, Inc. Montreal, Canada

Responsibilities include: · • SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms. · • A commitment to understanding ITSM's range of products with a view to ...
