- Ensure Production Management is closely aligned/embedded in the Agile software development process and our code meets production standards
- Incorporate System Reliability Engineering and DevOps implementations into the day-to-day role by developing automated solutions to long standing problems to ensuring minimal downtime and manual effort
- Configuring application monitors using industry standard monitoring tools, as well as developing customized monitoring solutions
- Build extensive business and application knowledge required for supporting client facing applications
- Revisit SRE Metrics and confirm against the firm and department goals
- Implement tooling / create automations to help with Toil Elimination (manual or repetitive work)
- Engage early in SDLC with our Development teams to have an active role in creating a resilient and reliable solution
- Prioritize project work based on critical incidents and key business stakeholders
- Interface with clients and other technology teams to provide governance and control around the production environment.
- Bachelor's degree in Computer Science or related field
- Experience with Service Oriented Architecture, Distributed Systems, Business Intelligence Reporting such as Power BI, Scripting such as Python or shell, Front end development (HTML, Java Script, AngularJS), Cloud Computing such as MS AZURE and SaaS integrations
- Clear understanding of Logging, Monitoring, and Knowledge Management practices such as Docs as Code
- Ability to manage an incident call and coordinate multiple teams towards a common goal of resolving a business impactful outage, once trained
- Strong knowledge of DevOps and SRE Principles with grasp over tools / approach to apply them
- Strong infrastructure knowledge in Linux / Unix admin, Storage, Networking and Web Technologies
- Advanced Unix Shell / Python scripting experience
- Advanced SQL query language knowledge such as Sybase, DB2, MongoDB and Snowflake preferred.
-
Tecsys Inc. Montreal, Canada Full timeLa version française suit ci-dessous · Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we inve ...
-
Junior Site Reliability Engineer
1 day ago
Plexia Montreal, Canada Full timedu poste · A titre de Junior Site Reliability Engineer (SRE), vous jouerez un rôle crucial au sein du département R&D et Innovation. Vous serez invité à collaborer avec l'équipe chargée de développement des logiciels et de l'architecture core de Plexia. Le caractère très sensible ...
-
Site Reliability Performance Engineer
1 week ago
Synechron Montréal, QC, CanadaNous sommes Synechron est un cabinet de conseil leader mondial en transformation numérique, axé sur les services financiers et les organisations technologiques. Nos spécialités incluent l'intelligence artificielle de bout en bout, le conseil, le numérique, le cloud & DevOps, les ...
-
Site Reliability Engineer
23 hours ago
LanceSoft, Inc. Montreal, CanadaJob Title: Production Reliability & Support Expert (SRE) · Location : Montreal ( Office attendance from Day 1 – Hybrid mode 3x per week) · Years of experience : 3 to 5 years · • Ensure Production Management is closely aligned/embedded in the Agile software development process and ...
-
Site Reliability Engineer
3 days ago
LanceSoft, Inc. Montreal, CanadaJob Title: Production Reliability & Support Expert (SRE)Location : Montreal ( Office attendance from Day 1 – Hybrid mode 3x per week)Years of experience : 3 to 5 years · • Ensure Production Management is closely aligned/embedded in the Agile software development process and our ...
-
Site Reliability Engineering
6 days ago
Cisco Montreal, CanadaWho We Are · As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive m ...
-
Site Reliability Engineering
4 days ago
Cisco Montreal, CanadaWho We Are · As a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive m ...
-
Principal Site Reliability Engineer
6 days ago
Lightspeed Montréal, QC, CanadaWe're looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supportin ...
-
Principal Site Reliability Engineer
3 days ago
Lightspeed Montreal, Canada Full timeHi there Thanks for stopping by · Are you actively looking for a new opportunity? Or just checking the market? Well... you might just be in the right place · We're looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER ...
-
Junior Site Reliability Engineer
1 day ago
Plexia Montreal, Canada Full timeJob Description · As a Junior Site Reliability Engineer (SRE) you will play a crucial role within the R&D and Innovation department. You will be called upon to collaborate with the Plexia product-aligned and core architecture team. The highly sensitive nature of health and medica ...
-
Windows Site Reliability Engineer,
6 days ago
Hunter Bond Montréal, QC, CanadaJob Title: Application Support Engineer Client: Fintech · My client are looking to expand their Application Support team, and would like someone with prior front office experience to provide technical support and engineering functions in support of their proprietary and third pa ...
-
Site Reliability Engineer 3
3 days ago
Behavox Montreal, CanadaAbout the Role · The Behavox Platform is a scalable, fault-tolerant and highly performant storage and processing system which allows us to manage and analyze massive volumes of data. We have an extensive and flexible set of APIs to develop products that allow our clients to work ...
-
Site Reliability Engineer 3
4 days ago
Behavox Montreal, CanadaAbout Behavox · Behavox is shaping the future for how businesses harness their most important raw material - data. Our mission is bold: Organize enterprise data into actionable information that protects and promotes the business growth of multinational companies around the world. ...
-
Windows Site Reliability Engineer,
1 week ago
Hunter Bond Montréal, QC, CanadaJob Title: Application Support EngineerClient: FintechSalary: Circa $125,000 + Bonuses & PackageLocation: Montreal/HybridMy client are looking to expand their Application Support team, and would like someone with prior front office experience to provide technical support and engi ...
-
Site Reliability Performance Engineer
2 weeks ago
Soho Square Solutions Montréal, QC, CanadaBachelor's degree in Computer Science or related field · • Experience with Service Oriented Architecture, Distributed Systems, Business Intelligence Reporting such as PowerBI, Scripting such as Python or shell, Front end development (HTML, Java Script, AngularJS), Cloud Computing ...
-
Site Reliability Engineering Developer SRE
1 week ago
National Bank Montreal, Canada PermanentAttendance Hybrid Job Number 19678 Category Senior Professional Status: Permanent Type of Contract Permanent Schedule: Full-Time Full Time / Part Time? Full-Time Posting date 19-Mar-2024 Location: Montreal, Quebec City Montreal Province/State Quebec Area of Interest: Information ...
-
CGI Montreal, Canada Full timePosition Description: · CGI is a dynamic and innovative technology firm committed to delivering cutting-edge solutions. We are currently seeking a highly skilled and motivated individual to join our team as a FinOps and Site Reliability Engineer (SRE). This role is pivotal in br ...
-
Site Reliability Engineering Developer SRE
4 days ago
NBC Montreal, Canada Full timeArea of Interest: Information technology A career in technology at National Bank means being part of the transformation to have a direct impact on the client. As a Systems Reliability Specialist, you will be expected to help all IT teams put in place the necessary mechanisms ...
-
Site Reliability Engineering Developer SRE
2 days ago
National Bank Montreal, Canada OTHER· Job Posting · Attendance Hybrid Job Number 19678 · Category: Senior Professional · Status: Permanent · Type of Contract: Permanent · Schedule: Full-Time · Full Time / Part Time? Full-Time · Posting date: 19-Mar-2024 · Location: Montreal, Quebec City Montreal · Province/State: ...
-
Stingray Montreal, CanadaDepartment IT Location Montreal At Stingray, creativity, collaboration, and cutting-edge technology are the pillars of our DNA. Are you ready to watch your career take off by joining a fast-growing company with a team of tech-savvy music lovers and a stimulating and fun work env ...
Site Reliability Engineer - Montreal, Canada - LanceSoft, Inc.
Description
Job Description:
We are growing our team globally. It's a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by developing a deep understanding of how our application code is running, configured, and scaled. This allows us to effectively resolve open incidents in the shortest amount of time, develop monitors to detect future occurrences and implement automation technologies to enable the environment to self-heal. Our team manages all entitlements/accesses in Production in a scope of more than 35 systems and user distributed globally around the world with accesses span from Trading to payment to vendor apps. Role and
Responsibilities:
Qualifications:
You should apply on this requisition if you have, at minimum, the following profile: