- Contribute code to increase the scalability and reliability of the service
- Contribute software tests and participate in peer review to increase the quality of our codebase
- Help develop peers' capabilities through knowledge sharing, mentoring, and collaboration
- Participate in a regular on-call schedule, including occasional paid weekends and holidays
- Practice sustainable incident response and blameless postmortems
- Resolve customer issues escalated from the Red Hat Global Support team
- Work within a small agile team to develop and improve SRE software, support your peers, plan, and self-improve
- 2+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure
- 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus
- 3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef
- 2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred
- 2+ years of experience delivering a hosted service
- Demonstrated ability to quickly and accurately troubleshoot system issues
- Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
- Solid communication skills and experience working directly with and presenting to customers
- 1+ year(s) of experience with Kubernetes is a plus
- 1+ year(s) of experience with Docker-based containers is a plus
- Comprehensive medical, dental, and vision coverage
- Flexible Spending Account - healthcare and dependent care
- Health Savings Account - high deductible medical plan
- Retirement 401(k) with employer match
- Paid time off and holidays
- Paid parental leave plans for all new parents
- Leave benefits including disability, paid family medical leave, and paid military leave
- Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more
Senior Site Reliability Engineer - British Columbia, Canada - Red Hat, Inc.
Description
About the job
Red Hat is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hat's enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation.
On the SRE team, you will have the opportunity to influence the complex challenges of scale which are unique to Red Hat managed cloud services, while using your skills in coding, operations, and large-scale distributed system design.
Red Hat relies on teamwork and openness for its success. We are a global team and strive to cultivate a transparent environment that makes room for different voices. We learn from our failures in a blameless environment to support the continuous improvement of the team. At Red Hat, your individual contributions have more visibility than at most large companies, and visibility means career opportunities and growth.
What you will do
The day-to-day responsibilities of an SRE involve working with live systems and coding automation. As an SRE you will be expected to:
What you will bring
A bachelor's degree in Computer Science or a related technical field involving software or systems engineering is required. However, hands-on experience that demonstrates your ability and interest in Site Reliability Engineering is valuable to us and may be considered in lieu of degree requirements. You must have some experience programming in at least one of these languages: Python, Golang, Java, C, C++, or another object-oriented language. You must have experience working with public clouds such as AWS, GCP, or Azure. You must also have the ability to collaboratively troubleshoot and solve problems in a team setting.
As an SRE you will be most successful if you have some experience troubleshooting an as-a-service offering (SaaS, PaaS) and some experience working with complex distributed systems. Direct experience with Kubernetes or OpenShift is a plus. We like to see a demonstrated ability to debug and optimize code and to automate routine tasks. We are Red Hat, so you need a basic understanding of Unix/Linux operating systems.
Desired skills:
Pay Transparency
Red Hat determines compensation based on several factors including but not limited to job location, experience, applicable skills and training, external market value, and internal pay equity. Annual salary is one component of Red Hat's compensation package. This position may also be eligible for bonus, commission, and/or equity. For positions with Remote-US locations, the actual salary range for the position may differ based on location but will be commensurate with job duties and relevant work experience.
About Red Hat
Red Hat is the world's leading provider of enterprise software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates have the flexibility to choose the work environment that suits their needs from in-office to fully remote to office-flex. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact. Opportunities are open. Join us.
Benefits
Note: These benefits are only applicable to full-time, permanent associates at Red Hat located in the United States.
Diversity, Equity & Inclusion at Red Hat
Red Hat's culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from diverse backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions of diversity that compose our global village.
Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.