Staff Site Reliability Engineer - Vancouver, British Columbia
10 hours ago

Job summary
Walt Disney Animation Studios' world-class filmmakers artists and technical collaborators create the magic of animation Bring your unique talents passion and ideas to our team and prepare to play in a creative artist-friendly environment We are seeking a Staff SRE with expertise in systems administration skills in Linux platforms and also has experience with software development e.g Python Go Java Node CI Pipeline tools e.g Jenkins Git source management cloud hosting AWS GCP & Azure container computing e.g Docker OCI web technologies The ideal candidate will enjoy the diversity challenges of working at various levels in foundational deployment stack defining configuration management developing CI CD infrastructure processes This role resides within Platform Infrastructure team Walt Disney Animation Studios WDAS we build tools manage infrastructure artists use daily create celebrated animated content SRE team within Platform Engineering optimizing service deployments improving availability latency performance efficiency observability systems WDAS All projects common pursuit simple performant solutions complex problems Agile DevOps methodologies high-energy proficient teams Critical success this role aptitude for working collaboratively technical team translate ideas tangible products shape experiences systematic approach automation resiliency efficiency stability security performance capacity management documentation serve subject matter expert multiple areas looked fellow team members go-to individual clear understanding elaborate SRE principles best practices given audience maintain uphold improve relevant reliability aspects services increased focus SLIs SLOs raise reliability large scale user-facing internal services work engineering creative production teams collaborative high-energy environment brainstorm architect gather requirements troubleshoot provide stellar customer support passionate constantly learning applying technology solve complex problems highly motivated optimistic proactive creative thought leader project manager Additional Responsibilities Include Support wide range on-premises cloud deployments using infrastructure-as-code self-healing security automation patterns facilitate others use Infrastructure Code paradigm deploy manage array on-premises cloud deployments develop useful telemetry alerts response reduce Mean Time To Repair MTTR collaborate provide technical excellence within across teams consult best practices develop tools smooth adoptions good service reliability practices methods identify areas improvement reliability efficiency operations build tools help your SRE quickly pinpoint isolate resolve issues related infrastructure platform services applications refine monitoring processes configurations thresholds practice promote sustainable incident response blameless postmortems develop runbooks streamlines problem resolution time write code improves scalability maintainability security add tune alert configurations documentation needed improve CI CD processes release cadence success Use Chaos Engineering principles test what you build real-world conditions mentor SRE Sysadmins Systems Engineers technical non-technical responsibilities Required Education BS Computer Science Computer Engineering Electrical Engineering related field Key Qualifications years experience SRE devops technical operations systems engineering software engineering discipline Proficient collaborative experienced building reliable scalable enterprise systems Excellent communication skills verbal written Passionate curious leveraging technology continually learning efficiently skilled containers container orchestration enterprise production environments Docker Kubernetes Rancher AWS ECS EKS Experience configuration management infrastructure as code Terraform Helm Cloud Formation Ansible Puppet Hands-on experience source control GitHub feature branching strategies Experience continuous integration tools Jenkins Gitlab CI CD AWS CodeBuild CodeDeploy Spinnaker Knowledge best practices IT operations always-up always-available service Possess expertise scalable testing automation frameworks SDLC distributed systems networking hardware logistics capacity planning UNIX Linux administration troubleshooting performance tuning security Bonus Qualifications Expertise web server administration
Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Product Reliability Engineer
4 weeks ago
Requisito para ingeniero de pruebas de hardware con experiencia en análisis de datos e identificación de tendencias. · Recibir errores escalados. · Analizar fallos en campo. · ...
Reliability and Integrity Engineer
1 week ago
The Reliability & Integrity (R&I) Engineer is an integral part of Woodfibre LNG's dynamic team. · During the construction phase of the project, work in collaboration with Project and Operations staff to ensure that the findings and recommendations from the Reliability, Availabili ...
Site Reliability Engineer
1 week ago
We are looking for a Site Reliability Engineer to join our Managed Cloud Optimization (MCO) team. Our SREs combine Google Cloud Platform expertise with a passion for devops methodologies to help clients maintain, optimize, and scale their cloud implementations. · ...
Site Reliability Engineer
1 month ago
The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools and automation for managing distributed systems in production environments. · Our software ensures that Apple's services are reliable, scalable and se ...
Product Reliability Engineer
4 weeks ago
The Product Reliability Engineer position at Motorola Solutions involves monitoring product performance by analyzing failures identifying root cause tracking reliability trends preventing recurrence providing feedback to design manufacturing quality teams improving product qualit ...
Site Reliability Engineer
2 weeks ago
We're looking for thoughtful builders who want their work to matter. · ...
Site Reliability Engineer
4 weeks ago
The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production environments. · BS/MS in Computer Science or Equivalent · At least 2-5 years in a Relia ...
Database Reliability Engineer
1 month ago
About this role: As a database reliability engineer (DBRE) at Cover Genius you will ensure seamless secure functioning of datastores which support our platform helping design solutions large scale engineering challenges such as multi regional scaling data sovereignty requirements ...
Site Reliability Engineer
6 days ago
The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools and automation for managing distributed systems in production environments. Our software ensures that Apples services are reliable scalable and secure ...
Site Reliability Engineer
1 month ago
Vancity is modernizing its technology foundation and scaling its cloud first strategy across digital banking core banking data platforms and member facing services. · ...
Site Reliability Engineer
4 weeks ago
The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes tools and automation for managing distributed systems in production environments. · Our software ensures that Apples services are reliable scalable secure we ...
Site Reliability Engineer
1 week ago
This is an end-to-end AI transformation partner that guides enterprises from complex business challenges to clear, quantifiable outcomes. Our company is the culmination of several successful firms each a leader in its own right in cloud artificial intelligence and data. This conv ...
Site Reliability Engineer
1 month ago
We're seeking a Site Reliability Engineer to deliver fast seamless experiences across web iOS and Android · We are looking for an experienced Site Reliability Engineer to join our team in Vancouver. · A strong understanding of performance metrics page load responsiveness app star ...
Database Reliability Engineer
1 month ago
About the company:Cover Genius is an insurtech that protects customers of digital companies. · Create datastore architectures and technologies. Implement automation for datastore management. · ...
Site Reliability Engineer
1 month ago
Vancity is modernizing its technology foundation and scaling its cloud first strategy across digital banking core banking data platforms member facing services. We are looking for a highly skilled Site Reliability Engineer who will help build operate continuously improve the reli ...
Database Reliability Engineer
2 weeks ago
As a Database Reliability Engineer (DBRE) on our Technology Team, your primary purpose will be to ensure the seamless and secure functioning of the datastores which support our platform. · To drive success in this role, you will have a strong background in site reliability engine ...
Site Reliability Engineer
1 month ago
Vancity is modernizing its technology foundation and scaling its cloud first strategy across digital banking core banking data platforms and member facing services. We are looking for a highly skilled Site Reliability Engineer (SRE) who will help build operate and continuously im ...
Service Reliability Engineer
3 weeks ago
We are looking for an experienced Service Reliability Engineer who can provide input into all areas of service development, ensuring that millions of gamers enjoy a trouble-free experience. · ...
Site Reliability Engineer
1 month ago
As a Frontend Performance Engineer you will ensure fast smooth experiences across web iOS Android by defining performance budgets monitoring key metrics. · ...
Senior Site Reliability Engineer
1 week ago
We're seeking a senior Site Reliability Engineer/DevOps who is passionate about building the best infrastructure and maintaining the health of the systems. · ...
Senior Site Reliability Engineer
1 week ago
Arista Networks is looking for a Senior Site Reliability Engineer to join their CloudVision-as-a-Service global SRE team. · ...