Site Reliability Engineer (SQCE)

Vor Ort Lead vor 11 Tagen
Site Reliability Engineer Cloud & Infrastructure DevOps Site Reliability & SRE

Role Description

Site Reliability of our Development, Test & Prod environments hosted in Azure

  • Driving operational excellence for Payments Cloud services to deliver an "always on" operation, year-round, at the right cost

  • Rollout of Infrastructure, Operating System and Application updates with no impact to consumers

  • Experience with implementing end to end monitoring & alerting

  • Implementing and Delivering robust Infrastructure as code

  • Managing desired state configuration of Java Applications hosted on Cloud

  • Leading Root Cause Analysis through Blameless Post Mortems of Incidents and Failure Mode Analysis

  • Should prepare Run Books, Training Material and conducted sessions

  • Converts OPS issues into Stories to fix root cause

Key Responsibilities

Own value stream and application issue resolution to completion