Senior Site Reliability Engineering Program & Compliance Manager
About the role
Senior Site Reliability Engineering (SRE) Program & Compliance Manager
We are seeking an experienced and strategic Senior Site Reliability Engineering (SRE) Program & Compliance Manager to lead business architecture, process governance, and operational maturity initiatives supporting network and infrastructure services. This role is responsible for defining, analyzing, and optimizing the business processes, policies, controls, and lifecycle frameworks that govern infrastructure services across Plan, Build, Run, and Release functions.
The ideal candidate will bring deep expertise in infrastructure lifecycle management, SRE policy and procedures, process design, audit and compliance readiness, and documentation governance, with the ability to independently lead current-state (as-is) to future-state transformation initiatives. This role partners across engineering, operations, security, and business stakeholders to improve service reliability, strengthen governance, and drive continual service improvement.
Key Responsibilities
Business Architecture & Process Design
- Lead business process mapping, analysis, and redesign for infrastructure and reliability services, documenting as-is, identifying gaps, and designing future-state operating models.
- Define and mature business architecture principles, process standards, policies, procedures, and governance controls supporting infrastructure lifecycle management.
- Facilitate cross-functional workshops to drive consensus on process improvements, operating models, roles, and decision frameworks.
- Develop process documentation, control narratives, RACI models, workflows, and supporting artifacts to improve consistency and operational effectiveness.
Infrastructure Lifecycle Management (Plan, Build, Run, Release)
- Architect and govern business processes supporting the full lifecycle of network and infrastructure services.
- Establish lifecycle controls, process checkpoints, and release governance aligned to reliability, risk, and business objectives.
- Partner with engineering and operational teams to improve service onboarding, operational readiness, and release management processes.
- Support integration of lifecycle processes across infrastructure, service management, and reliability functions.
SRE Policy, Procedures & Continual Service Improvement
- Develop and mature policies and procedures supporting Site Reliability Engineering (SRE) practices, including service reliability governance, operational controls, and procedural standards.
- Support adoption of SRE principles, including service level indicators (SLIs), service level objectives (SLOs), error budgets, and operational performance measurements.
- Drive continual service improvement through process optimization, root cause trend analysis, and control effectiveness reviews.
- Partner with technical teams to align business processes with reliability engineering and operational excellence practices.
Audit, Risk & Compliance
- Ensure processes and controls align with internal governance requirements and external frameworks, including ISO 9001, IS