Product Owner - Operational Resilience
DescriptionOwn and evolve a Proactive Resilience product/capability that anticipates, prevents, and mitigates technology and service disruption. You''ll translate resilience outcomes (availability, recoverability, performance, operational readiness) into a clear product roadmap, measurable value, and repeatable adoption across platforms and teams.Key responsibilities include:Product strategy andamp; roadmap- Define product vision, target users and a prioritised roadmap aligned to business services.- Maintain a clear backlog of resilience features Outcome-driven delivery- Set OKRs/KPIs for proactive resilience.- Maintain a Community of Practice to surface potential resilience improvements, maintained and prioritised via a backlogResilience-by-design- Embed resilience enhancements into SDLC and change processes (non-functional requirements, release readiness, operational acceptance).- Champion practices such as chaos engineering, game days, fault injection, capacity and performance testing, and DR readiness.Observability andamp; insights- Partner with monitoring/observability teams to improve telemetry, alert quality, and actionable dashboards.- Use data to identify systemic risks, recurring failure modes, and top offenders across services.Automation andamp; operational excellence- Prioritise automation for detection, triage, and remediation.Stakeholder management- Align engineering, operations, architecture, risk, and business stakeholders on resilience priorities.- ..... full job details .....
Other jobs of interest...
Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!