Skip to main content
Team members enjoying time outside
Join Our Team

Head of Production Management Resiliency - Director

Job Req Id:
25865714
Location(s):
London, United Kingdom
Job Type:
Hybrid
Posted:
Jul. 16, 2025

Discover your future at Citi

Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact.

Job Overview

Citi is a world-leading global bank. We have approximately 200 million customer accounts and a presence in more than 160 countries and jurisdictions worldwide. We provide consumers, corporations, governments, and institutions with a broad range of financial products and services, including consumer banking and credit, corporate and investment banking, securities brokerage, transaction services, and wealth management. We enable clients to achieve their strategic financial objectives by providing them with cutting-edge ideas, best-in-class products and solutions, and unparalleled access to capital and liquidity.

The successful candidate will ensure the resilience of critical applications by adhering to enhanced testing and recovery standards, and proactively identifying and mitigating vulnerabilities. This role is crucial for maintaining the stability and operational resilience of Citi's critical business services, ensuring they remain within defined impact tolerances and minimizing client impact duration.

Responsibilities:

Implement Enhanced Testing and Recovery:

  • Oversee the implementation and execution of Production Swing testing for critical applications, ensuring applications run from their alternate site for a minimum of 5 days.

  • Implement and oversee Data Recovery testing, ensuring applications can recover critical data from backup solutions within the defined Impact Tolerance (ITOL).

  • Drive the onboarding of critical applications to the One-Touch Recovery orchestration solution.

  • Minimize the Recovery Time Actual (TRTA) for critical applications.

Design and Architecture:

  • Champion resilient application design by advocating for and integrating resiliency principles into architectures, and promoting the use of established resiliency patterns.

  • Leverage cloud-native services and features to enhance application resiliency. This includes services for auto-scaling, load balancing, and disaster recovery.

  • Explore and implement chaos engineering practices to proactively identify and address system weaknesses under stress.

Proactive Vulnerability Management:

  • Proactively identify vulnerabilities through regular architecture reviews, comprehensive scenario testing, and foundational testing.

  • Document and demonstrate mitigation efforts for all discovered vulnerabilities. This includes developing remediation plans, implementing necessary changes, and validating the effectiveness of mitigations.

  • Ensure that all identified vulnerabilities have remediation plans scheduled.

Operational Resilience Adherence:

  • Ensure that all critical applications adhere to operational resilience testing and recovery requirements.

  • Collaborate with relevant stakeholders to define and maintain appropriate impact tolerances for critical business services.

Performance Measurement and Reporting:

  • Monitor and report on key resilience metrics, including the number of applications executing production swing tests, the number of applications on One-ouch Recovery, recovery times and adherence to operational resilience requirements.

  • Provide regular updates to senior management on the status of resilience initiatives and key performance indicators.

Key Qualifications:

  • Relevant professional software engineering experience - and in particular in SRE roles

  • Expertise analyzing complex application, database, network, and OS issues across a distributed large scale customer facing systems

  • Strong communication skills and ability to work effectively across multiple business and technical team

  • Experience in Java, .NET, Maven, Gradle, Jenkins, Helm, Puppet, Chef, Ansible, Kubernetes, AWS, Splunk, Prometheus

  • BS degree in computer science or equivalent field

What we’ll provide you:

By joining Citi, you will not only be part of a business casual workplace with a hybrid working model (up to 2 days working at home per week), but also receive a competitive base salary (which is annually reviewed), and enjoy a whole host of additional benefits such as:

  • 27 days annual leave (plus bank holidays)

  • A discretional annual performance related bonus

  • Private Medical Care & Life Insurance

  • Employee Assistance Program

  • Pension Plan

  • Paid Parental Leave

  • Special discounts for employees, family, and friends

  • Access to an array of learning and development resources

Alongside these benefits Citi is committed to ensuring our workplace is where everyone feels comfortable coming to work as their whole self, every day. We want the best talent around the world to be energized to join us, motivated to stay and empowered to thrive.

#LI-BH1

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Technology Product Management

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.

A man walks his dog, enjoying a well-earned break from work.

Global Benefits

Discover the top benefits offered to our global workforce, designed to support your well-being, growth and work-life balance. Explore a few of the highlights that make working with us rewarding.

Learn More

A woman enjoying work-life balance with her family

Explore More Jobs

  • Group of young professionals in an office setting

    Early Careers Talent Network

    Sign up to receive personalized job matches based on your skills and interests. We'll help you discover opportunities that align with your goals.

    Discover More

  • Four coworkers walking down stairs and talking

    Career Professionals Talent Network

    Sign up to receive tailored job matches based on your skills and experience. Discover opportunities that align with your ambitions.

    Discover More