Citi Logo

Citi

SRE Observability Technical Lead - Vice President

Posted 3 Hours Ago
Be an Early Applicant
In-Office
London, England, GBR
Expert/Leader
In-Office
London, England, GBR
Expert/Leader
Lead observability strategy and delivery for Services Technology, building end-to-end payment monitoring, telemetry, SLOs, dashboards, and integrations across on-prem and cloud. Partner with SREs, developers, and central platform teams to reduce toil, improve MTTD/MTTR, embed telemetry, and drive adoption of observability tooling and AI/ML insights for critical payment flows.
The summary above was generated by AI

Engineer the future of global finance. At Citi, our Tech team doesn’t just support finance – we are helping to redefine it. Every day, $5 trillion crosses through our network. We do business in 180+ countries operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Look to join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and get the opportunity to build the future of finance with Citi Tech.

The SRE Observability Specialist is a hands-on expert, delivering the future of Observability across Services Technology. This role is a part of a central SRE enablement team within Services Production, working closely with SREs, developers, and platform teams to embed telemetry, implement SLOs, and build meaningful visualizations for key production flows — particularly in critical Payments Business.

The ideal candidate will have deep technical knowledge, a collaborative mindset, and the ability to translate strategy into scalable engineering outcomes. You will also act as a bridge between Services Technology teams and central infrastructure/CTO teams, prioritizing observability needs from line-of-business teams and driving improvements. A strong understanding of observability tooling, evolving AI/ML capabilities, and enterprise tooling ecosystems will be essential.

This role requires providing technological Support solution for Function called Project Orion which provides End-to-End payment monitoring like Building an End-to-End payments Dashboard, Toil Reduction, Transformation of legacy monitoring into observability based monitoring solution, requires good understanding of different Payments Taxonomy (ACH, Wires, Instant Payments, etc.). Strong commercial awareness, technical credibility, and excellent communication skills are essential to negotiate internally, influence peers, and drive change. Some external communication may be necessary.

Key Responsibilities:

  • Define the roadmap for Engineering enablers for Project Orion team aligned with enterprise reliability and SRE Services organization goals.

  • Translate Organization strategy into an actionable delivery plan in partnership with Services Products, Operations & Engineering function, delivering incremental, high-value milestones.

  • Understand Critical Business Services functional scope and translate into End-to-End monitoring solutions.

  • Deliver against the observability roadmap for Services Technology by building scalable, reusable telemetry solutions.

  • Periodic review and analyze application monitoring TOIL and collaborate with stakeholders and remediate them as per organization goal.

  • Create and maintain dashboards and visualizations for critical client journeys, including real-time flows across Payments.

  • Guide line-of-business teams in implementing SLIs/SLOs, golden signals, and effective alerting to support operational excellence.

  • Support integration and adoption of observability tooling across on-prem, public cloud (AWS/GCP), and containerized environments (ECS, Kubernetes).

  • Customize shared dashboards and observability components in partnership with CTI and other central Engineering functions, ensuring usability and flexibility.

  • Provide technical support and implementation guidance to SREs and developers facing integration or tooling challenges.

  • Effectively manage the observability book of work for Services Technology and drive initiatives to reduce MTTD and improve recovery outcomes.

  • Serve as a key connection point between line-of-business SREs and central infrastructure functions by gathering tooling feedback, surfacing systemic issues, and influencing platform enhancements via the Services Observability Forum.

  • Stay current with observability trends, including AI/ML-driven insights, anomaly detection, and emerging OSS practices, and assess their applicability.

  • Maintain strong knowledge of observability platform features and vendor offerings to advise teams and maximize the value of tooling investments.

  • Foster AI adoption by building use cases performed by Orion L1 Functions and remediation using Citi AI tech stack.

Qualifications:

  • Experience in SRE, Observability Engineering, or platform infrastructure roles focused on operational telemetry.

  • Bachelor's degree (computer science or related fields) or equivalent experience in building scalable solutions to improve the service reliability and/or increase productivity and efficiency

  • Hands-on experience in observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms.

  • Deep understanding of SLIs, SLOs, Error Budgets, and telemetry best practices in high-availability environments.

  • Proven ability to troubleshoot integration issues and support observability across hybrid platforms (on-prem, cloud, containers).

  • Experience building dashboards aligned to business outcomes and incident workflows, especially in critical flows like payments.

  • Experience with agile systems development methodologies, building dashboards aligned to business outcomes and incident workflows, especially in critical flows like payments.

  • Familiarity with modern observability tooling ecosystems, including AI/ML capabilities, trace correlation, baselining, and alert tuning.

  • Strong interpersonal and collaboration skills — able to operate across federated engineering teams and central infrastructure groups.

  • Experience in enablement or platform teams with a track record of scaling best practices across diverse business units.

What we’ll provide you:

By joining Citi, you will not only be part of a business casual workplace with a hybrid working model (up to 2 days working at home per week), but also receive a competitive base salary (which is annually reviewed), and enjoy a whole host of additional benefits such as:

  • 27 days annual leave (plus bank holidays)

  • A discretional annual performance related bonus

  • Private Medical Care & Life Insurance

  • Employee Assistance Program

  • Pension Plan

  • Paid Parental Leave

  • Special discounts for employees, family, and friends

  • Access to an array of learning and development resources

Alongside these benefits Citi is committed to ensuring our workplace is where everyone feels comfortable coming to work as their whole self, every day. We want the best talent around the world to be energized to join us, motivated to stay and empowered to thrive.
 

#LI-BH3

------------------------------------------------------

Job Family Group: Technology

------------------------------------------------------

Job Family:Applications Support

------------------------------------------------------

Time Type:Full time

------------------------------------------------------

Most Relevant Skills Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Citi London, England Office

33 Canada Square, London, United Kingdom, E14 5LB

Similar Jobs

2 Hours Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Fintech • Mobile • Payments • Software • Financial Services
Lead analytics for FinCrime operations: build forecasting and capacity-planning models, own data pipelines, develop predictive models and cause-effect analysis, track KPIs and initiative performance, conduct cost and operational health analyses, and standardise forecasting and real-time processes with stakeholders.
2 Hours Ago
Hybrid
London, England, GBR
Mid level
Mid level
Artificial Intelligence • Fintech • Software
Manage a book of existing SaaS clients to drive renewals, expansions, and upsells. Coordinate with Customer Success, Sales, and Product teams; maintain forecasts and CRM data; meet monthly and quarterly targets; research accounts, build relationships, and report sales activity.
Top Skills: CloseFloqastMS OfficeOutreachSalesforce
3 Hours Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Artificial Intelligence • Fintech • Greentech • Sales • Software • Travel • Hospitality
Lead fraud prevention and card operations for Perk Pay corporate card program. Develop fraud risk strategy, define policies and metrics, work with product/engineering and partners to implement systems, monitor authorizations and fraud rates, manage dispute/chargeback operations, provide stakeholder reporting, and lead a team of card operations specialists.
Top Skills: 3D SecureMastercardRisk Scoring ModelsRule EnginesStrong Customer AuthenticationTransaction Risk AnalysisVisa

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account