BNY

Vice President, Site Reliability Engineering

Posted 5 Days Ago
In-Office
London, Greater London, England, GBR
Expert/Leader
Lead the design, build, and scaling of centralized SRE platforms and internal tools that improve operational efficiency, resiliency, monitoring, automation, and runbook-based support for production services. The role involves hands-on full-stack development, infrastructure automation, observability, SLIs/SLOs, AIOps, and cross-team partnership to reduce toil and improve reliability.
The summary above was generated by AI

We’re seeking a future team member for the role of Vice President - Site Reliability Engineer to join our team. This role is located in London. 

Role Summary

BNY is seeking a Vice President - Site Reliability Engineer to design, build, deploy, and scale resilient, automated, and centrally managed engineering solutions for Production Services. This role is ideal for a strong full-stack engineer who combines application development, UI engineering, backend services, infrastructure automation, and production reliability expertise.

The successful candidate will build reusable platforms, internal tools, and automation capabilities that improve operational efficiency, reduce manual effort, strengthen resiliency, and enable Production Services teams to support critical business platforms more effectively. This role requires a hands-on engineer who can take solutions from concept and development through deployment, operationalization, and continuous improvement.

In this role, you’ll make an impact in the following ways: 

  • Design, develop, and deploy centralized engineering solutions that improve operational efficiency, reduce toil, and enhance resiliency across Production Services.

  • Build full-stack applications and internal engineering tools, including backend services, APIs, automation layers, and user-facing interfaces using technologies such as Python, Java, React, or Angular.

  • Engineer scalable solutions that support central operational use cases such as self-service tooling, operational dashboards, alert enrichment, incident reduction, service recovery, and workflow automation.

  • Develop reusable frameworks and components that can be adopted broadly across Production Services teams to standardize and accelerate operational processes.

  • Automate infrastructure, deployment, configuration, and runtime support activities using tools such as Ansible and Kubernetes.

  • Define, implement, and continuously improve Service Level Indicators, Service Level Objectives, and service health measures aligned to operational and business priorities.

  • Build and optimize monitoring, observability, and alerting capabilities using tools such as Prometheus, Grafana, AppDynamics, and Splunk.

  • Apply AIOps capabilities to improve event correlation, anomaly detection, root cause analysis, predictive insights, and proactive issue prevention.

  • Partner with engineering, infrastructure, production support, security, and risk teams to ensure developed solutions are secure, scalable, supportable, and aligned to enterprise standards.

  • Identify manual, fragmented, or repetitive processes across Production Services and convert them into efficient, automated, centrally consumable solutions.
     

To be successful in this role, we’re seeking the following: 

  • Required Qualifications:

    • Bachelor's degree in Computer Science, Engineering, or a related technical discipline, or equivalent practical experience.

    • Strong full-stack development experience, with hands-on expertise in Python and Java for backend or service-layer engineering.

    • Strong working knowledge of front-end development using React or Angular, including building interfaces for operational or engineering use cases.

    • Proven experience designing and deploying end-to-end solutions, from application development through production deployment and operational support.

    • Experience in Site Reliability Engineering, Production Engineering, DevOps, Platform Engineering, or similar roles supporting business-critical applications.

    • Strong foundation in Linux/Unix systems administration, scripting, troubleshooting, and infrastructure concepts.

    • Hands-on experience with Ansible and Kubernetes in enterprise or production environments.

    • Demonstrated ability to define and operationalize SLIs, SLOs, dashboards, alerts, and health indicators.

    • Hands-on experience with enterprise monitoring and observability platforms including Prometheus, Grafana, AppDynamics, and Splunk.

    • Strong troubleshooting, analytical, and problem-solving skills in complex distributed or production environments.

    • Strong verbal and written communication skills, with the ability to collaborate effectively across technical and non-technical stakeholders.

 

  • Preferred Qualifications:

    • Experience building centralized internal platforms or shared engineering services for operational or enterprise users.

    • Experience applying AIOps, machine learning, or intelligent automation within production support or reliability engineering environments.

    • Exposure to CI/CD pipelines, infrastructure as code, API-driven automation, and modern software delivery practices.

    • Experience supporting distributed systems, cloud-native platforms, or container-based architectures.

    • Knowledge of Agile, DevOps, and SRE operating models, including continuous improvement and blameless post-incident practices.

    • Ability to influence engineering standards and drive adoption of common tooling and automation patterns across teams.

About Us

At BNY, our culture allows us to run our company better and enables employees’ growth and success. As a leading global financial services company at the heart of the global financial system, we influence nearly 20% of the world’s investible assets. Every day, our teams harness cutting-edge AI and breakthrough technologies to collaborate with clients, driving transformative solutions that redefine industries and uplift communities worldwide.

Recognized as a top destination for innovators, BNY is where bold ideas meet advanced technology and exceptional talent. Together, we power the future of finance – and this is what #LifeAtBNY is all about. Join us and be part of something extraordinary.

At BNY, our culture speaks for itself. Check out the latest BNY news at BNY Newsroom and BNY LinkedIn.

Here are a few of our recent awards:

  • America’s Most Innovative Companies, Fortune, 2025
  • World’s Most Admired Companies, Fortune, 2025
  • “Most Just Companies”, Just Capital and CNBC, 2025

Our Benefits and Rewards:

BNY offers highly competitive compensation, benefits, and wellbeing programs rooted in a strong culture of excellence and our pay-for-performance philosophy. We provide access to flexible global resources and tools for your life’s journey. Focus on your health, foster your personal resilience, and reach your financial goals as a valued member of our team. We also offer generous paid leave, including paid volunteer time, to support you and your family through moments that matter.

BNY is an Equal Employment Opportunity/Affirmative Action Employer - Underrepresented racial and ethnic groups/Females/Individuals with Disabilities/Protected Veterans.
