Lead Site Reliability Engineer

FNZ Group

Reposted 10 Days Ago

In-Office

London, Greater London, England, GBR

Senior level

The Lead Site Reliability Engineer ensures high availability of FNZ platforms, implements monitoring and deployment solutions, and collaborates with engineering teams. Responsibilities include optimizing cloud workloads and managing application delivery networks.

The summary above was generated by AI

Role Purpose

The Site Reliability Engineer will work closely with Application, Infrastructure, and Network Engineering teams to ensure the reliability, scalability, and performance of FNZ platforms. This role focuses on deploying, integrating, and providing ongoing operational support for mission-critical systems, leveraging modern automation and cloud-native practices.

Key Responsibilities

· Maintain high availability and performance of FNZ platforms.

· Implement monitoring, alerting, and observability solutions to proactively detect and resolve issues.

· Collaborate with engineering teams to design and implement robust deployment pipelines.

· Ensure smooth integration of applications with infrastructure and network components.

· Use Terraform for provisioning and managing infrastructure across environments.

· Operate and optimize workloads on-premises and in public cloud.

· Manage and troubleshoot application delivery networks, load balancing, and traffic routing.

· Configure and support F5 Distributed Cloud or similar CDN/ADC technologies.

· Participate in on-call rotations, perform root cause analysis, and implement preventive measures.

· Work cross-functionally with Application, Infrastructure, and Network Engineering teams to deliver reliable services.

Required Skills & Experience

· Kubernetes (K8s): Deep understanding of container orchestration and cluster management.

· Terraform: Strong experience in Infrastructure as Code for cloud and on-prem environments.

· Public Cloud: Hands-on experience with AWS, Azure, or GCP.

· F5 Distributed Cloud or Similar: Knowledge of CDN/ADC platforms and their integration.

· Networking Fundamentals: Expertise in application delivery networks, load balancing, traffic routing, and troubleshooting.

· Observability Tools: Familiarity with Splunk, New Relic, or similar.

· Scripting & Automation: Proficiency in Terraform, Bash, or similar languages.

Desirable Skills

· Experience with CI/CD pipelines and GitOps workflows.

· Knowledge of SRE principles.

· Familiarity with security best practices.

Key Attributes

· Strong problem-solving and troubleshooting skills.

· Ability to work collaboratively across multiple teams.

· Passion for automation and reducing operational toil.

Reporting Line

Reports to: Head of Platform Operations/Application Engineering.

Works closely with the Application Engineering, Infrastructure Engineering, and Network Engineering teams.

About FNZ

FNZ is committed to opening up wealth so that everyone, everywhere can invest in their future on their terms. We know the foundation to do that already exists in the wealth management industry, but complexity holds firms back.

We created wealth’s growth platform to help. We provide a global, end-to-end wealth management platform that integrates modern technology with business and investment operations. All in a regulated financial institution.

We partner with the world’s leading financial institutions, with over US$2.2 trillion in assets on platform (AoP).
Together with our clients, we empower nearly 30 million people across all wealth segments to invest in their future.

135 Bishopsgate, London, United Kingdom, EC2M 3TP
