FNZ Group Logo

FNZ Group

Lead Site Reliability Engineer

Reposted 17 Days Ago
Be an Early Applicant
In-Office
London, Greater London, England, GBR
Senior level
In-Office
London, Greater London, England, GBR
Senior level
The Lead Site Reliability Engineer ensures high availability of FNZ platforms, implements monitoring and deployment solutions, and collaborates with engineering teams. Responsibilities include optimizing cloud workloads and managing application delivery networks.
The summary above was generated by AI

Role Purpose

The Site Reliability Engineer will work closely with Application, Infrastructure, and Network Engineering teams to ensure the reliability, scalability, and performance of FNZ platforms. This role focuses on deploying, integrating, and providing ongoing operational support for mission-critical systems, leveraging modern automation and cloud-native practices.

Key Responsibilities

·       Maintain high availability and performance of FNZ platforms.

·       Implement monitoring, alerting, and observability solutions to proactively detect and resolve issues.

·       Collaborate with engineering teams to design and implement robust deployment pipelines.

·       Ensure smooth integration of applications with infrastructure and network components.

·       Use Terraform for provisioning and managing infrastructure across environments.

·       Operate and optimize workloads onprem and public cloud.

·       Manage and troubleshoot application delivery networks, load balancing, and traffic routing.

·       Configure and support F5 Distributed Cloud or similar CDN/ADC technologies.

·       Participate in on-call rotations, perform root cause analysis, and implement preventive measures.

·       Work cross-functionally with Application, Infrastructure, and Network Engineering teams to deliver reliable services.

Required Skills & Experience

·       Kubernetes (K8s): Deep understanding of container orchestration and cluster management.

·       Terraform: Strong experience in Infrastructure as Code for cloud and on-prem environments.

·       Public Cloud: Hands-on experience with AWS, Azure, or GCP.

·       F5 Distributed Cloud or Similar: Knowledge of CDN/ADC platforms and their integration.

·       Networking Fundamentals: Expertise in application delivery networks, load balancing, traffic routing, and troubleshooting.

·       Observability Tools: Familiarity with Splunk, NewRelic, or similar.

·       Scripting & Automation: Proficiency in Terraform, Bash, or similar languages.

Desirable Skills

·       Experience with CI/CD pipelines and GitOps workflows.

·       Knowledge of SRE principles.

·       Familiarity with security best practices.

Key Attributes

·       Strong problem-solving and troubleshooting skills.

·       Ability to work collaboratively across multiple teams.

·       Passion for automation and reducing operational toil.

Reporting Line

Reports to: Head of Platform Operations/Application Engineering.

Works closely with Application Engineering, Infrastructure Engineering, Network Engineering teams.

#LI-CM1

About FNZ

FNZ is committed to opening up wealth so that everyone, everywhere can invest in their future on their terms. We know the foundation to do that already exists in the wealth management industry, but complexity holds firms back. 

We created wealth’s growth platform to help. We provide a global, end-to-end wealth management platform that integrates modern technology with business and investment operations. All in a regulated financial institution. 

We partner with the world’s leading financial institutions, with over US$2.2 trillion in assets on platform (AoP).
Together with our clients, we empower nearly 30 million people across all wealth segments to invest in their future.

HQ

FNZ Group London, England Office

135 Bishopsgate, London, United Kingdom, EC2M 3TP

Similar Jobs

Senior level
Fintech • Analytics
The Lead Site Reliability Engineer will establish SRE foundations, collaborate on system reliability, and champion observability practices while improving operational efficiency and mentoring engineers.
Top Skills: AWSCloudFormationDatadogEc2EcsEksElkGrafanaKubernetesOpentelemetryPrometheusTerraform
15 Days Ago
In-Office
Senior level
Senior level
Cloud • Software • Analytics
The Lead Site Reliability Engineer will manage production environments, automate tasks, lead investigations, and enhance observability in cloud platforms. Requires extensive SRE experience and collaboration with engineering teams to enforce service reliability metrics.
Top Skills: ArmAzureAzure DevopsBicepC#ElasticsearchGitGrafanaKubernetesPowershellPrometheusPython
21 Days Ago
In-Office
Senior level
Senior level
Events • News + Entertainment
Lead consulting work in Site Reliability Engineering, mentoring teams to enhance reliability, resilience, and engineering practices across multiple domains.
Top Skills: AWSKubernetes

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account