IQVIA Logo

IQVIA

Site Reliability Engineering Lead

Reposted 2 Days Ago
Be an Early Applicant
In-Office
London, Greater London, England
Expert/Leader
In-Office
London, Greater London, England
Expert/Leader
Lead a team to ensure availability and reliability of cloud services, implementing automation and operational excellence, while managing cloud resources and providing application support.
The summary above was generated by AI
Job Description Summary
We are seeking a Site Reliability Engineering Lead to head a team delivering mission-critical cloud services for a UK Public Sector client. This role combines hands-on technical expertise with leadership responsibilities, ensuring high availability, reliability, and scalability of cloud platforms. You will drive operational excellence, champion automation, and foster collaboration across cross-functional teams to deliver secure, resilient solutions.
Key Responsibilities
Team Leadership & Management
  • Lead, manage and mentor a team of CloudOps engineers, ensuring performance management, career development, and engagement.
  • Manage on-call rota and operational readiness for 24/7 support.
  • Oversee administrative and resource planning tasks.
  • Represent CloudOps in Programme Board, Architecture, Service Reviews & Client Meetings where necessary
Cloud Operations & Automation
  • Design and implement Infrastructure-as-Code (IaC) solutions using tools such as Terraform and Ansible.
  • Automate provisioning, configuration, and scaling of AWS cloud resources.
  • Build and maintain CI/CD pipelines for infrastructure and application deployments.
Platform Reliability & Performance
  • Monitor and troubleshoot cloud services to ensure uptime and rapid incident resolution.
  • Optimise system performance through metrics, dashboards, and proactive tuning.
  • Implement cost optimisation strategies for cloud resource usage.
Application Support
  • Become familiar with the application and service to be able to provide L2 support
  • Co-ordination with Service Management, Engineering & DevOps around application issues
Operational Excellence
  • Develop and maintain disaster recovery and backup strategies.
  • Ensure compliance with security and governance standards, including handling sensitive data (PII/PHI).
  • Maintain comprehensive documentation for infrastructure and operational processes.
Collaboration & Continuous Improvement
  • Partner with QA, Product, and Development teams to enhance service reliability.
  • Drive initiatives to improve time-to-market, quality, and resilience of solutions.

About You
  • Proven experience in CloudOps/DevOps/SRE roles (10+ years), with strong leadership capabilities.
  • Skilled in cloud architecture (AWS preferred), Linux environments, and containerisation frameworks.
  • Proficient in Python or similar programming languages.
  • Hands-on experience with IaC tools (Terraform, Ansible) and CI/CD automation.
  • Strong problem-solving skills and ability to work in fast-paced, distributed teams.
  • Eligible for DBS check and UK Security Clearance.
Desirable:
  • • Experience supporting client-facing systems in public sector or healthcare.
  • • Familiarity with secure systems handling sensitive data.
  • • Proactive mindset for identifying operational improvements.

Note: This role is not eligible for UK visa sponsorship.

IQVIA is a leading global provider of clinical research services, commercial insights and healthcare intelligence to the life sciences and healthcare industries. We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com

Top Skills

Ansible
AWS
Python
Terraform

Similar Jobs

4 Days Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Fintech • Information Technology • Financial Services
The Lead Site Reliability Engineer will drive SRE best practices, build cloud-native architectures, automate processes, and enhance system reliability and performance.
Top Skills: AWSAzureBashCi/CdDockerElkGCPGithub ActionsGrafanaJavaJenkinsKubernetesPrometheusPythonTerraform
3 Days Ago
Easy Apply
Remote or Hybrid
UK
Easy Apply
Senior level
Senior level
Cloud • Security • Software
As a Senior Site Reliability Engineer, you'll design and deliver scalable solutions, maintain cloud infrastructure, and optimize CI/CD pipelines while collaborating across teams.
Top Skills: Ci/CdCloud PlatformsDockerGitGoKubernetes
5 Days Ago
In-Office
London, Greater London, England, GBR
Senior level
Senior level
Artificial Intelligence • Real Estate • Software • PropTech
The Site Reliability Engineer role involves building and managing cloud infrastructure, CI/CD pipelines, and ensuring system reliability and performance while collaborating with software engineering teams.
Top Skills: AzureBashCircleCIDatadogDockerElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesPrometheusPythonTerraform

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account