Capita Logo

Capita

Cloud Infrastructure SRE

Posted 3 Days Ago
Be an Early Applicant
Basted, Tonbridge and Malling, Kent, England
Mid level
Basted, Tonbridge and Malling, Kent, England
Mid level
As a Cloud Infrastructure SRE, you will support the public cloud infrastructure, resolve support cases, monitor services, and contribute to service improvements. Your role includes ensuring service availability and performance, managing incidents, and developing documentation while collaborating with diverse teams.
The summary above was generated by AI

Job Title: Cloud Infrastructure – Site Reliability Engineer (SRE) Job Description: The Cloud Infrastructure – Site Reliability Engineer (SRE) supports the public cloud infrastructure used to deliver public cloud hosted managed services to customers.

Job title:

Cloud Infrastructure SRE

Job Description:

The Cloud Infrastructure Site Reliability Engineer (SRE) supports the public cloud infrastructure used to deliver public cloud hosted managed services to customers.

We will have a high customer focus being actively involved in the support and development of the service including: the resolution of support cases, live service monitoring and maintenance, new service provision and continuous improvement projects. You will provide high quality operational and technical support to customers and will be responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. You must have excellent technical knowledge across Microsoft public Cloud Services (Azure and Microsoft 365). You should have a good knowledge of security practices working in a regulated environment and the flexibility to work out of hours will be required, including on call.

This is an exciting opportunity for a highly experienced Microsoft Azure Cloud Engineer with operational support and project delivery experience to provide L3/L4 analytical incident management and resolution alongside project-based deliverables across a large, expanding customer base to ensure quality service delivery and Service Level Agreement compliancy.

What you will be doing:

  • Working with a collaborative team of varied disciplines, skills, and experience

  • Contribute to the planning of application / infrastructure releases and configuration changes

  • Resolve support requests from customers by phone, email and online making use of the call logging system

  • Interact with key internal stakeholders and external third-party vendors to troubleshoot and resolve complex problems

  • Provide input to administering and maintaining all production and development environments

  • Create detailed technical and procedural documentation (e.g. architecture, configuration, and setup)

  • Design appropriate metrics for reporting on key performance and quality indicators, particularly in terms of in-depth trend analysis

What we are looking for:

  • Innovation should be first, thriving to innovate, automate and keen to improve

  • Microsoft Azure and its relevant build, deployment, automation, networking, and security technologies in cloud and hybrid environments.

  • AZ-104 – Microsoft Certified: Azure Administrator Associate

  • Server Infrastructure Engineering (Virtualization / Windows / Linux)

  • Office 365 / Microsoft 365 Administration

  • Network Engineering

  • DevOps (CI/CD, pipelines and Infrastructure as Code)

  • Ability to work well with individuals and teams

  • Experience with helpdesk IT Service Management Tools (e.g. BMC Remedy / Service Now).

  • Experience with Azure DevOps – deploying Infrastructure using CI/CD pipelines

  • Previously have worked with infrastructure-as-code and immutable builds (e.g. Terraform)

  • Experience with deployment and management of container technologies (e.g. Kubernetes, AKS and Docker)

What's in it for you?

  • A competitive basic salary

  • 23 days’ holiday (rising to 27) with the opportunity to buy extra leave

  • The opportunity to take a paid day out of the office, volunteering for our charity partners or a cause of your choice

  • Company matched pension, life assurance, a cycle2work scheme, 15 weeks’ fully paid maternity, adoption and shared parental leave, paternity pay of two weeks...and plenty more

  • Voluntary benefits designed to suit your lifestyle – from discounts on retail and socialising, to health & wellbeing, travel and technology

About Capita Technology and Software Solutions

Capita Technology and Software Solutions (TSS) is a 5000 people strong global shared service, responsible for delivering innovation and digital transformation for Capita’s colleagues, businesses and clients. 

We design, build and run the right technical competencies and partnerships to enable Capita to deliver seamless public and customer services – from working collaboratively with Capita’s businesses to shape the right technology and software solutions to take to market, to ensuring colleagues have access to resilient, predictable IT services and support, that enables them to work effectively and securely.

TSS is right at the heart of Capita, as we work to create a technology-led organisation. You’ll be part of a Capita-wide network of 55,000 experienced, innovative and dedicated individuals across multiple disciplines, sectors and countries. There are countless opportunities to learn new skills and develop in your career, and we’ll provide the support you need to do just that. Our purpose is to create a better outcome for you.

What we hope you'll do next:

Choose 'Apply now' to fill out our short application, so that we can find out more about you.

Location:

Home-Based - GBR

,

United Kingdom

Time Type:

Full time

Contract Type:

Permanent

Top Skills

DevOps
Microsoft 365
Azure
Office 365

Capita London, England Office

65 Gresham Street, , England , London, United Kingdom, EC2V 7NQ

Similar Jobs

2 Days Ago
Easy Apply
Hybrid
London, Greater London, England, GBR
Easy Apply
Mid level
Mid level
HR Tech • Software • Travel
As a Site Reliability Engineer (SRE) at TravelPerk, you will design and maintain cloud infrastructure, monitor system performance and reliability, improve automation processes, and collaborate with development teams to enhance application scalability while participating in on-call rotations to resolve production issues.
Top Skills: BashNode.jsPython
2 Days Ago
Easy Apply
Hybrid
London, England, GBR
Easy Apply
Senior level
Senior level
Cloud • Software
The Senior Site Reliability Engineer focuses on enhancing the observability of the ThousandEyes platform by implementing cloud-native monitoring tools, maintaining an alerting pipeline, and contributing to a robust incident response system. They are responsible for designing, deploying, and maintaining monitoring services that ensure proactive detection of issues across cloud environments.
Top Skills: GoPython
2 Days Ago
Easy Apply
Hybrid
London, England, GBR
Easy Apply
Entry level
Entry level
Cloud • Software
The Site Reliability Engineer will enhance observability for the ThousandEyes platform, focusing on cloud-native monitoring tools and automation. Responsibilities include designing and maintaining monitoring services, establishing best practices for instrumentation, and supporting the incident response process.
Top Skills: GoPython

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account