A.P. Moller - Maersk

SRE Senior Engineer

Reposted 3 Days Ago

Be an Early Applicant

In-Office

SL6 8AA, Maidenhead, England, GBR

Senior level

In-Office

SL6 8AA, Maidenhead, England, GBR

Senior level

As a Senior SRE Engineer, you will enhance operational reliability through observability, automation, and collaboration with product teams, driving continuous improvement and incident management.

The summary above was generated by AI

Role Introduction

At Maersk, we are transforming to become the global integrator of container logistics by simplifying and connecting our customers’ supply chains. As part of this journey, we are strengthening the Technology capabilities of APM Terminals (APMT), a core Maersk business operating 74 port and terminal facilities across 38 countries. The Technology Operations group is accountable for delivering availability, performance, cost efficiency, and fit-for-purpose solutions across APMT.

We are looking for Senior Software Engineers (SRE) who can apply strong software engineering fundamentals to reliably run and continuously improve our infrastructure and applications. As part of the SRE team, you will design, build, and automate capabilities that uplift operational excellence and customer experience. You will help drive our reliability agenda across infrastructure, applications, data, automation and AIOps, while shaping a culture where engineering and operations work seamlessly together.

The team will play a pivotal role in productising observability, automation, integrations, performance engineering, DevOps workflows, and advanced AIOps capabilities to ensure high availability, resilience and scalability of our services across hybrid environments.

Key Responsibilities

As a Senior SRE Engineer, you will:

Drive the full service lifecycle, from design and deployment to operations and continuous improvement.
Build and implement foundational SRE capabilities such as SLOs/SLIs, observability platforms, status pages, chaos engineering, toil reduction, automated deployments and intelligent runbooks.
Champion reliability, automation, and resilience patterns ensuring fault tolerance and exceptional customer experience.
Engineer and optimise infrastructure, monitoring, and AIOps systems using modern technologies and strong development skills.
Lead enterprise-level triage during incidents, guide stakeholders, and drive deep post-incident problem management and RCA outcomes.
Support production operations while contributing to transformation initiatives across on-premise and cloud platforms.
Collaborate with product and engineering teams to define and maintain SLOs aligned to business outcomes.
Leverage data engineering and analytics to derive insights, automate decision-making, and improve operational intelligence.

Who You Are

We are seeking passionate engineers who demonstrate ownership, curiosity, and the ability to solve complex problems with a first-principles approach. Ideal candidates will bring:

4–5 years of SRE experience, with 10+ years overall in large-scale enterprise environments (data center + cloud, Azure/AWS preferred).
Deep technical expertise in one or more areas of full-stack development and production operations, strong advocacy for open-source technologies.
Experience with monolithic, SOA, microservices, and distributed systems architectures, exposure to transformation programs is a plus.
Hands-on experience building enterprise observability, integrating metrics, logs, traces, alerts and automation pipelines.
Strong coding skills in one or more modern languages (e.g., Python, Go, Java, C#, Node.js).
A strong foundation in performance engineering, scalability, debugging complex production issues and automation at scale.
Excellent communication skills with the ability to simplify complex technical concepts for diverse audiences.

What Success Looks Like

Success in this role means you are elevating reliability engineering across platforms by maturing observability, improving performance, and reducing toil in meaningful ways. You are shaping stronger operational behaviours within teams, driving automation-first practices, and influencing how services are designed, deployed, and operated. You lead complex incident resolution, create clarity during ambiguity, and ensure post-incident actions translate into lasting improvements. Your work enables product and platform teams to deliver faster, operate with confidence, and consistently meet the reliability expectations of our customers.

Maersk is committed to a diverse and inclusive workplace, and we embrace different styles of thinking. Maersk is an equal opportunities employer and welcomes applicants without regard to race, colour, gender, sex, age, religion, creed, national origin, ancestry, citizenship, marital status, sexual orientation, physical or mental disability, medical condition, pregnancy or parental leave, veteran status, gender identity, genetic information, or any other characteristic protected by applicable law. We will consider qualified applicants with criminal histories in a manner consistent with all legal requirements.

We are happy to support your need for any adjustments during the application and hiring process. If you need special assistance or an accommodation to use our website, apply for a position, or to perform a job, please contact us by emailing [email protected].

Top Skills

AWS

Azure

Java

Node.js

Python

Similar Jobs

Employer Direct Healthcare

Senior Site Reliability Engineer

Yesterday

Hybrid

Senior level

Healthtech

As a Senior Site Reliability Engineer, you will ensure the reliability and performance of our Azure-based healthcare platform, implementing SRE practices, driving incident management, and automating operational tasks.

Top Skills: AzureAzure MonitorBashDatadogPowershellPythonTerraform

Axon

Senior Site Reliability Engineer

7 Days Ago

In-Office

London, Greater London, England, GBR

Senior level

Artificial Intelligence • Cloud • Social Impact • Software • Wearables

As a Sr Site Reliability Engineer, you will enhance cloud-native services, build foundational platforms, and influence engineering practices, ensuring high reliability and performance.

Top Skills: ArgocdAWSAzureBashCi/CdCloudFormationDockerGitGoKubernetesLinuxPythonTerraform

Renesas Electronics

Site Reliability Engineer

7 Days Ago

In-Office

Senior level

3D Printing

The Sr. DevOps/Site Reliability Engineer will optimize cloud infrastructure, enhance CI/CD processes, manage incidents, and maintain system reliability in a collaborative environment.

Top Skills: AWSDockerGitlabGrafanaJenkinsKubernetesPrometheusTerraformZabbix

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.