A.P. Moller - Maersk

SRE Senior Engineer

Reposted 12 Days Ago
In-Office
SL6 8AA, Maidenhead, England
Senior level
As a Senior SRE Engineer, you will enhance operational reliability through observability, automation, and collaboration with product teams, driving continuous improvement and incident management.
The summary above was generated by AI

Role Introduction

At Maersk, we are transforming to become the global integrator of container logistics by simplifying and connecting our customers’ supply chains. As part of this journey, we are strengthening the Technology capabilities of APM Terminals (APMT), a core Maersk business operating 74 port and terminal facilities across 38 countries. The Technology Operations group is accountable for delivering availability, performance, cost efficiency, and fit-for-purpose solutions across APMT.

We are looking for Senior Software Engineers (SRE) who can apply strong software engineering fundamentals to reliably run and continuously improve our infrastructure and applications. As part of the SRE team, you will design, build, and automate capabilities that uplift operational excellence and customer experience. You will help drive our reliability agenda across infrastructure, applications, data, automation and AIOps, while shaping a culture where engineering and operations work seamlessly together.

The team will play a pivotal role in productising observability, automation, integrations, performance engineering, DevOps workflows, and advanced AIOps capabilities to ensure high availability, resilience and scalability of our services across hybrid environments.

Key Responsibilities

As a Senior SRE Engineer, you will:

  • Drive the full service lifecycle, from design and deployment to operations and continuous improvement.
  • Build and implement foundational SRE capabilities such as SLOs/SLIs, observability platforms, status pages, chaos engineering, toil reduction, automated deployments and intelligent runbooks.
  • Champion reliability, automation, and resilience patterns ensuring fault tolerance and exceptional customer experience.
  • Engineer and optimise infrastructure, monitoring, and AIOps systems using modern technologies and strong development skills.
  • Lead enterprise-level triage during incidents, guide stakeholders, and drive deep post-incident problem management and RCA outcomes.
  • Support production operations while contributing to transformation initiatives across on-premise and cloud platforms.
  • Collaborate with product and engineering teams to define and maintain SLOs aligned to business outcomes.
  • Leverage data engineering and analytics to derive insights, automate decision-making, and improve operational intelligence.

Who You Are

We are seeking passionate engineers who demonstrate ownership, curiosity, and the ability to solve complex problems with a first-principles approach. Ideal candidates will bring:

  • 4–5 years of SRE experience, with 10+ years overall in large-scale enterprise environments (data center and cloud, Azure/AWS preferred).
  • Deep technical expertise in one or more areas of full-stack development and production operations, plus strong advocacy for open-source technologies.
  • Experience with monolithic, SOA, microservices, and distributed systems architectures; exposure to transformation programs is a plus.
  • Hands-on experience building enterprise observability, integrating metrics, logs, traces, alerts and automation pipelines.
  • Strong coding skills in one or more modern languages (e.g., Python, Go, Java, C#, Node.js).
  • A strong foundation in performance engineering, scalability, debugging complex production issues and automation at scale.
  • Excellent communication skills with the ability to simplify complex technical concepts for diverse audiences.

What Success Looks Like

Success in this role means you are elevating reliability engineering across platforms by maturing observability, improving performance, and reducing toil in meaningful ways. You are shaping stronger operational behaviours within teams, driving automation-first practices, and influencing how services are designed, deployed, and operated. You lead complex incident resolution, create clarity during ambiguity, and ensure post-incident actions translate into lasting improvements. Your work enables product and platform teams to deliver faster, operate with confidence, and consistently meet the reliability expectations of our customers.

Maersk is committed to a diverse and inclusive workplace, and we embrace different styles of thinking. Maersk is an equal opportunities employer and welcomes applicants without regard to race, colour, gender, sex, age, religion, creed, national origin, ancestry, citizenship, marital status, sexual orientation, physical or mental disability, medical condition, pregnancy or parental leave, veteran status, gender identity, genetic information, or any other characteristic protected by applicable law. We will consider qualified applicants with criminal histories in a manner consistent with all legal requirements.

We are happy to support your need for any adjustments during the application and hiring process. If you need special assistance or an accommodation to use our website, apply for a position, or perform a job, please contact us by emailing [email protected]

Top Skills

AWS
Azure
C#
Go
Java
Node.js
Python
