NewDay Logo

NewDay

Senior Site Reliability Engineer

Posted 3 Days Ago
Be an Early Applicant
In-Office
London, Greater London, England, GBR
Senior level
In-Office
London, Greater London, England, GBR
Senior level
Lead reliability initiatives across the platform by automating infrastructure and operational processes, building observability (monitoring, logging, tracing), driving incident management and root cause analysis, and collaborating with engineering teams to embed SRE practices, resilience, and performance into delivery.
The summary above was generated by AI

Mission Statement & Summary

As a Senior Site Reliability Engineer, you'll sit at the intersection of software engineering and operations, driving reliability, performance, automation, and resilience across our technology estate.

This is an opportunity to shape the future of our platform rather than simply maintain it. You'll work alongside talented engineers, influence technical direction, and champion modern reliability practices that enable teams to move faster with confidence. If you're passionate about solving complex problems, eliminating toil through automation, and creating systems that are resilient by design, we'd love to hear from you.

How you'll contribute

  • Lead initiatives that improve platform reliability, scalability, and operational excellence.

  • Design and deliver automation solutions that reduce manual effort and accelerate engineering teams.

  • Develop observability capabilities, enabling proactive monitoring and faster incident resolution.

  • You will facilitate incident management, driving root cause analysis and continuous improvement.

  • You'll collaborate with engineering teams to embed reliability, resilience, and performance into every stage of delivery.

We're looking for these essential skills

  • Software engineering and design experience (preferably .net/C#), to build and improve production systems, apply solid design principles, and contribute directly to codebases to deliver reliable, scalable, and maintainable services.

  • The ability to automate infrastructure, operational processes, and deployments using modern engineering practices.

  • Experience building effective observability solutions, including monitoring, logging, alerting, and tracing.

  • Strong problem-solving skills with the ability to diagnose and resolve complex production issues.

  • The ability to influence technical decisions and collaborate effectively across engineering and business teams.

It's a plus if you also have these skills

  • Experience operating Kubernetes-based platforms at scale.

  • Knowledge of Infrastructure as Code tools and cloud platform services.

  • Experience implementing Site Reliability Engineering principles, including SLOs, SLIs, and error budgets.

  • Familiarity with security, compliance, and resilience best practices within cloud environments.

  • Experience mentoring engineers and helping teams adopt modern operational and reliability practices.

At NewDay, we value all types of diversity. We’re an equal opportunity employer and believe that our differences create a vibrant, authentic working culture. We want all our colleagues to feel able to bring their whole selves to work. We don’t discriminate on the basis of protected characteristics or identities. We make sure that every job is crafted to be inclusive and that people with disabilities or caring responsibilities can take part in the application and interview process.

Tell us if you need accommodations: We’ll put reasonable adjustments in place to support you.

We work with Textio to make our job design and hiring inclusive.

PermanentSenior SRE role profile.docx
HQ

NewDay London, England Office

7 Handyside Street​, London, United Kingdom, N1C 4DA

Similar Jobs

5 Hours Ago
In-Office
Senior level
Senior level
Fintech • Software • Financial Services
Lead and operate the private cloud platform to ensure stability, performance and scalability. Manage VMware and container platforms (Kubernetes/OpenShift), use IaC and CI/CD, improve observability with Dynatrace, troubleshoot across infrastructure and applications, run post-mortems, and drive automation and platform improvements while working in Agile teams.
Top Skills: AnsibleAWSAzureCi/CdDockerDynatraceGCPInfrastructure As CodeKubernetesLinuxOpenshiftPuppetVMwareWindows
8 Days Ago
Easy Apply
Hybrid
London, Greater London, England, GBR
Easy Apply
Senior level
Senior level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Seeking a Senior Site Reliability Engineer to design and develop automation and infrastructure services that ensure reliable, scalable systems for business travelers, while collaborating with development and security teams.
Top Skills: AWSCi/CdCloudFormationDatadogGoJavaJenkinsKibanaMavenNewrelicNode.jsPythonSignalfxTerraform
10 Days Ago
In-Office or Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Information Technology • Consulting
Maintain and scale DevTools SRE systems: improve user-facing developer workflows, build fault-tolerant self-healing architecture, optimize performance, modify open- and closed-source tools (GitLab, TeamCity), and support users while measuring improvements via metrics.
Top Skills: ArtifactoryGitlabGoJavaJvmKotlinPythonRubyTeamcityUnix-Like Systems

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account