Senior Site Reliability Engineer

Lyrebird Health

Reposted 3 Days Ago

Remote

Hiring Remotely in United Kingdom

Senior level

As a Senior Site Reliability Engineer, you will ensure the reliability and performance of production systems, design scalable infrastructure, improve CI/CD processes, and lead incident management.

The summary above was generated by AI

The Role

We’re hiring a Senior SRE to own the reliability, scalability, and performance of our production systems as we continue to grow.

At Lyrebird, you won’t just respond to incidents. You’ll design the systems and standards that prevent them. That means building infrastructure that scales cleanly, creating deployment patterns that reduce risk, and ensuring we can detect and resolve issues before they impact users.

This is a broad role that sits across platform engineering, DevOps, and security. You’ll be responsible for ensuring our systems are resilient under load, observable in real time, and able to scale as usage increases.

You’ll play a key role in how we get code from a developer’s machine into production safely, and how we operate those systems once they’re live.

About Us

Lyrebird Health builds AI-powered tools that reduce the administrative burden on clinicians and improve the quality and accessibility of healthcare.

Our platform is used by thousands of clinicians across multiple markets. As we grow, we’re focused on building systems that are reliable, scalable, and trusted in high-stakes environments.

What you'll do

Keep production systems online and restore them quickly when they fail

Lead and manage incidents, making high-quality decisions under pressure

Design and implement scalable infrastructure and deployment patterns

Build and improve CI/CD pipelines and release systems

Improve monitoring, telemetry, and observability across the stack

Own cloud infrastructure, security, and access controls

Work closely with engineers to ensure systems are built to scale from day one

What you'll bring

5–7 years of experience in SRE, platform engineering, or DevOps roles

Strong AWS experience (ECS/Fargate, EC2, Lambda, SQS, IAM)

Experience running and scaling production systems

Strong understanding of distributed systems and scaling approaches

Hands-on experience with Docker and containerised environments

Experience with Kubernetes or ECS

How you work

You take ownership and follow things through

You’re proactive and comfortable operating with ambiguity

You stay calm and make good decisions during incidents

You focus on solving problems end to end

You’re willing to roll up your sleeves and get into the detail

This is a critical hire for us as we scale.

If you want real ownership over how systems are designed, deployed, and operated, and the opportunity to build reliability into a product used in high-stakes environments, we’d love to hear from you.

We’re building a team that reflects the diversity of the people who use our product. If you’re from an underrepresented background in tech, we strongly encourage you to apply, even if you don’t meet every requirement.
