Site Reliability Engineer

Sorry, this job was removed at 12:01 p.m. (GMT) on Wednesday, Oct 29, 2025

Be an Early Applicant

Hybrid

London, England

Hybrid

London, England

Similar Jobs

JPMorganChase

Site Reliability Engineer

18 Days Ago

Hybrid

Bournemouth, Dorset, England, GBR

Senior level

Financial Services

As a Lead Site Reliability Engineer, you'll lead SRE practices and cloud application management, mentor teams, and enhance system reliability.

Top Skills: ApmAWSTerraform

MarketAxess

Site Reliability Engineer

21 Days Ago

Hybrid

London, Greater London, England, GBR

Senior level

Fintech • Information Technology • Financial Services

Lead the Site Reliability Engineering team by implementing SRE best practices, automating solutions, and improving cloud-native architectures. Collaborate with teams to enhance performance, reliability, and incident management, while driving innovation in CI/CD and observability initiatives.

Top Skills: AksAWSAzureBashDockerEksElkGCPGithub ActionsGrafanaJavaJenkinsKubernetesOpenshiftPrometheusPythonTerraform

Cloudflare

Reliability Engineer

9 Days Ago

Hybrid

London, Greater London, England, GBR

Mid level

Cloud • Information Technology • Security • Software • Cybersecurity

We are seeking Systems Reliability Engineers (SRE) to ensure operational excellence of our Edge platform, focusing on automation, monitoring, and service performance. Candidates should possess strong Linux, networking, and programming skills primarily in Go or Python, along with 3 years of SRE experience.

Top Skills: ApacheBgpDnsDockerGoGrafanaGraphiteHaproxyHTTPIp AnycastLinuxNginxOpentsdbPrometheusPythonSaltSQLSquidVarnish

Are you passionate about building reliable, scalable, and high-performing systems? Do you thrive on solving complex infrastructure challenges while driving automation and observability best practices? If so, we want to hear from you!

At Thredd, we’re looking for a Site Reliability Engineer to act as a North Star for this evolving discipline. As our first engineer in this role, you’ll have the unique opportunity to shape our SRE strategy, establish best practices, and set the standard for service reliability and performance.

The Impact You’ll Have as a Site Reliability Engineer

Design and oversee the implementation of complex, secure, and scalable network solutions that support global transaction processing.
Lead network innovation by identifying opportunities to adopt emerging technologies and drive efficiency.
Coordinate and prioritise network-related initiatives across teams, balancing operational needs with strategic growth.
Mentor and support engineers within the team, fostering technical excellence and a customer-focused mindset.
Drive performance and reporting, delivering insights and data that help optimise system health and uptime.
Collaborate with stakeholders, vendors, and service providers to ensure seamless integration and service quality.
Develop and enforce quality assurance protocols and documentation standards across our network landscape.
Own strategic network planning, ensuring infrastructure evolves in step with our product and market expansion.

What You’ll Bring to the Site Reliability Engineer Position

Proven experience building and maintaining infrastructure, tooling, and technical foundations at scale.
Strong track record of ensuring high service uptime and reliability to empower product teams to innovate effectively.
Expertise in shaping and evolving core technology layers that underpin a successful, high-growth platform.
Proven experience implementing SRE principles at scale, including deep knowledge of SLI/SLO/SLA differences.
A product engineering background with strong coding skills in Python or similar.
Experience with incident management frameworks and evolving them for efficiency.
Expertise in cloud platforms (AWS preferred) and container orchestration (Docker, Kubernetes, ECS).
Solid understanding of microservices, service mesh, and modern architectural concepts.
A collaborative mindset – you thrive on helping others and driving company-wide impact.

Nice to Have

Experience working in regulated industries (e.g., PCI compliance).
Background in capacity planning, performance, and load testing.
Sysadmin skills for troubleshooting disk, network, and infrastructure issues.

Where you’ll work

Our working model varies depending on the specific role and team requirements. We strive to provide flexibility whilst ensuring that each position is best supported for optimal collaboration and performance.

This Site Reliability Engineer position requires you to be in the London office (Holborn) one day per week.

About us

Thredd is the trusted next-gen payments partner for innovators looking to modernise their payments offering. Certified by Mastercard, Visa and Diners & Discover, we process billions of debit, prepaid, and credit transactions annually, supporting consumer and corporate fintechs, digital banks, and embedded finance providers across the globe. Our unique offering is our client-centric approach, combining hands-on support with modern, reliable, and scalable technology.

Our assured solution accelerates the development and delivery of consumer and corporate payments components embedded within digital banks, as well as for expense management, B2B payments, crypto, lending, credit, Buy Now Pay Later, FX, remittance, and open banking innovators.

Since 2007, Thredd has enabled market leaders through our highly reliable, secure, and scalable platform and supported many of our client's growth journeys - from early-stage startup through to globally recognized unicorns, including Monzo, Revolut, and Starling.

Diversity and Inclusion at Thredd

Here at Thredd, we are committed to building a diverse and inclusive workplace where everyone feels valued, respected and empowered. We welcome applications from people of all backgrounds, experiences and identities. If you require any adjustments during the recruitment process, please let us know and we would be happy to support you.

Our Values

Our values-driven culture is what unites our teams globally and our teams is what drives our success;

Here are what the values mean for you in this position;

Own it and deliver – Taking responsibility for your own performance and being successful in your own role
Collaborate purposefully – Building trusted relationships with colleagues, supporting activities and being successful together
Think differently – Asking questions to check understanding and sharing your ideas to support continuous improvement
Act courageously – Stepping out of your comfort zone and embracing change to help you learn and grow

Kingsbourne House 229-231 High Holborn London, London, United Kingdom, WC1V 7DA

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Thredd

Site Reliability Engineer

Similar Jobs

Site Reliability Engineer

Site Reliability Engineer

Reliability Engineer

Thredd London, England Office

What you need to know about the London Tech Scene