Thredd

Site Reliability Engineer

Posted 13 Days Ago
Hybrid
London, Greater London, England, GBR
Mid level
As a Site Reliability Engineer, you will design scalable network solutions, mentor engineers, drive automation, and ensure system reliability while collaborating with stakeholders.
The summary above was generated by AI

Are you passionate about building reliable, scalable, and high-performing systems? Do you thrive on solving complex infrastructure challenges while driving automation and observability best practices? If so, we want to hear from you!

At Thredd, we’re looking for a Site Reliability Engineer to act as a North Star for this evolving discipline. As our first engineer in this role, you’ll have the unique opportunity to shape our SRE strategy, establish best practices, and set the standard for service reliability and performance.

The Impact You’ll Have as a Site Reliability Engineer

  • Design and oversee the implementation of complex, secure, and scalable network solutions that support global transaction processing.

  • Lead network innovation by identifying opportunities to adopt emerging technologies and drive efficiency.

  • Coordinate and prioritise network-related initiatives across teams, balancing operational needs with strategic growth.

  • Mentor and support engineers within the team, fostering technical excellence and a customer-focused mindset.

  • Drive performance and reporting, delivering insights and data that help optimise system health and uptime.

  • Collaborate with stakeholders, vendors, and service providers to ensure seamless integration and service quality.

  • Develop and enforce quality assurance protocols and documentation standards across our network landscape.

  • Own strategic network planning, ensuring infrastructure evolves in step with our product and market expansion.

What You’ll Bring to the Site Reliability Engineer Position

  • Proven experience building and maintaining infrastructure, tooling, and technical foundations at scale.

  • Strong track record of ensuring high service uptime and reliability to empower product teams to innovate effectively.

  • Expertise in shaping and evolving core technology layers that underpin a successful, high-growth platform.

  • Proven experience implementing SRE principles at scale, including deep knowledge of SLI/SLO/SLA differences.

  • A product engineering background with strong coding skills in Python or similar.

  • Experience with incident management frameworks and evolving them for efficiency.

  • Expertise in cloud platforms (AWS preferred) and container orchestration (Docker, Kubernetes, ECS).

  • Solid understanding of microservices, service mesh, and modern architectural concepts.

  • A collaborative mindset – you thrive on helping others and driving company-wide impact.

Nice to Have

  • Experience working in regulated industries (e.g., PCI compliance).

  • Background in capacity planning, performance, and load testing.

  • Sysadmin skills for troubleshooting disk, network, and infrastructure issues.

Where you’ll work

Our working model varies depending on the specific role and team requirements. We strive to provide flexibility whilst ensuring that each position is best supported for optimal collaboration and performance.

This Site Reliability Engineer position requires you to be in the London office (Holborn) one day per week.

About us

Thredd is the trusted next-gen payments partner for innovators looking to modernise their payments offering. Certified by Mastercard, Visa and Diners & Discover, we process billions of debit, prepaid, and credit transactions annually, supporting consumer and corporate fintechs, digital banks, and embedded finance providers across the globe. Our unique offering is our client-centric approach, combining hands-on support with modern, reliable, and scalable technology.

Our assured solution accelerates the development and delivery of consumer and corporate payments components embedded within digital banks, as well as for expense management, B2B payments, crypto, lending, credit, Buy Now Pay Later, FX, remittance, and open banking innovators.

Since 2007, Thredd has enabled market leaders through our highly reliable, secure, and scalable platform and supported many of our clients' growth journeys, from early-stage startup through to globally recognised unicorns, including Monzo, Revolut, and Starling.

Diversity and Inclusion at Thredd

Here at Thredd, we are committed to building a diverse and inclusive workplace where everyone feels valued, respected and empowered. We welcome applications from people of all backgrounds, experiences and identities. If you require any adjustments during the recruitment process, please let us know and we would be happy to support you.

Our Values

Our values-driven culture is what unites our teams globally, and our teams are what drive our success.

Here is what the values mean for you in this position:

  • Own it and deliver – Taking responsibility for your own performance and being successful in your own role
  • Collaborate purposefully – Building trusted relationships with colleagues, supporting activities and being successful together
  • Think differently – Asking questions to check understanding and sharing your ideas to support continuous improvement
  • Act courageously – Stepping out of your comfort zone and embracing change to help you learn and grow

Top Skills

AWS
Docker
ECS
Kubernetes
Python

HQ

Thredd London, England Office

Kingsbourne House, 229-231 High Holborn, London, United Kingdom, WC1V 7DA


