Capital on Tap

Site Reliability Engineer

Reposted 8 Days Ago

In-Office

London, Greater London, England

Mid level

As a Site Reliability Engineer, you'll ensure the reliability and performance of applications, design monitoring systems, and collaborate with teams to establish SLAs and SLOs while implementing scalable solutions.

The summary above was generated by AI

We’re Capital on Tap 👋
💳 Capital on Tap was founded with the mission to help small business owners and make their lives easier. Today, we provide an all-in-one business credit card & spend management platform that helps business owners save time and money. Capital on Tap proudly serves over 200,000 businesses across the world and our goal is to help 1 million small businesses by 2030.

Why Join Us?
We empower you to be innovative and solve complex problems. Take ownership, make an impact, and thrive in our scaling and agile environment.

Work Flexibly🏡🏢: This is a hybrid role; the SRE team works from our London (Moorgate) office 2 days per week.
Own Your Impact: Lead initiatives that affect thousands of customers.
Collaborate and Grow: Thrive in a supportive, agile culture that values learning, mentorship, and knowledge sharing.

SRE at Capital on Tap 🌞

At Capital on Tap, we run a hybrid embedded SRE model. We work closely with teams across the company to give them the best support we can. Our main objective currently is to gain as much visibility into our platform's health as possible while offering scalable solutions.

You’ll join our BAR (Banking and Repayments) Team, who are building and scaling payments and card systems, ensuring our financial platform is reliable, accessible, and seamless for users.

What You’ll be doing:

As a Site Reliability Engineer (SRE), you will help ensure our platforms are fast, reliable, and scalable. You’ll design, build, and monitor systems and prevent issues before they happen. Using SLAs, SLIs, and SLOs, you’ll guide feature launches while maintaining services that everyone can depend on.

Manage and automate Azure, Datadog, NGINX & Cloudflare
Develop and monitor Kubernetes and Serverless resources
Maintain infrastructure code with Terraform & CRDs / Crossplane
Improve systems, processes, and technologies; consult stakeholders to enhance platform performance
Get involved in new application architecture & design processes
Design solutions that reduce toil: automate repetitive tasks and streamline workflows to cut manual work and boost team productivity
Create SLIs and SLOs; increase application visibility
Align with the Product team on SLAs and core service objectives
Collaborate with Platform Engineers for automated solutions and pipelines
Enhance user experience with infrastructure and pipeline optimisation
Support CI/CD tools such as Azure DevOps, Octopus Deploy and Flux to streamline software delivery
Lead incident troubleshooting to safeguard customer experience

We’re Looking For 🔎
Required skills:

Experience in managing a public cloud (Azure advantageous)
Experience in Azure DevOps, Octopus, Flux or other CI/CD tools
Experience with Linux and Microsoft Systems
Excellent communication skills and ability to collaborate with multiple teams in an agile environment
Proficiency in IaC: writing, managing, and optimising infrastructure with tools such as Terraform and Pulumi
Experience working with a cloud monitoring solution (Datadog advantageous)
Experience with Kubernetes and Docker

Nice to have skills:

Experience with Chaos Engineering practices
Experience with IDPs
Experience with software cataloguing
Experience with observability and tracing best practices
Experience in Go (preferred), PowerShell (preferred), Python, C# or other scripting languages

Interview Process 🤝
🤝First stage: 30-minute intro and values call with Talent Partner (Video call)
🤝Second stage: 75-minute technical interview & questions with SRE Team Lead (Video call)
🤝Final stage: 45-minute CV overview with Head of Department & Engineering team (Video call)

Diversity & Inclusion 🌈
We welcome, consider and encourage applications from anyone who shares our commitment to inclusivity. Join us in creating a space where authenticity thrives, and everyone can do their best work.

Great Work Deserves Great Perks
We try not to take ourselves too seriously (all the time) so we make sure our office is decked out with a pool table, arcade machine, beer tap, and a couple of office dogs thrown in for good measure. Check out our benefits:
🏥 Private Healthcare including dental and opticians services through Vitality
✈️ Worldwide travel insurance through Vitality
🎁 Anniversary Rewards (£250, £500, £750, 4-week fully paid sabbatical)
👛 Salary Sacrifice Pension Scheme up to 7% match
🏖️ 28 days holiday (plus bank holidays)
📖 Annual Learning and Wellbeing Budget
👪 Enhanced Parental Leave
🚲 Cycle to Work Scheme
🚂 Season Ticket Loan
💬 6 free therapy sessions per year
🐶 Dog Friendly Offices
🍫 Free drinks and snacks in our offices

Check out more of our benefits, values and mission here.

Other Info
👍Check out our ‘Top Tips’ for interviewing.
✔️Keep updated on new job opportunities by following us on LinkedIn.
📧Email [email protected] if you have any questions.

Excited to work here? Apply!
If you’d like to progress your career within our fast-growing, profitable fintech, click Apply and we will aim to get back to you within 3 working days (during busy periods this could take up to 5 working days).

Top Skills

Azure, Azure DevOps, Datadog, Flux, Git, Istio, Kubernetes, Linux, Microsoft Systems, Octopus, PowerShell, Python, SQL, Terraform

7th Floor, The Tea Bldg, 56 Shoreditch High St, London, United Kingdom, E1 6JJ
