Capital on Tap Logo

Capital on Tap

Site Reliability Engineer

Reposted 8 Days Ago
Be an Early Applicant
Easy Apply
In-Office
London, Greater London, England
Mid level
Easy Apply
In-Office
London, Greater London, England
Mid level
As a Site Reliability Engineer, you'll ensure the reliability and performance of applications, design monitoring systems, and collaborate with teams to establish SLAs and SLOs while implementing scalable solutions.
The summary above was generated by AI

We’re Capital on Tap 👋
💳 Capital on Tap was founded with the mission to help small business owners and make their lives easier. Today, we provide an all-in-one business credit card & spend management platform that helps business owners save time and money. Capital on Tap proudly serves over 200,000 businesses across the world and our goal is to help 1 million small businesses by 2030.

Why Join Us?
We empower you to be innovative and solve complex problems. Take ownership, make an impact, and thrive in our scaling and agile environment.

  • Work Flexibly🏡🏢: This is a Hybrid role, the SRE team work from our London (Moorgate) Offices 2 days per week. 
  • Own Your Impact: Lead initiatives that affect thousands of customers.
  • Collaborate and Grow: Thrive in a supportive, agile culture that values learning, mentorship, and knowledge sharing.

SRE at Capital On Tap 🌞

At Capital On Tap, we run a hybrid embedded SRE model. We aim to work closely with the teams within Capital On Tap to provide them the best support. Our main objective currently is to gain as much visibility into our platform's health while offering scalable solutions. 

You’ll join our BAR (Banking and Repayments) Team, who are building and scaling payments and card systems, ensuring our financial platform is reliable, accessible, and seamless for users. 

What You’ll be doing: 

As a Site Reliability Engineer (SRE) you will help ensure our platforms are fast, reliable, and scalable. You’ll design, build, and monitor systems, prevent issues before they happen. Using SLAs, SLIs, and SLOs, you’ll guide feature launches while maintaining services that everyone can depend on.

  • Manage and automate Azure, Datadog, NGINX & Cloudflare
  • Develop and monitor Kubernetes and Serverless resources
  • Maintain infrastructure code with Terraform & CRDs / Crossplane
  • Improve systems, processes, and technologies; consult stakeholders to enhance platform performance
  • Getting involved in new application architecture & design processes
  • Design solutions to reduce toil, automate repetitive tasks and streamline workflows to reduce manual work and boost team productivity.
  • Create SLIs and SLOs; increase application visibility
  • Align with the Product team on SLAs and core service objectives
  • Collaborate with Platform Engineers for automated solutions and pipelines
  • Enhance user experience with infrastructure and pipeline optimisation 
  • Support CI/CD tools such as Azure Devops, Octopus Deploy and Flux to streamline software delivery
  • Lead incident troubleshooting to safeguard customer experience

We’re Looking For 🔎
Required skills:

  • Experience in managing a public cloud (Azure advantageous)
  • Experience in Azure DevOps, Octopus, Flux or other CI/CD tools
  • Experience with Linux and Microsoft Systems
  • Excellent communication skills and ability to collaborate with multiple teams in an agile environment
  • Proficient in contributing to IaC technologies involving expertise in writing, managing, and optimising infrastructure with tools such as Terraform and Pulumi 
  • Experience working with a cloud monitoring solution (advantageous to have DataDog) 
  • Experience with Kubernetes and Docker

Nice to have skills:

  • Experience with Chaos Engineering practices
  • Experience with IDPs 
  • Experience with software cataloguing
  • Experience with observability and tracing best practices
  • Experience in Go (preferred), Powershell (preferred), Python, C# or other scripting languages

Interview Process 🤝
🤝First stage: 30 minute intro and values call with Talent Partner (Video call)
🤝Second stage: 75 minute technical & questions with SRE Team lead (Video call)
🤝Final stage: 45 minute CV overview with Head of department & Engineering team (Video call)

Diversity & Inclusion 🌈
We welcome, consider and encourage applications from anyone who shares our commitment to inclusivity. Join us in creating a space where authenticity thrives, and everyone can do their best work.

Great Work Deserves Great Perks
We try not to take ourselves too seriously (all the time) so we make sure our office is decked out with a pool table, arcade machine, beer tap, and a couple of office dogs thrown in for good measure. Check out our benefits:
🏥 Private Healthcare including dental and opticians services through Vitality
✈️ Worldwide travel insurance through Vitality
🎁 Anniversary Rewards (£250, £500, £750, 4-week fully paid sabbatical)
👛 Salary Sacrifice Pension Scheme up to 7% match
🏖️ 28 days holiday (plus bank holidays)
📖 Annual Learning and Wellbeing Budget
👪 Enhanced Parental Leave
🚲 Cycle to Work Scheme
🚂 Season Ticket Loan
💬 6 free therapy sessions per year
🐶 Dog Friendly Offices
🍫 Free drinks and snacks in our offices

Check out more of our benefits, values and mission here.

Other Info
👍Check out our ‘Top Tips’ for interviewing.
✔️Keep updated on new job opportunities by following us on Linkedin.
📧Email [email protected] if you have any questions.

Excited to work here? Apply!
If you’d like to progress your career within our fast growing, profitable fintech then click apply and we will aim to get back to you within 3 working days (during busy periods this could take up to 5 working days.)

Top Skills

Azure
Azure Devops
C#
Datadog
Flux
Git
Istio
Kubernetes
Linux
Microsoft Systems
Octopus
Powershell
Python
SQL
Terraform

Capital on Tap London, England Office

7th Floor, The Tea Bldg, 56 Shoreditch High St, London, United Kingdom, E1 6JJ

Similar Jobs

8 Days Ago
Hybrid
Bournemouth, Dorset, England, GBR
Senior level
Senior level
Financial Services
As a Lead Site Reliability Engineer, you'll lead SRE practices and cloud application management, mentor teams, and enhance system reliability.
Top Skills: ApmAWSTerraform
11 Days Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Fintech • Information Technology • Financial Services
Lead the Site Reliability Engineering team by implementing SRE best practices, automating solutions, and improving cloud-native architectures. Collaborate with teams to enhance performance, reliability, and incident management, while driving innovation in CI/CD and observability initiatives.
Top Skills: AksAWSAzureBashDockerEksElkGCPGithub ActionsGrafanaJavaJenkinsKubernetesOpenshiftPrometheusPythonTerraform
14 Days Ago
Hybrid
London, Greater London, England, GBR
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
We are seeking Systems Reliability Engineers (SRE) to ensure operational excellence of our Edge platform, focusing on automation, monitoring, and service performance. Candidates should possess strong Linux, networking, and programming skills primarily in Go or Python, along with 3 years of SRE experience.
Top Skills: ApacheBgpDnsDockerGoGrafanaGraphiteHaproxyHTTPIp AnycastLinuxNginxOpentsdbPrometheusPythonSaltSQLSquidVarnish

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account