Qube Research & Technologies Logo

Qube Research & Technologies

Senior Site Reliability Engineer (SRE)

Posted 11 Days Ago
Be an Early Applicant
Easy Apply
In-Office
London, Greater London, England
Senior level
Easy Apply
In-Office
London, Greater London, England
Senior level
Join Qube Research & Technologies as a Senior Site Reliability Engineer to enhance observability, improve incident response, and ensure reliability on an engineering platform, while collaborating with software teams.
The summary above was generated by AI

Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology- and data-driven group implementing a scientific approach to investing. Combining data, research, technology, and trading expertise has shaped our collaborative mindset, which enables us to solve the most complex challenges. QRT’s culture of innovation continuously drives our ambition to deliver high-quality returns for our investors.

You will join the Platform team focused on improving reliability and day-to-day operability for an actively used and growing engineering platform. The team works closely with software engineers and platform owners to improve observability, incident response, and reliability outcomes, while keeping long-term service ownership with the teams that build and run the services.

Your Future Role within QRT

You will:

  • Own the effectiveness of the observability platform, ensuring high-quality signals, alert fidelity, and ongoing suitability as the platform scales
  • Build and maintain actionable, low-noise dashboards and alerting across metrics and logs
  • Improve incident detection, response, and follow-up, ensuring corrective actions are implemented in systems, configuration, or automation
  • Define and apply SLIs and SLOs where they support operational decision-making
  • Improve reliability, scalability, and operability of core services through hands-on engineering changes
  • Identify recurring failure patterns and reduce manual operational work through automation and improved defaults
  • Apply Infrastructure as Code across observability and supporting systems
  • Develop tooling and automation in Go (preferred) or Python
  • Introduce shared patterns, defaults, and documentation that reduce repeated bespoke work
  • Partner with service-owning teams to deliver measurable reliability improvements without transferring long-term service ownership to SRE

Your Present Skillset

  • Strong practical experience applying Site Reliability Engineering principles in production environments
  • Strong Linux systems knowledge
  • Experience building and operating containerised workloads (Docker or Podman)
  • Strong development experience in Go (preferred) or Python
  • Strong experience querying and reasoning about metrics using PromQL
  • Hands-on experience with Grafana, including dashboarding and alerting
  • Experience deploying and operating centralised logging systems
  • Strong Infrastructure as Code experience
  • OpenTelemetry experience (metrics, logs, traces)
  • Terraform and/or Ansible experience, plus familiarity with CI/CD pipelines
  • Kubernetes and cloud-native platform experience
  • Exposure to datacentre services and compute/hardware-backed platforms
  • AWS infrastructure configuration and deployment experience
  • Evidence of reducing operational load and recurring incidents in growing systems

QRT is an equal opportunity employer. We welcome diversity as essential to our success. QRT empowers employees to work openly and respectfully to achieve collective success. In addition to professional achievement, we are offering initiatives and programs to enable employees achieve a healthy work-life balance.

Top Skills

Ansible
Ci/Cd
Docker
Go
Grafana
Podman
Promql
Python
Terraform

Qube Research & Technologies London, England Office

160 Victoria Street, London, United Kingdom, SW1E 5LB

Similar Jobs

3 Days Ago
In-Office
Wellington Place, West Midlands, England, GBR
Senior level
Senior level
Fintech • Software • Financial Services
The Senior Site Reliability Engineer will enhance infrastructure reliability, manage CI/CD pipelines, and handle service issues through engineering solutions.
Top Skills: AzureAzure DevopsBashDynatraceGCPGroovyJenkinsKubernetesPowershellPythonTerraform
3 Days Ago
In-Office or Remote
London, Greater London, England, GBR
Senior level
Senior level
Artificial Intelligence • HR Tech • Software
The Senior Platform Engineer will build and evolve managed services, improve incident response, design secure solutions, and provide technical leadership.
Top Skills: ElasticsearchGoIstioKafkaKubernetesMongoDBNode.jsPostgresTerraform
6 Days Ago
Easy Apply
In-Office
London, Greater London, England, GBR
Easy Apply
Senior level
Senior level
eCommerce
As a Senior Software Engineer, you'll design, build, and maintain Kubernetes and observability systems, support engineering teams, and foster a culture of continuous learning.
Top Skills: AWSEksElkGrafanaHoneycombJavaJavaScriptKubernetesLokiMimirOtelPrometheusPythonTerraform

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account