Qube Research & Technologies Logo

Qube Research & Technologies

Senior Site Reliability Engineer (SRE)

Reposted 2 Days Ago
Be an Early Applicant
Easy Apply
In-Office
London, Greater London, England, GBR
Senior level
Easy Apply
In-Office
London, Greater London, England, GBR
Senior level
Join Qube Research & Technologies as a Senior Site Reliability Engineer to enhance observability, improve incident response, and ensure reliability on an engineering platform, while collaborating with software teams.
The summary above was generated by AI

Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology- and data-driven group implementing a scientific approach to investing. Combining data, research, technology, and trading expertise has shaped our collaborative mindset, which enables us to solve the most complex challenges. QRT’s culture of innovation continuously drives our ambition to deliver high-quality returns for our investors.

You will join the Platform team focused on improving reliability and day-to-day operability for an actively used and growing engineering platform. The team works closely with software engineers and platform owners to improve observability, incident response, and reliability outcomes, while keeping long-term service ownership with the teams that build and run the services.

Your Future Role within QRT

You will:

  • Own the effectiveness of the observability platform, ensuring high-quality signals, alert fidelity, and ongoing suitability as the platform scales
  • Build and maintain actionable, low-noise dashboards and alerting across metrics and logs
  • Improve incident detection, response, and follow-up, ensuring corrective actions are implemented in systems, configuration, or automation
  • Define and apply SLIs and SLOs where they support operational decision-making
  • Improve reliability, scalability, and operability of core services through hands-on engineering changes
  • Identify recurring failure patterns and reduce manual operational work through automation and improved defaults
  • Apply Infrastructure as Code across observability and supporting systems
  • Develop tooling and automation in Go (preferred) or Python
  • Introduce shared patterns, defaults, and documentation that reduce repeated bespoke work
  • Partner with service-owning teams to deliver measurable reliability improvements without transferring long-term service ownership to SRE

Your Present Skillset

  • Strong practical experience applying Site Reliability Engineering principles in production environments
  • Strong Linux systems knowledge
  • Experience building and operating containerised workloads (Docker or Podman)
  • Strong development experience in Go (preferred) or Python
  • Strong experience querying and reasoning about metrics using PromQL
  • Hands-on experience with Grafana, including dashboarding and alerting
  • Experience deploying and operating centralised logging systems
  • Strong Infrastructure as Code experience
  • OpenTelemetry experience (metrics, logs, traces)
  • Terraform and/or Ansible experience, plus familiarity with CI/CD pipelines
  • Kubernetes and cloud-native platform experience
  • Exposure to datacentre services and compute/hardware-backed platforms
  • AWS infrastructure configuration and deployment experience
  • Evidence of reducing operational load and recurring incidents in growing systems

QRT is an equal opportunity employer. We welcome diversity as essential to our success. QRT empowers employees to work openly and respectfully to achieve collective success. In addition to professional achievement, we are offering initiatives and programs to enable employees achieve a healthy work-life balance.

Top Skills

Ansible
Ci/Cd
Docker
Go
Grafana
Podman
Promql
Python
Terraform

Qube Research & Technologies London, England Office

160 Victoria Street, London, United Kingdom, SW1E 5LB

Similar Jobs

8 Days Ago
In-Office or Remote
London, Greater London, England, GBR
Senior level
Senior level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Design, operate, and scale production blockchain node infrastructure across multiple clouds. Build and maintain Kubernetes clusters, IaC with Terraform, CI/CD automation, and integrate AI-assisted tooling. Provide 24/7 on-call incident response, partner with security, mentor engineers, and improve reliability for a fast-growing blockchain platform.
Top Skills: Agentic WorkflowsAWSBase)Blockchain Nodes (ArcBlue-Green DeploymentCanary DeploymentCi/CdContainer Image BuildsCursorEthereumGCPGoHelmKubernetesKubernetes ControllersKubernetes OperatorsObservabilityPythonRbacShellSmart ContractsSolanaSQLTerraform
2 Days Ago
In-Office
London, Greater London, England, GBR
Senior level
Senior level
Fintech • Software
As a Senior Site Reliability Engineer, you will maintain production resilience, implement observability technologies, lead incident response, and develop automation for operational improvement.
Top Skills: AIAirflowAnsibleCassandraGrafanaKubernetesLinuxLokiMongoDBOraclePrometheusPythonSQL ServerTerraformWindows
20 Days Ago
Easy Apply
Hybrid
London, England, GBR
Easy Apply
Senior level
Senior level
Fintech • Payments • Productivity • Financial Services
In this role, you will enhance the reliability of Credit Karma's infrastructure by automating, managing cloud databases, and collaborating with developers to ensure system performance and reliability.
Top Skills: Cloud Sql MysqlGoGoogle Cloud PlatformGrafanaKubernetesNew RelicPrometheusPythonSplunkTerraform

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account