Cryptio Logo

Cryptio

Site Reliability Engineer

Reposted 4 Days Ago
Be an Early Applicant
UK
Senior level
UK
Senior level
As a Senior Site Reliability Engineer, you'll ensure Cryptio's platform reliability, observability, and incident response through system design and collaboration across teams.
The summary above was generated by AI

About Cryptio

We’re Cryptio. We build infrastructure to bring financial integrity to the crypto economy. Our enterprise-grade back-office and data platform power mission-critical accounting, reporting, and operational workflows for institutions, corporates, and crypto-native organisations.

We’re trusted by leaders like Circle, Societe Generale, Uniswap, Gemini, and the Government of El Salvador. We’ve raised $26m from top investors including Point Nine, 1kx, Tim Draper, and Ledger Cathay.

The Opportunity

We’re hiring a Site Reliability Engineer (SRE) to strengthen the reliability, observability, and performance of Cryptio’s platform. You'll work across our stack, with a heavy focus on AWS and Kubernetes, helping to make our systems faster, more stable, and easier to operate.

This is a role for a hands-on builder who can trace complex issues, design reliability into everything we ship, and take ownership of both infrastructure and application reliability. You’ll collaborate closely with senior engineers and the CTO to implement our reliability roadmap, improve monitoring and incident management, and automate our infrastructure.

It’s a great opportunity for an experienced SRE/Platform Engineer ready to lead in Kubernetes and AWS environments, take ownership of complex systems, and grow into senior reliability responsibilities.

Key Technologies

  • AWS

  • Kubernetes

  • PostgreSQL, Cassandra, ClickHouse

  • Pulumi, GitLab CI, Docker

  • Grafana, Prometheus, Loki, Jaeger

  • Rust, TypeScript (Node.js. Nest.js, React, OpenAPI)

What you'll do

  • Drive reliability improvements across AWS and Kubernetes, as well as application layers

  • Enhance observability by refining logs, metrics, and traces

  • Support incident response and contribute to postmortem analysis and automation

  • Maintain and evolve AWS and Kubernetes environments with a focus on scalability and cost efficiency

  • Collaborate with engineers to make deployments and services more reliable

  • Automate infrastructure through Infrastructure-as-Code and CI/CD improvements

  • Contribute ideas and best practices to strengthen our reliability culture

We’re looking for someone who

  • Has 3+ years of experience in Software Engineering, SRE, DevOps, or Infrastructure roles

  • Has strong, practical experience with AWS and deep understanding of core AWS primitives (IAM, EC2, EKS, RDS, VPC, Subnets, S3, EBS, ELB)

  • Has significant experience with Kubernetes (ideally beyond basic usage: e.g. developed/used operators like zalando-postgres, Kyverno, Keda, installed on bare-metal, used Traefik, or developed custom operators)

  • Has hands-on experience with databases such as PostgreSQL, Cassandra, or ClickHouse

  • Knows at least one programming language well enough to debug and improve systems

  • Has used Infrastructure-as-Code tools like Pulumi or Terraform

  • Enjoys debugging complex systems and improving performance or reliability

  • Communicates clearly and collaborates effectively with software teams

  • Is curious, systematic, and eager to grow into a senior reliability role

  • (Bonus) Experience or interest in Rust, TypeScript, or crypto/finance/data systems

Why you’ll love this role

  • Work on a high-impact platform powering top crypto and finance institutions

  • Gain broad exposure across infrastructure, application, and data layers

  • Execute on a clear reliability strategy with guidance from the CTO

  • Opportunity to develop into a senior SRE over time

  • Fully remote, with opportunities to visit our hubs in Paris or London

Interview Process

  • Talent Screen (15–30 min): Quick call to discuss your background, Cryptio, and the role

  • Technical Interview (60 min): Practical discussion around AWS/Kubernetes/cloud infrastructure, observability, and incident handling

  • Team Interview (45 min): Meet an engineer and product manager to explore collaboration

  • CTO Interview (45 min): Discussion about roadmap, mentorship, and your growth path at Cryptio

Perks

👩‍💻 Remote or Hybrid working

🏝️ 25 days paid holiday plus bank holidays

🙌 One additional day of annual leave each year, up to 30 total days

🎂 Your birth off

🧘 Mental health resources, wellbeing programs, and professional coaching

🫶 Family-friendly policies

💪 Fitness and wellness budget

💻 MacBook Pro

🖥️ $200 home office setup budget

🎓Training and development budget

*** we have additional benefits depending on location

If this sounds like you, we would love to hear from you 🙌

At Cryptio, we move fast and take ownership of outcomes. We learn from failures, celebrate wins, and let humility, curiosity, and a passion for crypto guide how we work. If you value collaboration and want to build with purpose, you’ll feel right at home here.

Top Skills

AWS
Cassandra
Clickhouse
Docker
Gitlab Ci
Grafana
Jaeger
Kubernetes
Loki
Postgres
Prometheus
Pulumi
Rust
Typescript
HQ

Cryptio London, England Office

5 Alan Road, London, United Kingdom, SW19 7PT

Similar Jobs

8 Days Ago
Hybrid
Bournemouth, Dorset, England, GBR
Senior level
Senior level
Financial Services
As a Lead Site Reliability Engineer, you'll lead SRE practices and cloud application management, mentor teams, and enhance system reliability.
Top Skills: ApmAWSTerraform
11 Days Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Fintech • Information Technology • Financial Services
Lead the Site Reliability Engineering team by implementing SRE best practices, automating solutions, and improving cloud-native architectures. Collaborate with teams to enhance performance, reliability, and incident management, while driving innovation in CI/CD and observability initiatives.
Top Skills: AksAWSAzureBashDockerEksElkGCPGithub ActionsGrafanaJavaJenkinsKubernetesOpenshiftPrometheusPythonTerraform
14 Days Ago
Hybrid
London, Greater London, England, GBR
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
We are seeking Systems Reliability Engineers (SRE) to ensure operational excellence of our Edge platform, focusing on automation, monitoring, and service performance. Candidates should possess strong Linux, networking, and programming skills primarily in Go or Python, along with 3 years of SRE experience.
Top Skills: ApacheBgpDnsDockerGoGrafanaGraphiteHaproxyHTTPIp AnycastLinuxNginxOpentsdbPrometheusPythonSaltSQLSquidVarnish

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account