Valstro Logo

Valstro

Site Reliability Engineer (SRE)

Posted 21 Days Ago
Be an Early Applicant
United Kingdom
Mid level
United Kingdom
Mid level
As a Site Reliability Engineer, you will ensure the reliability of the cloud-native trading platform by automating processes, developing monitoring solutions, responding to incidents, and collaborating with development teams. You will also participate in on-call rotations to provide 24/7 support and continuously improve systems and processes for better performance.
The summary above was generated by AI

Description
Who are we?

Valstro is a recent (mid-2021) FinTech partnership working to deliver next-gen, Cloud-First, trading solutions to global, multi-asset-class institutional clients. You may call us a startup or a “baby enterprise.” Regardless of the term you prefer, we are a “people-first” company: all the value that we bring to clients will come from the efforts of a collaborative, motivated and well-supported team.

The applications that we are building are highly modular, well-tested, well-documented, and internally discoverable - all of which are, we believe, the not-so-secret sauce that will enable us to scale the product and the business.

Our overarching commercial goal is to shake up an industry that is overdue for tech-driven disruption as we expand our capabilities and reach, pushing established industry practices forward in every respect. We are tackling these challenges because we believe that our clients deserve better, and if that vision appeals to you, then read on.

Requirements
What are we looking for?

We are seeking a candidate to fulfill the role of Site Reliability Engineer (SRE), ensuring the reliability, availability, and performance of our cloud native trading platform. The role entails building and maintaining infrastructure, automating process and working closely with the Development and Platform teams to ensure seamless integration and deployment of the service. 

The successful candidate will serve as an essential link between the wider organization, executive leadership, and external vendors. Their responsibilities will include ensuring system reliability, building and maintaining monitoring solutions for both production and UAT systems, automating operational tasks, responding to incidents, and continuously improving systems and processes.

What will you be doing?

·         Act as a key intermediary between engineering, executive leadership, and external vendors.

·         Ensure the reliability, availability, and performance of our cloud-based trading solutions.

·         Develop and maintain monitoring solutions to track system performance and reliability.

·         Automate operational tasks to improve efficiency and reduce manual intervention.

·         Collaborate with development teams to ensure seamless integration and deployment.

·         Respond to incidents and troubleshoot issues to minimize downtime.

·         Continuously improve systems and processes to enhance reliability and performance.

·         Participate in on-call rotations to provide 24/7 support for critical systems.


What you need to bring

A good portion of the following:

·         Strong experience in site reliability engineering, systems engineering, or a related field.

·         Proficiency in cloud-based infrastructure (e.g. AWS, Azure, or Google Cloud.)

·         Experience with monitoring and logging tools (e.g., Prometheus, Grafana observability stack).

·         Expertise in automation and scripting (e.g., Golang, Python, Bash, Terraform).

·         Knowledge of containerization and orchestration (e.g., Docker, Kubernetes).

·         Ability to effectively communicate and liaise between stakeholders, including internal teams, executive management, and external vendors.

·         Strong troubleshooting and problem-solving skills.

·         Experience in establishing and enhancing reliability engineering practices and processes.

·         Capable of operating effectively in a dynamic organizational environment with high delivery and quality expectations.

Benefits

Despite being a young company, Valstro offers an excellent benefits package with top-tier health insurance, 401k plans and highly competitive overall compensation.

Regardless of where you are sitting, Valstro is a wonderful place to work. Leadership brings a genuine wealth of experience and industry knowledge, and for a young company, we humbly believe that we have our product/market fit very carefully dialed in. As we move to execute and deliver the vision to clients, the Engineering team will need client-obsessed, delivery-focused high performers (with a healthy dose of humility, of course) that we can help grow into the FinTech leaders of the future. If this excites you, we would love to chat.

Top Skills

Go
Python

Similar Jobs

10 Hours Ago
Hybrid
Glasgow, City of Glasgow, Scotland, GBR
Senior level
Senior level
Financial Services
The Site Reliability Engineer III will enhance production stability through monitoring, automation, and reliability measures on complex systems. Responsibilities include designing resilient systems, engaging with development teams for reliability, responding to incidents, and supporting SRE best practices.
Top Skills: GoJavaPython
2 Days Ago
Easy Apply
Hybrid
London, Greater London, England, GBR
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software
The Senior Site Reliability Engineer will maintain uptime, implement resilient applications, deploy production apps, monitor performance, ensure security, automate disaster recovery, and drive operational improvements. Responsibilities also include collaborating with engineers on architectural changes and participating in recruitment efforts.
Top Skills: Amazon Web ServicesCi/CdDevOpsDockerKubernetesLinuxPrometheusPythonRestfulSreTerraform
2 Days Ago
London, Greater London, England, GBR
Senior level
Senior level
Financial Services
As a Site Reliability Engineer III, you will guide and assist in building scalable solutions, implement infrastructure as code, and promote best practices in site reliability. You will collaborate with software engineers on deployment strategies and ensure the reliability and availability of applications and platforms while utilizing technical observability tools.
Top Skills: JavaPython

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account