The Site Reliability Engineer will troubleshoot network issues, manage servers, automate infrastructure, maintain cloud services, and implement monitoring systems.
At this stage of growing, we are looking for a Site Reliability Engineer.
Responsibilities
- Troubleshooting network availability issues and accounting for failures;
- Managing remote brokers and a fleet of application servers;
- Participating in infrastructure growth and automation using Python and/or Go;
- Maintaining cloud infrastructure and virtual machines;
- Creating a monitoring system for production trading systems using Zabbix, Influx, and Grafana;
- You will have the opportunity to work with various exotic networks such as radio relay and shortwave, FPGA cards, atomic clocks, and do server tuning and overclock servers.
Requirements
- Working with highload systems for at least 3 years;
- Deploy, configure, and administer Linux based servers;
- Understand the operation of network services;
- Know the basic Linux services, PXE, DHCP, DNS;
- Be able to work with configuration management systems (Ansible);
- Know any monitoring system: Zabbix, Prometheus, Grafana, Kibana;
- Excellent knowledge of English language (starting from B2 - Upper-Intermediate level);
- Basic Python/Bash/Go skills;
- Availability for work trips.
Would be great if you had this
- Knowing the major lines of server hardware from leading vendors;
- Working experience with remote Git repositories;
- Experience in providing technical support for cloud services.
What we offer
- High base salary and social benefits;
- Generous bonus structure. We are very flexible in discussing salary and conditions of employment;
- Cutting-edge hardware and software in production as well as high technical expertise of the company which allows implementation of bold ideas and boosting great results. Ownership over initiatives that directly solve business problems;
- Ability to trade on dozens of international exchanges;
- Flexible workflow (lack of formalism and bureaucracy, no pressure and over-management) and working schedule;
- Tuition reimbursement, conference and training sponsorship.
Top Skills
Ansible
Dhcp
Dns
Go
Grafana
Influx
Pxe
Python
Zabbix
Similar Jobs
Fintech • Software • Financial Services
Manage an in-house infrastructure team and internal compute cluster for trading operations, focusing on operational management, incident response, and developing observability metrics.
Top Skills:
DockerGoGrafanaK8SLinuxPythonUnix
Software
We are seeking a skilled DevOps Engineer to enhance infrastructure, manage CI/CD pipelines, and ensure security and compliance in our trading platform.
Top Skills:
AnsibleAWSAzureBashCi/CdDockerElk StackGitlabGoGCPGrafanaInfrastructure As CodeJenkinsKubernetesPagerdutyPrometheusPythonTerraform
Artificial Intelligence • Information Technology • Consulting
As a Senior Site Reliability Engineer, you will enhance the reliability and performance of our inference platform, leveraging Kubernetes and Terraform while ensuring smooth scalability of systems under load.
Top Skills:
BashGrafanaKubernetesMlopsPrometheusPythonRayTerraformTritonVllm
What you need to know about the London Tech Scene
London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

