Qube Research & Technologies Logo

Qube Research & Technologies

HPC Platform Management Engineer

Reposted 24 Days Ago
Be an Early Applicant
Easy Apply
In-Office
London, Greater London, England, GBR
Mid level
Easy Apply
In-Office
London, Greater London, England, GBR
Mid level
Develop and maintain HPC platforms, optimize workload scheduling, improve performance, and support team development in a collaborative environment.
The summary above was generated by AI

Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology and data driven group implementing a scientific approach to investing. Combining data, research, technology, and trading expertise has shaped QRT’s collaborative mindset which enables us to solve the most complex challenges. QRT’s culture of innovation continuously drives our ambition to deliver high quality returns for our investors.

Join QRT as a technologist within our Workload Scheduling (WLS) team. This key role supports both business and technology groups in integrating High Performance Computing (HPC) solutions, enabling scalable and efficient compute capabilities. You will be instrumental in developing, deploying, and maintaining HPC platforms that leverage Yellow Dog and Ray schedulers across cloud and on-premises infrastructures.

Your Future Role within QRT:

  • Develop and support scalable workload scheduling solutions for HPC environments
  • Collaborate with internal teams to adopt and optimize HPC platforms
  • Improve the performance, resilience, and observability of compute infrastructure
  • Contribute to infrastructure automation and continuous improvement initiatives
  • Share expertise and support team development through coaching and collaboration

Your Present Skillset:

  • Experience of engineering and supporting at least one HPC scheduler, such as YellowDog, Ray, Slurm or IBM Symphony 
  • Good understanding of both loosely coupled and tightly coupled HPC workloads 
  • Experience of developing and supporting large-scale systems (5000+ nodes) and high levels of concurrency (100k+ tasks) 
  • Experience of monitoring and visualisation of large-scale systems 
  • Performance tuning of compute, network and storage components 
  • Good understanding of the challenges of user authorisation in large scale distributed environments using AWS IAM and identity providers such as Okta 
  • Good understanding of core AWS services 
  • VPC security and networking 
  • EC2 configuration and scaling
  • Storage services S3, EFS, EBS and FSx 
  • CloudWatch / CloudTrail / OpenSearch / Athena 
  • Experience of developing Python applications and tools
  • Experience with infrastructure-as-code using configuration languages and tools, particularly Terraform and Ansible 
  • Solid understanding of Linux administration skills
  • Good understanding of various storage solutions and their applicability for different use cases 
  • Able to work in a fast-paced environment with multiple conflicting demands and changing priorities 
  • Effective communicator, able to describe complex issues at the appropriate level for a given audience 
  • Happy to coach colleagues and eager to learn from them 

QRT is an equal opportunity employer. We welcome diversity as essential to our success. QRT empowers employees to work openly and respectfully to achieve collective success. In addition to professional achievement, we are offering initiatives and programs to enable employees achieve a healthy work-life balance.

Top Skills

Ansible
AWS
Hpc
Ibm Symphony
Linux
Python
Ray
Slurm
Terraform
Yellow Dog

Qube Research & Technologies London, England Office

160 Victoria Street, London, United Kingdom, SW1E 5LB

Similar Jobs

2 Hours Ago
Hybrid
London, England, GBR
Senior level
Senior level
Fintech • Mobile • Payments • Software • Financial Services
The Senior Software Engineer will design, implement, and operate stream processing systems on AWS, focusing on real-time data processing and infrastructure automation to support Wise's mission.
Top Skills: Apache FlinkApache IcebergJavaKafkaKubernetesSpring
2 Hours Ago
In-Office or Remote
London, Greater London, England, GBR
Senior level
Senior level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
The Senior Technical Support Engineer will provide expert support for Circle's API products, resolve technical issues, manage escalations, improve support processes, and contribute to the team’s goals for customer experience and operational excellence.
Top Skills: AWSConfluenceGCPGoJavaScriptJIRAKibanaObjective-CPHPPostmanPythonSalesforceSoliditySQL
2 Hours Ago
In-Office or Remote
2 Locations
Expert/Leader
Expert/Leader
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
Lead the BOS Integrated Services Hub, focusing on pre-sales, service delivery, and team management in a high-pressure telecom environment.
Top Skills: Ai/MlApi ManagementBssCloud-NativeCobitData AnalyticsItilMicroservice ArchitectureOssSafe

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account