Jane Street

Machine Learning Performance Engineer

Reposted 5 Days Ago
In-Office
London, Greater London, England
Mid level
The role involves optimizing ML model performance, focusing on efficient training, low-latency inference, and understanding low-level GPU and networking technologies.
The summary above was generated by AI

We are looking for an engineer with experience in low-level systems programming and optimisation to join our growing ML team. 

Machine learning is a critical pillar of Jane Street's global business. Our ever-evolving trading environment serves as a unique, rapid-feedback platform for ML experimentation, allowing us to incorporate new ideas with relatively little friction.

Your part here is optimising the performance of our models – both training and inference. We care about efficient large-scale training, low-latency inference in real-time systems and high-throughput inference in research. Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems approach, including storage systems, networking and host- and GPU-level considerations. Zooming in, we also want to ensure our platform makes sense even at the lowest level – is all that throughput actually goodput? Does loading that vector from the L2 cache really take that long?

If you’ve never thought about a career in finance, you’re in good company. Many of us were in the same position before working here. If you have a curious mind and a passion for solving interesting problems, we have a feeling you’ll fit right in. 

There’s no fixed set of skills, but here are some of the things we’re looking for:

  • An understanding of modern ML techniques and toolsets
  • The experience and systems knowledge required to debug a training run’s performance end to end
  • Low-level GPU knowledge of PTX, SASS, warps, cooperative groups, Tensor Cores and the memory hierarchy
  • Debugging and optimisation experience using tools like CUDA GDB, Nsight Systems and Nsight Compute
  • Library knowledge of Triton, CUTLASS, CUB, Thrust, cuDNN and cuBLAS
  • Intuition about the latency and throughput characteristics of CUDA graph launch, Tensor Core arithmetic, warp-level synchronisation and asynchronous memory loads
  • Background in InfiniBand, RoCE, GPUDirect, PXN, rail optimisation and NVLink, and how to use these networking technologies to link up GPU clusters
  • An understanding of the collective algorithms supporting distributed GPU training in NCCL or MPI
  • An inventive approach and the willingness to ask hard questions about whether we're taking the right approaches and using the right tools
  • Fluency in English

If you're a recruiting agency and want to partner with us, please reach out to [email protected].

Top Skills

CUB
cuBLAS
CUDA
cuDNN
CUTLASS
GPU
GPUDirect
InfiniBand
MPI
NCCL
NVLink
PTX
PXN
RoCE
SASS
Tensor Cores
Thrust
Triton

Jane Street London, England Office

2 & A Half, Devonshire Square, London, United Kingdom, EC2M 4UJ


