Design, implement, and deploy ML inference engines on custom FPGA/ASIC hardware. Perform HW/SW co-design with traders and researchers, optimize neural networks for low latency, and build quantization/compression tools that translate models from ML frameworks into RTL for rapid production deployment.
We are deploying machine learning directly onto custom hardware - and we want you to help drive it from the ground up. This is an initiative where you'll have the rare opportunity to architect solutions from scratch, influence technical research direction, and see your work drive real impact in one of the most demanding computing environments in the world.
We build the hardware, the software, and the infrastructure, so when you hit a bottleneck, you can fix it - there's no vendor to wait on and no abstraction layer you're not allowed to touch. If you've ever wanted to push the boundaries of what's computationally possible, this role is for you. We're looking for researchers and experienced engineers from any background. Trading experience is a bonus, not a prerequisite.
Your Core Responsibilities
- Architect and co-design ML models with traders, quant researchers, and software engineers, treating hardware constraints (latency budgets, resource limits, numerical precision) as first-class design inputs
- Shape our custom hardware roadmap by translating ML model requirements into concrete architectural decisions
- Work hands-on with hardware engineers to implement, verify, and deploy ML inference solutions from proof-of-concept through production
- Track and evaluate emerging research in neural architecture search, machine learning systems, and quantization methods, and determine what translates to measurable improvements in our systems
Your Skills and Experience
- Solid understanding of hardware constraints and design trade-offs (e.g., pipelining, resource utilization, fixed-point arithmetic) that shape how ML models can be efficiently mapped onto FPGAs or custom ASICs
- Experience with hardware fundamentals, whether through VHDL/SystemVerilog development, HLS tools, or ML-to-hardware frameworks like hls4ml, FINN, or Vitis AI
- Understanding of machine learning fundamentals: neural network architectures, inference optimization, quantization techniques, and ML frameworks such as PyTorch and TensorFlow
- Proficiency in Python, C++, or similar languages for tooling, testing, and simulation
- Strong communication skills and ability to work collaboratively across disciplines with both technical and non-technical teams
Nice to Have
- Exposure to ML compiler infrastructure such as MLIR, TVM, XLA, or similar tools for lowering and optimizing models for hardware targets
- Background in latency-sensitive or resource-constrained systems including high-frequency trading, particle physics data acquisition, real-time signal processing, or similar domains
- Familiarity with functional verification methodologies and tools (for example SystemVerilog, UVM, cocotb)
- Advanced degree (MS or PhD) in EE, CS, Physics, or a related field, or equivalent depth through industry or research experience
IMC Trading London, England Office
London, United Kingdom