NVIDIA Logo

NVIDIA

Senior Machine Learning Applications and Compiler Engineer

Reposted 14 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
Design and implement compiler and runtime optimizations for large-scale inference, map neural network workloads onto NVIDIA hardware, benchmark and profile performance, prototype new compilation/runtime techniques, and collaborate with hardware and software teams to influence architecture and tools.
The summary above was generated by AI

NVIDIA is seeking engineers to develop algorithms and optimizations for our inference and compiler stack. You will work at the intersection of large-scale systems, compilers, and deep learning, crafting how neural network workloads map onto future NVIDIA platforms. This is your chance to be part of something outstandingly innovative!

 

What you’ll be doing:

  • Build, develop, and maintain high-performance runtime and compiler components, focusing on end-to-end inference optimization.

  • Define and implement mappings of large-scale inference workloads onto NVIDIA’s systems.

  • Extend and integrate with NVIDIA’s SW ecosystem, contributing to libraries, tooling, and interfaces that enable seamless deployment of models across platforms.

  • Benchmark, profile, and monitor key performance and efficiency metrics to ensure the compiler generates efficient mappings of neural network graphs to our inference hardware.

  • Collaborate closely with hardware architects and design teams to feedback software observations, influence future architectures, and codesign features that unlock new performance and efficiency points.

  • Prototype and evaluate new compilation and runtime techniques, including graph transformations, scheduling strategies, and memory/layout optimizations tailored to spatial processors.

  • Publish and present technical work on novel compilation approaches for inference and related spatial accelerators at top tier ML, compiler, and computer architecture venues.

 

What we need to see:

  • MS or PhD in Computer Science, Electrical/Computer Engineering, or related field, or equivalent experience, with 5 years of relevant experience.

  • Strong software engineering background with proficiency in systems level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures, algorithms, and concurrency.

  • Hands on experience with compiler or runtime development, including IR design, optimization passes, or code generation.

  • Experience with LLVM and/or MLIR, including building custom passes, dialects, or integrations.

  • Familiarity with deep learning frameworks such as TensorFlow and PyTorch, and experience working with portable graph formats such as ONNX.

  • Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial accelerators, or other domain specific processors.

  • Strong analytical and debugging skills, with experience using profiling, tracing, and benchmarking tools to drive performance improvements.

  • Excellent communication and collaboration skills, with the ability to work across hardware, systems, and software teams.

  • Ideal candidates will have direct experience with MLIR based compilers or other multilevel IR stacks, especially in the context of graph based deep learning workloads.

 

Ways to stand out from the crowd:

  • Prior work on spatial or dataflow architectures, including static scheduling, pipeline parallelism, or tensor parallelism at scale.

  • Contributions to opensource ML frameworks, compilers, or runtime systems, particularly in areas related to performance or scalability.

  • Demonstrated research impact, such as publications or presentations at conferences like PLDI, CGO, ASPLOS, ISCA, MICRO, MLSys, NeurIPS, or similar.

  • Experience with large-scale AI distributed inference or training systems, including performance modeling and capacity planning for multi rack deployments.

 

#LI-Hybrid

Top Skills

C
C++
Llvm
Mlir
Onnx
PyTorch
Rust
TensorFlow

NVIDIA London, England Office

13th Floor One Angel Court, London, United Kingdom, EC2R 7HJ

Similar Jobs

57 Minutes Ago
Easy Apply
In-Office or Remote
8 Locations
Easy Apply
Entry level
Entry level
Greentech • Hardware • Internet of Things • Machine Learning • Software • Business Intelligence • Agriculture
Halter seeks expressions of interest for various roles across teams like Engineering, Product, Hardware, Sales, and Support. Applicants should be passionate about impactful work and problem-solving. A cover letter is required to express interest and qualifications.
3 Hours Ago
Remote or Hybrid
7 Locations
Internship
Internship
Automotive • eCommerce • Hardware • Music • Retail • Software • Wearables
Work on prototyping and implementing audio ML algorithms for real-time applications. Present findings and integrate solutions into existing platforms. Collaborate with experts in ML and DSP.
Top Skills: CC++MatlabOnnxPythonPyTorchTensorFlowTflite
3 Hours Ago
Easy Apply
Remote or Hybrid
United Kingdom
Easy Apply
Junior
Junior
AdTech • Cloud • Marketing Tech • Productivity • Software • Analytics • Automation
Serve as a technical advisor in pre-sales: learn Acquia products, gather customer requirements, design and estimate Drupal/cloud solutions, create proposals and presentations, support sales handoff, and influence product roadmap.
Top Skills: Acquia Cloud PlatformAi ToolsCloud ArchitecturesDrupalSaaS

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account