NVIDIA

Senior Machine Learning Applications and Compiler Engineer, LPX

Reposted 18 Days Ago
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
Design and implement compiler and runtime optimizations for large-scale inference, map neural network workloads onto NVIDIA hardware, benchmark and profile performance, prototype new compilation/runtime techniques, and collaborate with hardware and software teams to influence architecture and tools.
The summary above was generated by AI

NVIDIA is seeking engineers to develop algorithms and optimizations for our LPX inference and compiler stack. You will work at the intersection of large-scale systems, compilers, and deep learning, crafting how neural network workloads map onto future NVIDIA platforms. This is your chance to be part of something outstandingly innovative!

 

What you’ll be doing:

  • Build, develop, and maintain high-performance runtime and compiler components, focusing on end-to-end inference optimization.

  • Define and implement mappings of large-scale inference workloads onto NVIDIA’s systems.

  • Extend and integrate with NVIDIA’s SW ecosystem, contributing to libraries, tooling, and interfaces that enable seamless deployment of models across platforms.

  • Benchmark, profile, and monitor key performance and efficiency metrics to ensure the compiler generates efficient mappings of neural network graphs to our inference hardware.

  • Collaborate closely with hardware architects and design teams to feed back software observations, influence future architectures, and co-design features that unlock new performance and efficiency points.

  • Prototype and evaluate new compilation and runtime techniques, including graph transformations, scheduling strategies, and memory/layout optimizations tailored to spatial processors.

  • Publish and present technical work on novel compilation approaches for inference and related spatial accelerators at top-tier ML, compiler, and computer architecture venues.

 

What we need to see:

  • MS or PhD in Computer Science, Electrical/Computer Engineering, or a related field, or equivalent experience, with 5 years of relevant experience.

  • Strong software engineering background with proficiency in systems-level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures, algorithms, and concurrency.

  • Hands-on experience with compiler or runtime development, including IR design, optimization passes, or code generation.

  • Experience with LLVM and/or MLIR, including building custom passes, dialects, or integrations.

  • Familiarity with deep learning frameworks such as TensorFlow and PyTorch, and experience working with portable graph formats such as ONNX.

  • Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial accelerators, or other domain-specific processors.

  • Strong analytical and debugging skills, with experience using profiling, tracing, and benchmarking tools to drive performance improvements.

  • Excellent communication and collaboration skills, with the ability to work across hardware, systems, and software teams.

  • Ideal candidates will have direct experience with MLIR-based compilers or other multi-level IR stacks, especially in the context of graph-based deep learning workloads.

 

Ways to stand out from the crowd:

  • Prior work on spatial or dataflow architectures, including static scheduling, pipeline parallelism, or tensor parallelism at scale.

  • Contributions to open-source ML frameworks, compilers, or runtime systems, particularly in areas related to performance or scalability.

  • Demonstrated research impact, such as publications or presentations at conferences like PLDI, CGO, ASPLOS, ISCA, MICRO, MLSys, NeurIPS, or similar.

  • Experience with large-scale distributed AI inference or training systems, including performance modeling and capacity planning for multi-rack deployments.

 

Top Skills

C
C++
LLVM
MLIR
ONNX
PyTorch
Rust
TensorFlow

NVIDIA London, England Office

13th Floor One Angel Court, London, United Kingdom, EC2R 7HJ
