NVIDIA Logo

NVIDIA

Senior Deep Learning Engineer, Visual Generative AI

Reposted Yesterday
Be an Early Applicant
In-Office or Remote
6 Locations
Senior level
In-Office or Remote
6 Locations
Senior level
The role involves optimizing and deploying deep learning models, focusing on Diffusion and Vision-Language Models, on NVIDIA GPU platforms.
The summary above was generated by AI

We are looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms Engineer with experience optimizing and deploying Deep Learning models focusing on Diffusion Models and Vision-Language Models (VLMs) in production environments. In this role, you will focus on optimizing and deploying deep learning models for efficient and fast inference across diverse GPU platforms, particularly for Visual Generative AI applications. 

Join the team building software used by the entire world. Work with world class research scientists, software engineers, and hardware specialists to bring cutting-edge AI models from prototype to production. 

What you will be doing:

  • Optimize deep learning models for low-latency, high-throughput inference, with a focus on Diffusion models for Visual Generative AI applications.

  • Convert, deploy, and optimize models for efficient inference using frameworks such as TensorRT, TensorRT-LLM, and vLLM.

  • Understand, analyze, profile, and optimize performance of deep learning workloads on state-of-the-art NVIDIA GPU hardware and software platforms.

  • Collaborate with internal and partner research scientists and software engineers to ensure seamless integration of cutting-edge AI models from training to deployment.

  • Contribute to the development of automation and tooling for NVIDIA Inference Microservices (NIMs) and inference optimization, including creating automated benchmarks to track performance regressions.

What we need to see:

  • 3+ years of experience in DL model implementation and SW Development.

  • BSc, MS or PhD degree in Computer Science, Computer Architecture or related technical field.

  • Extensive knowledge of at least one DL Framework (PyTorch, JAX, TensorFlow) with practical experience in PyTorch required.

  • Deep understanding of transformer architectures, attention mechanisms, Visual Generative AI foundational models architectures (e.g., U-Net, DiT) and inference bottlenecks.

  • Excellent Python programming skills.

  • Strong problem solving and analytical skills.

  • Algorithms and DL fundamentals.

  • Docker containerization fundamentals.

Ways to stand out from the crowd:

  • Experience in performance measurements and profiling.

  • Hands-on experience with model optimization and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang, and ONNX.

  • Deep understanding of distributed systems for large-scale model inference and serving.

  • Experience with extending and leveraging open-source tools for Visual Generative AI workflow creation.

  • Familiarity with the latest trends in Visual Generative AI for content creation.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and forward-thinking people in the world working for us. If you're creative and autonomous, we want to hear from you! We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

 

Top Skills

Deep Learning
Diffusion Models
Docker
Jax
Nvidia Gpus
PyTorch
TensorFlow
Tensorrt
Vision-Language Models

NVIDIA London, England Office

13th Floor One Angel Court, London, United Kingdom, EC2R 7HJ

Similar Jobs

An Hour Ago
Remote
United Kingdom
Mid level
Mid level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
The role involves outbound prospecting, meeting quotas, building sales pipelines, and collaborating with various teams while ensuring a strong customer experience.
Top Skills: GongLinkedin NavigatorOutreachSFDC
5 Hours Ago
Remote
UK
Senior level
Senior level
Software
The Senior Renewal Representative at Postman executes the subscription renewal lifecycle, achieves revenue targets, manages pipelines, and collaborates with sales on upsell opportunities, requiring strong communication and a solid background in SaaS renewals.
Top Skills: CRM
5 Hours Ago
Remote or Hybrid
3 Locations
Senior level
Senior level
Software
As a Senior Full Stack Engineer, you will develop AI-powered tools and enhance Postman's API Network, focusing on both frontend and backend improvements.
Top Skills: JavaScriptNode.jsReactTypescript

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account