Recraft

Neural Network Optimization Engineer

Posted 14 Days Ago
In-Office
London, Greater London, England
Mid level
The Neural Network Optimization Engineer will enhance neural network performance, latency, and throughput, optimizing inference workflows using advanced techniques and collaborating with ML researchers.
About Us

Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation.

We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 200 countries have produced hundreds of millions of images using Recraft, and we’re just getting started.

Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality.

If you’re passionate about pushing the boundaries of AI, we want you on board!

Job Description

We are seeking an experienced Neural Network Optimization Engineer who will specialize in enhancing the performance, latency, and throughput of neural network inference workflows. The ideal candidate will have substantial hands-on experience optimizing inference workloads using technologies such as TensorRT, Triton language, and model quantization techniques. You will collaborate closely with ML researchers to ensure that our machine learning models run at peak efficiency and reliability in production environments.

Key Responsibilities

  • Optimize neural network models for inference performance and latency reduction.

  • Implement model quantization methods (e.g., INT8, FP8) to maximize computational efficiency.

  • Benchmark, analyze, and improve inference performance on targeted hardware platforms.

  • Collaborate with ML researchers to deploy optimized models in production environments.

  • Stay updated with the latest developments in model optimization, inference engines, quantization methods, and related technologies.

Requirements

  • Proven professional experience optimizing neural network inference workloads.

  • Strong expertise with TensorRT, the Triton language, and CUDA programming.

  • Experience with neural network quantization techniques.

  • Proficiency in Python and PyTorch.

  • Deep understanding of GPU architectures and performance optimization.

  • Excellent problem-solving skills and ability to analyze performance bottlenecks.

What We Offer

  • Competitive salary.

  • We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates.

  • Opportunities for professional growth and development.

  • A collaborative and user-focused work environment.

  • The chance to shape the future of AI-powered creativity through research.

  • Exciting projects where your insights will directly impact product development.

Top Skills

CUDA
Python
PyTorch
TensorRT
Triton Language
HQ

Recraft London, England Office

The Stables Market, Chalk Farm Rd, Chalk Farm, London, United Kingdom, NW1 8AH


