Mistral AI
Applied Scientist / Research Engineer - Edge Devices and Quantization - EMEA
Be an Early Applicant
Mistral AI seeks Applied Scientists/Research Engineers to develop efficient models for edge device deployment, focusing on quantization and model optimization for on-device inference.
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.
We are a dynamic, collaborative team passionate about AI and its potential to transform society.
Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
About The Job
Mistral AI is seeking Applied Scientists and Research Engineers focused on model efficiency and edge deployment. You will research and build ultra‑efficient models and toolchains for on‑device inference across CPUs, GPUs, NPUs, and specialized accelerators. Your work will enable Mistral models to run privately, reliably, and fast on mobile, desktop, and embedded devices.
What you will do
• Run pre-training, post-training and deploy state of the art models on clusters with thousands of GPUs. You don’t panic when you see OOM errors or when NCCL feels like not wanting to talk.
• Design and evaluate quantization, pruning, distillation, and sparsity methods for LLMs and multimodal models.
• Build deployment stacks, optimize kernels and memory layouts.
• Run large‑scale experiments to balance accuracy, latency, throughput, and power under tight memory constraints; profile and fix bandwidth/compute bottlenecks.
• Develop tooling for calibration data generation, mixed‑precision training, quant‑aware finetuning, structured/unstructured sparsity, and compilation passes.
• Manage research projects and communications with client research teams.
About you
• You are fluent in English, and have excellent communication skills. You are at ease explaining complex technical concepts to both technical and non-technical audiences.
• You’re not afraid of contributing to a big codebase and can find yourself around independently with little guidance.
• You’ve a deep understanding of quantization trade‑offs, hardware constraints, and compiler stacks
• You’re expert with PyTorch or JAX; strong C++/CUDA or low‑level performance skills a plus; production‑grade Python.
• You don’t need roadmaps: you just do. You don’t need a manager: you just ship.
• Low-ego, collaborative and eager to learn.
• You have a track record of success through personal projects, professional projects or in academia.
It would be great if you
• Hold a PhD / master in a relevant field (e.g., Mathematics, Physics, Machine Learning), but if you’re an exceptional candidate from a different background, you should apply.
• Have contributed to a large codebase used by many (open source or in the industry).
• Have a track record of publications in top academic journals or conferences.
• Contributions to open‑source inference/compilers stacks.
• Love improving existing code by fixing typing issues, adding tests and improving CI pipelines.
• Have experience optimizing inference on edge devices
Benefits
We have local offices in Paris, London, Marseille, Singapore and Palo Alto.
France
💰 Competitive cash salary and equity
🥕 Food : Daily lunch vouchers
🥎 Sport : Monthly contribution to a Gympass subscription
🚴 Transportation : Monthly contribution to a mobility pass
🧑⚕️ Health : Full health insurance for you and your family
🍼 Parental : Generous parental leave policy
🌎 Visa sponsorship
UK
💰 Competitive cash salary and equity
🚑 Insurance
🚴 Transportation: Reimburse office parking charges, or 90GBP/month for public transport
🥎 Sport: 90GBP/month reimbursement for gym membership
🥕 Meal voucher: £200 monthly allowance for its meals
💰 Pension plan: SmartPension (percentages are 5% Employee & 3% Employer)
Top Skills
AI
C++
Cuda
Jax
Python
PyTorch
Similar Jobs
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
The RF Front-End Engineer role focuses on the design, qualification, and integration of high-frequency RF systems for satellites, emphasizing GaN and GaAs technologies, collaboration, and manufacturing processes.
Top Skills:
AdsAltiumCstGaasGanHfssKa-BandMatlabPcbRf Front-End
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
Design and validate advanced digital electronics for MEO satellite platforms, focusing on high-speed PCB design, subsystem integration, and compliance with space standards.
Top Skills:
Altium DesignerCadence AllegroDdrDifferential Pair RoutingHigh-Speed Digital Pcb DesignImpedance ControlMentor GraphicsPcieSerdesSignal Integrity
Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
Lead the development of Ka-band Direct Radiating Antenna subsystems for satellite payloads, overseeing integration, compliance, and project management. Requires extensive experience in electronic hardware design and space applications.
Top Skills:
AsicsDigital BeamformingElectronic HardwareKa-Band SystemsRf ComponentsThermal Design
What you need to know about the London Tech Scene
London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

