Mistral AI Logo

Mistral AI

Research Engineer, Data Infrastructure

Posted 7 Hours Ago
Be an Early Applicant
Hybrid
London, Greater London, England, GBR
Mid level
Hybrid
London, Greater London, England, GBR
Mid level
The Research Engineer will build and scale data infrastructure, architect multi-cluster orchestration, and enhance storage systems for AI training platforms.
The summary above was generated by AI
About Mistral 
 
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
 
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
 
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
 
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
 
Mistral AI participates in the E-Verify program
 

By applying, you agree to our Applicant Privacy Policy.


Role Summary 
 
Research Engineer, Data Infrastructure
 
The Data Infrastructure team at Mistral AI is architecting the backbone of our frontier model training and fine-tuning ecosystem. We are building the specialized compute and data fabrics required to power the development of world-class AI.
 
Our vision is to operate some of the largest compute fleets in production and build data lakes and metadata systems with a roadmap toward exabyte-scale architecture. We are currently in the process of building a high-performance training platform designed for massive scale across both on-premise and cloud-native Kubernetes environments.
 
We are leading a strategic transition from legacy scheduling to modern orchestration. With numerous clusters distributed across various regions, we are focussed on implementing sophisticated multi-cluster orchestration and cloud-bursting capabilities to better utilize our global resources and ensure our researchers have seamless access to compute wherever it resides. Our mission is to evolve our current systems into a platform that is as durable as it is flexible.
 
Location: Paris / London (hybrid) or remote EU/UK with one hub day per month.
 
About the Role
 
This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems designed for high performance and scalability.

You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production-grade pipelines and participating in on-call rotations for critical training jobs.
 
In this role, you will:
  • Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems
  • Global Orchestration: Architect and maintain multi-cluster orchestration layers to optimize workload placement across diverse hardware and regions.
  • Design Future-Proof Storage: Architect our transition to modern storage formats to handle fine-tuning datasets at a scale that anticipates exabyte growth.
  • Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine-tuning capabilities across Kubernetes and SLURM based environments.
  • Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
  • Operational Excellence: Use modern deployment workflows to manage cloud-native deployments, ensuring our data platform can scale by orders of magnitude while remaining reliable and efficient.
You might thrive in this role if you:
  • Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.

  • Have experience or a strong interest in supporting foundational compute and storage platforms.

  • Are proficient in Python and enjoy solving the "brittle data lake" problem with modern, columnar storage standards.

  • Are well-versed in Kubernetes-native tooling and excited to debug large-scale distributed systems across multi-cluster environments.

  • Take pride in building and operating scalable, reliable, and secure systems from the ground up.

  • Are comfortable with ambiguity and the challenges of building high-scale infrastructure in a rapid-growth AI environment.

 
Benefits
 
France
💰 Competitive cash salary and equity
🥕 Food: Daily lunch vouchers
🥎 Sport: Monthly contribution to a Gympass subscription
🚴 Transportation: Monthly contribution to a mobility pass
🧑‍⚕️ Health: Full health insurance for you and your family
🍼 Parental: Generous parental leave policy
🌎 Visa sponsorship
 
UK
💰 Competitive cash salary and equity
🚑 Insurance
🚴 Transportation: Reimburse office parking charges, or £90 per month for public transport
🥎 Sport: £90 per month reimbursement for gym membership
🥕 Meal voucher: £200 monthly allowance for meals
💰 Pension plan: SmartPension (percentages are 5% Employee & 3% Employer)
 
By applying, you agree to our Applicant Privacy Policy.

Top Skills

Kubernetes
Python
Slurm

Similar Jobs

15 Hours Ago
Hybrid
Senior level
Senior level
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
As a Security Engineer, you will enhance threat detection and response capabilities, build threat models, investigate incidents, and collaborate with teams during security events.
Top Skills: Amazon Web ServicesGoGoogle Cloud PlatformKubernetesLinuxmacOSPythonWindows
15 Hours Ago
Hybrid
Senior level
Senior level
Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
Manage multiple projects, provide client support, lead technology solution delivery, ensure robust technical design, and drive innovative client solutions.
Top Skills: AIAnalyticsBig DataBusiness Intelligence ReportingCloudDigitalMdm
Yesterday
In-Office
London, Greater London, England, GBR
Expert/Leader
Expert/Leader
Information Technology • Software • Financial Services • Big Data Analytics
Quantitative Researchers at Citadel use advanced statistical techniques to develop models and trading strategies, backtesting them in live environments.
Top Skills: C++PythonR

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account