As a Model Behavior Architect, you will improve LLM system behavior, collaborate on AI models, and design evaluation frameworks.
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
Mistral AI participates in the E-Verify program
About the role
As a Model Behavior Architect, you are at the forefront of defining and measuring LLM behaviour.
We are looking for people who have built a career in engineering, machine learning, and large language models and are experts in model evaluation, policy writing, and creating eval pipelines for complicated tasks. Your role is to work hand-in-hand with our Science team to define what ‘good’ looks like for Reasoning, Audio, Alignment, Tools, and all Frontier bets.
Join us if you are passionate about tackling cutting-edge, open-ended research challenges and transforming your insights into best-in-class models.
What you will do
- Interact with models to identify where model behavior can be improved
- Gather internal and external feedback on model behavior to scope areas for improvement
- Design and implement evals, data guidelines, data generation, and synthetic testing environments
- Identify and fix edge case behaviors through rigorous testing
- Develop robust evaluation pipelines for our model candidates
- Work collaboratively with AI Scientists
About you
- You have a deep understanding of either 1) linguistics, language, and translation, 2) engineering and code behavior, 3) LLM agents at work, including reasoning and tool use
- You have prior knowledge in training and optimising model behaviour
- You are an expert at building robust evaluations
- You thrive in dynamic and technically complex environments
- You have a track record of delivering innovative, out-of-the-box solutions to address real-world constraints
Top Skills
Ai Systems
Model Evaluation
Prompt Engineering
Similar Jobs
Fintech • Payments • Financial Services
As an AP Specialist, you'll manage vendor invoices, streamline workflows, maintain records, support month-end close, and assist in finance projects.
Top Skills:
NetSuiteRamp
Cloud • Information Technology • Machine Learning
The Data Center Lease Administration Manager oversees lease administration, compliance, and governance within a global data center portfolio, ensuring accuracy and adherence to standards. This role involves managing lease records, financial obligations, and working with various internal teams.
Top Skills:
Asc 842CostarSox
An Hour Ago
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The Vice President, Sales Specialist Leader will lead international credit risk sales teams, defining strategic sales direction and driving enterprise growth through consultative engagement and collaboration with various stakeholders.
Top Skills:
Ai-Driven Risk AnalyticsAnalytic ConsultingCredit Decisioning PlatformsCredit Risk ModelingRegulatory ComplianceScoring
What you need to know about the London Tech Scene
London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.


