DeepL

(Senior) Staff Research Scientist | Voice

Hybrid

London, Greater London, England, GBR

Senior level

The role involves leading research in speech and translation models, developing ASR and TTS systems, and integrating machine learning into production. Responsibilities include model training, experimentation, optimization, and mentoring team members.

Meet DeepL

DeepL is a global AI product and research company focused on building secure, intelligent solutions to complex business problems. Over 200,000 business customers and millions of individuals across 228 global markets today trust DeepL's Language AI platform for human-like translation, improved writing and real-time voice translation.
Founded in 2017 by CEO Jaroslaw “Jarek” Kutylowski, DeepL now has over 1,000 passionate employees and is supported by world-renowned investors including Benchmark, IVP, and Index Ventures.

Our goal is to become the global leader in trusted, intelligent AI technology, building products that drive better communication, foster connections, and create a meaningful impact. To achieve this, we need talented people like you to join our journey. If you’re ready to shape the future of AI and grow your career in a fast-moving, purpose-driven environment, DeepL is your next destination.

What sets us apart

What sets us apart is our blend of cutting-edge AI technology, meaningful work, and a culture where people truly thrive. We’re a team of innovators, researchers, and creators driven by a shared purpose to unlock human potential by making work simpler, smarter, and more connected.

When we share what it’s like to work at DeepL, the reactions are overwhelmingly positive. This might be because of our technology that helps millions of people and businesses communicate and work better every day, or because of the trust, curiosity, and care that shape our culture.

What we know for sure is this: being part of DeepL means joining a team dedicated to innovation, growth, and well-being. Discover more about life at DeepL on LinkedIn, Instagram, and our Blog.

Meet the Team

DeepL Voice is defining the future of real-time multilingual communication. Built on the foundation of DeepL’s world-leading translation quality, the Voice Track brings together cross-functional teams across Research, Engineering, and Product to push the boundaries of what is possible in speech-to-speech interpretation.

In just two years, we have launched our first customer-facing features and built reliable real-time systems for meetings, live communication, and developer integrations. We are now scaling up our research efforts to invent the next generation of audio, speech, and multimodal translation technologies.

What you will be doing at DeepL Voice

We’re looking for a Senior Staff Research Scientist to lead scientific innovation across our speech and translation models. This is a high-impact, hands-on role for a research leader who can define long-term scientific strategy, prototype rapidly, run large-scale experiments, and drive breakthroughs all the way into production.

You will work across ASR, MT, TTS, streaming inference, and large speech models—leading both cascaded and emerging end-to-end internationalized speech-translation approaches.

Your responsibilities

Lead hands-on research and development across ASR, MT, TTS, and speech-to-speech translation for real-time voice products.
Design, train, and optimize large-scale ASR models for multilingual accuracy, robustness, and ultra-low-latency streaming.
Improve cascaded translation pipelines end to end: segmentation, ASR→MT interfaces, streaming MT inference, and incremental decoding.
Develop and refine real-time TTS models with natural prosody, stable speaker characteristics, and fast inference.
Build and experiment with end-to-end and LLM-based speech-to-speech translation systems, including streaming and one-shot approaches.
Own the full lifecycle of model delivery: prototyping, ablations, training, evaluation, optimization, and production deployment.
Work closely with engineering teams to integrate models into real-time systems, ensuring reliability, uptime, and quality at scale.
Drive improvements in inference efficiency, model serving, voice UX, and robustness to real-world acoustic conditions.
Establish strong practices for evaluation, reproducibility, monitoring, and continuous model improvement in production.
Mentor researchers and engineers, promote hands-on collaboration, and raise the bar for model quality and operational excellence.

What we’re looking for

Deep expertise in speech, audio, or multilingual ML, particularly in ASR, MT, TTS, end-to-end ST, or large speech models.
A hands-on builder who enjoys training models, running experiments, debugging pipelines, and integrating ML systems into production.
Strong understanding of real-time streaming constraints and how to design models that operate reliably at low latency.
Experience shipping ML models to production, maintaining them at scale, and working with engineers on deployment, monitoring, and serving.
Ability to lead complex research efforts while staying grounded in product impact, user experience, and real-world performance.
Strong coding and experimentation skills (Python, PyTorch/JAX, audio processing libraries).
Ability to communicate clearly, collaborate across teams, and align research work with product and engineering priorities.
Proven experience mentoring others and elevating technical quality across a fast-moving, applied research team.

We are an equal opportunity employer

You are welcome at DeepL for who you are - we appreciate authenticity here. Our product is for everyone, and so is our workplace. The more voices we have represented and amplified in our business, the more we will all succeed, contribute, and think forward! So bring us your personal experience, your perspectives, and your background. It’s in our diversity that we will find the power to break down language barriers in the world.

Top Skills

ASR

JAX

Python

PyTorch

Speech-to-Speech Translation

TTS
