Spotify

Senior Data Engineer

Posted 20 Days Ago
Remote
Hybrid
Hiring Remotely in London, Greater London, England
Senior level
As a Senior Data Engineer, you will build large-scale speech and audio data pipelines, work on machine learning projects for generative AI, and collaborate with engineers and stakeholders to deliver maintainable code. Additionally, you will ensure high-quality datasets for training machine learning models and promote best practices within the team.
The summary above was generated by AI

The Personalization team makes deciding what to play next easier and more enjoyable for every listener. From Blend to Discover Weekly, we’re behind some of Spotify’s most-loved features. We built them by understanding the world of music and podcasts better than anyone else. Join us and you’ll keep millions of users listening by making great recommendations to each and every one of them. We ask that our team members be physically located in Central European time or Eastern Standard/Daylight time zones for the purposes of our collaboration hours.


The Speak team is Spotify's in-house text-to-speech (TTS) team, supporting products like DJ and AI Voice Translation, as well as the development of exciting new unreleased products. We focus on building world-class speech technologies that can power the next generation of personalized generative voice products at scale.

What You'll Do

  • Build large-scale speech and audio data pipelines using Google Cloud Platform and frameworks like Apache Beam
  • Work on machine learning projects powering new generative AI experiences and helping to build state-of-the-art text-to-speech models
  • Learn and contribute to the team's best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying and labelling
  • Collaborate with other engineers, researchers, product managers and stakeholders, taking on learning and leadership opportunities that arise
  • Deliver scalable, testable, maintainable, and high-quality code
  • Share knowledge and promote standard methodologies, making your team the best version of itself through mentorship and constructive accountability

Who You Are

  • You have Data Engineering experience and you know how to work with high-volume, heterogeneous data, preferably with distributed systems such as Hadoop, BigTable, Cassandra, GCP, AWS
  • You have experience building clean, high-quality datasets for training large-scale machine learning models; a focus on audio data is preferred
  • You have experience with one or more higher-level Python- or Java-based data processing frameworks such as Beam, Dataflow, Crunch, Scalding, Storm, Spark, etc.
  • You have strong Python programming abilities. You might have worked with Docker as well as Luigi, Airflow, or similar tools
  • You care about quality and you know what it means to ship high-quality code
  • You have experience managing data retention policies
  • You care about agile software processes, data-driven development, reliability, and responsible experimentation
  • You understand the value of collaboration and partnership within teams
  • You have experience in developing datasets tailored for training high-performance machine learning models
  • Familiarity with generative models or audio-based machine learning applications is highly desirable
  • You are proficient in cleaning, filtering, and evaluating dataset quality, leveraging both pre-trained and in-house machine learning models, as well as human evaluation techniques, to ensure optimal quality

Where You'll Be

  • We offer you the flexibility to work where you work best! For this role, you can be within the UK region as long as we have a work location.
  • This team operates within the GMT time zone for collaboration.

Spotify is an equal opportunity employer. You are welcome at Spotify for who you are, no matter where you come from, what you look like, or what’s playing in your headphones. Our platform is for everyone, and so is our workplace. The more voices we have represented and amplified in our business, the more we will all thrive, contribute, and be forward-thinking! So bring us your personal experience, your perspectives, and your background. It’s in our differences that we will find the power to keep revolutionizing the way the world listens.


Spotify transformed music listening forever when we launched in 2008. Our mission is to unlock the potential of human creativity by giving a million creative artists the opportunity to live off their art and billions of fans the chance to enjoy and be passionate about these creators. Everything we do is driven by our love for music and podcasting. Today, we are the world’s most popular audio streaming subscription service.



Top Skills

Java
Python


