Cantina Labs Jobs

Machine Learning Enginer, Core Evaluations

Cantina Labs

Machine Learning Enginer, Core Evaluations

Posted 24 Days Ago

Be an Early Applicant

In-Office or Remote

Hiring Remotely in London, Greater London, England, GBR

Mid level

In-Office or Remote

Hiring Remotely in London, Greater London, England, GBR

Mid level

The role involves designing and developing model evaluation pipelines, user studies for subjective evaluations, and automated dashboards to report results, alongside leading the evaluation team and communicating with other teams to improve model performance.

The summary above was generated by AI

About Cantina:

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the Role:

We are seeking an experienced Machine Learning Engineer (MLE) to focus on audio model evaluation, specifically for speech generation and recognition models.

This role involves designing and developing comprehensive model evaluation pipelines for both development and production environments, as well as creating automated dashboards for reporting evaluation results.

As the founding member of our evaluation team, the ideal candidate is expected to leverage their experience to lead our evaluation efforts and play a key role in the future growth of the evaluation team.

What You’ll Do:

Designing model evaluation pipelines for models in development and production
Designing user studies for subjective model evaluations.
Converting requirements into measurable metrics.
Designing and developing automated evaluation dashboard to see model performances and compare results.
Training new models to capture new and different evaluation metrics.
Communicating with the model team to help design better models based on the evaluation results.
Communicating with the data team to help decide the type of data necessary to improve model performance.
Communication with the product-manager to make sure product requirements are correctly measured.
Help grow the evaluation team as the founding member.
Lead the evaluation team in the future.

What You’ll Bring:

Strong experience and intuition for designing metrics that capture model performance.
Strong experience with designing user studies on Mechanical Turk or similar platforms. .
Strong experience with model training and fine-tuning for model evaluation.
Strong statistical knowledge and experience to statistically compare evaluation results and take decisions.
Very strong engineering and programming skills.
Experience with training ASR, TTS models.
Experience at ML teams working on large-scale machine learning problems. (>3B models with >1m hours of data)

Similar Jobs

Circle

Senior Counsel

4 Hours Ago

In-Office or Remote

London, Greater London, England, GBR

Senior level

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3

Provide strategic legal support for the Arc blockchain, advising on design, regulatory compliance, and risk management while collaborating with cross-functional teams.

Top Skills: BlockchainDigital AssetsSmart ContractsWeb3

Mondelēz International

Mgr, Global Demand Insights

9 Hours Ago

Remote or Hybrid

United Kingdom

Senior level

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing

The Manager, Global Demand Insights will drive consumer-centric growth through data analysis and methodology evolution, collaborating with global teams to enhance business strategies and innovations.

Top Skills: Ai-Powered AnalyticsExcelPower BIPowerPointTableau

NBCUniversal

Senior Commercial Finance Analyst

13 Hours Ago

Remote or Hybrid

London, Greater London, England, GBR

Senior level

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development

This role involves supporting investment decisions in TV production, managing financial analysis, and collaborating with various stakeholders in Commercial Finance.

Top Skills: BpcExcelPowerPointSAPWord

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.