Flawless (flawlessai.com) Logo

Flawless (flawlessai.com)

Senior / Staff / Principal ML Systems Engineer

Posted 4 Days Ago
Be an Early Applicant
Hybrid
London, Greater London, England, GBR
Senior level
Hybrid
London, Greater London, England, GBR
Senior level
Build and optimize ML infrastructure and data platforms for large-scale multimodal datasets, supporting training, evaluation, lifecycle management, and production inference. Work on data ingestion, distributed training, experiment tracking, model versioning, and scalable serving while improving reliability, performance, and observability. Provide technical leadership at senior levels.
The summary above was generated by AI

"The AI company that's revolutionizing Hollywood"

Flawless is transforming Hollywood with assistive AI. Our tools empower filmmakers to edit, localize, and refine performances while preserving artistic intent.

Designed to support, not replace, artists, our technology expands what is possible on screen and gives creators freedom to tell stories with greater impact and reach audiences in new ways. From enabling seamless multilingual releases to eliminating the need for costly reshoots, Flawless solves critical challenges that slow down productions and limit distribution.

We are also setting the standard for ethical AI in entertainment. Our Artistic Rights Treasury (A.R.T.) is a rights management solution that protects artists and rights holders, ensuring that innovation moves forward with transparency and respect for creative ownership.

What We're Building

Research Services builds the infrastructure that enables scientists to train, evaluate, and deploy models at scale - forming the foundation of Hollywood's AI transformation.

Our team sits at the intersection of large-scale data systems, machine learning, and high-performance computing. We own the full stack, from data ingestion and curation through distributed training and production inference, enabling researchers to move quickly while maintaining reliability and scalability.

This role focuses on building and optimizing systems for large-scale multimodal datasets, including video, embeddings, and metadata, ensuring they are fast, reliable, and production-ready.

The Role

We're looking for experienced ML Systems Engineers to join our Research Services team and help build the infrastructure that powers machine learning across Flawless.

This role is open across multiple levels, from Senior Engineer through Staff Engineer. The level and scope of responsibility will be determined based on your experience, technical depth, leadership impact, and track record of delivery.

As an ML Systems Engineer, you'll work closely with scientists, machine learning engineers, and platform teams to design and build the systems that underpin model development and deployment. You'll contribute hands-on across data platforms, training infrastructure, evaluation systems, model lifecycle management, and production inference.

More senior candidates will be expected to provide technical leadership, drive architectural decisions, mentor other engineers, and influence infrastructure strategy across multiple initiatives.

What You'll DoData Platforms for Machine Learning
  • Build and evolve data platforms used to curate and manage large-scale multimodal datasets.

  • Design systems that index, process, and enrich thousands of videos through machine learning pipelines.

  • Optimize data storage and access patterns for efficient model training and experimentation.

  • Improve reliability, scalability, and observability across the data ecosystem.

ML Training Infrastructure
  • Build and optimize infrastructure for large-scale model training.

  • Improve performance across single-node and distributed training environments.

  • Scale data loading, preprocessing, and training workflows.

  • Ensure training pipelines are reproducible, efficient, and easy to operate.

Evaluation & Experimentation Systems
  • Develop systems for collecting, storing, and analyzing model outputs.

  • Build tooling for dataset exploration, experiment tracking, and model comparison.

  • Enable scientists to iterate rapidly while maintaining robust evaluation practices.

Model Lifecycle Management
  • Design and maintain infrastructure for model versioning, experimentation, validation, and deployment.

  • Improve reproducibility and governance across the machine learning lifecycle.

  • Support the promotion of models from research through production.

Production Inference Systems
  • Build and optimize inference infrastructure for production workloads.

  • Define and improve model serving protocols and deployment patterns.

  • Enhance performance, reliability, and scalability of production inference systems.

What We're Looking For

We're interested in engineers who enjoy building systems that make machine learning teams more effective and productive.

We're particularly interested in candidates with:
  • Experience building machine learning infrastructure, ML platforms, data platforms, or large-scale backend systems.

  • Strong Python engineering skills and experience building production services.

  • Deep understanding of data pipelines and performance trade-offs across storage, networking, memory, and compute.

  • Hands-on experience working with machine learning frameworks such as PyTorch.

  • Experience building and operating distributed systems.

  • Experience working with large-scale datasets and high-throughput data processing pipelines.

  • Familiarity with modern data storage and analytics technologies, including columnar data formats and data lake architectures.

  • Strong debugging, problem-solving, and systems design skills.

  • Experience collaborating effectively with cross-functional teams.

Additional Expectations for Staff Engineers
  • Demonstrated technical leadership across significant infrastructure initiatives.

  • Experience defining architecture and technical strategy for complex systems.

  • Ability to influence engineering direction beyond an individual team.

  • Track record of mentoring engineers and raising technical standards.

  • Experience balancing immediate research needs with long-term platform investments.

Nice to Have
  • Experience working with video, media, or multimodal machine learning pipelines.

  • Familiarity with embeddings, vector search, or retrieval systems.

  • Experience operating production inference systems.

  • Frontend experience (React or similar) for building internal tools and workflows.

Why work at Flawless?

You will be working in an environment based on trust, autonomy and collaboration, and this is a great opportunity for someone who wants to be part of a growing company in its most exciting stage of development. You can play a part in shaping the future of a company that’s caring, creative and collaborative.

In addition to this, you'll also receive: 

- Autonomy

- A hybrid working environment

- Competitive Salary

- All permanent employees receive generous stock options

I don’t meet all the listed requirements—should I still apply?

Absolutely! Research shows that women and underrepresented groups often hesitate to apply unless they meet every qualification, but at Flawless, we actively work to break down those barriers. We believe diverse perspectives, experiences, and backgrounds make us stronger, and we are committed to supporting and elevating underrepresented talent. If you're excited about the role, share our values, and believe you can contribute meaningfully, we encourage you to apply—even if you don’t meet every single requirement. Your unique skills and perspective matter, and we’d love to hear from you ❤️

Similar Jobs

2 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
As a Manager in Oracle HCM, you'll help clients optimize HR processes by implementing Oracle solutions, leading teams, and ensuring project success through effective problem-solving and innovation.
Top Skills: Cc&BEbsFusionHyperionOracle ApplicationsOracle Hcm CloudPeoplesoftRiceSiebel
4 Hours Ago
Hybrid
London, England, GBR
Senior level
Senior level
Fintech • Mobile • Payments • Software • Financial Services
Lead and grow a Database Reliability Engineering team, coach and mentor engineers, contribute hands-on as a senior engineer, collaborate with cross-functional partners, drive platform automation and cloud infrastructure (AWS/GCP), ensure database scalability, high availability and security, own vendor negotiations, and foster operational excellence.
Top Skills: AnsibleAWSAws Step FunctionsCadenceDockerGCPGithub ActionsJavaKubernetesNoSQLPackerPythonSQLTemporalTerraform
4 Hours Ago
Remote or Hybrid
London, Greater London, England, GBR
Mid level
Mid level
Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Customer-facing consultant delivering Nexthink implementations and advisory services. Lead use-case design, configure products, build dashboards/alerts/automation, integrate via scripting/APIs, analyze endpoint data, troubleshoot, and drive adoption to achieve measurable business outcomes while maintaining trusted-advisor relationships.
Top Skills: BashJSONLdapmacOSWindowsNexthinkPower BIPowershellPythonRest ApiSQLSsoTableau

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account