Own and evolve the platform infrastructure for deployed ML and NLP systems: deploy and monitor models via Azure ML and AKS, build orchestration pipelines (Dagster), ensure observability and Responsible AI/GDPR compliance, and support scaling across multi-tenant local authority clients while advising across the organisation.
In this role you will work in the Platform team – a function for the deployment and evolution of the backend platform that underpins the core of the Xantura business.
Key Responsibilities
As an MLOps Engineer, you'll own the infrastructure layer that all of Xantura's AI services depend on to operate and scale. You'll be responsible for how ML models and NLP systems are deployed, monitored, and maintained across a growing base of local authority clients, ensuring that what works for one client works reliably for a hundred.
You'll work across every pillar of the AI function - predictive modelling, text analytics, knowledge representation, and agentic AI - building the deployment pipelines, orchestration, and observability that enable the rest of the engineering team to ship with confidence.
Key responsibilities
- Continuously evolve the platform infrastructure powering all AI services (predictive modelling, NLP, knowledge representation, and agentic AI), ensuring reliable, scalable operation across a growing base of local authority clients.
- Deploy and manage ML models via Azure ML endpoints, batch endpoints , and AKS, enabling resilient, secure model hosting that accelerates client onboarding and ensures models remain performant and monitorable throughout their lifecycle.
- Ensure all ML systems are transparent, explainable, and auditable, aligned with Responsible AI principles and UK GDPR; essential where AI outputs inform decisions about vulnerable people in health and social care.
- Design, build, and maintain production-grade orchestration pipelines (Dagster) supporting model training, inference, and retraining, ensuring data from local authority systems is timely, accurate, and fit for purpose before it reaches ML services.
- Contribute to organisation-wide AI capability building, sharing best practice with delivery and consulting teams, advising on technical feasibility, and shaping governance standards as the AI function scales.
What are we looking for?
We’d love to hear from you if you have:
- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field, or equivalent practical experience.
- 4+ years of professional experience in an MLOps, Platform Engineering, or Infrastructure Engineering role supporting ML or data-intensive systems.
- Strong programming skills and production experience in Python.
- Expertise in Azure-native MLOps, including model endpoints, pipelines, registries, environments, and compute management.
Clear evidence of practical experience across the following:
- Deploying, scaling, and troubleshooting containerised workload on Kubernetes in production
- Building and maintaining CI/CD pipelines (Azure DevOps or equivalent) for automated testing, building, and deployment of ML services
- Implementing infrastructure-as-code (Terraform, Bicep or Pulumi)
- Implementing monitoring and observability for production systems, including metrics, altering, logging, and dashboarding (e.g. Prometheus, Grafana)
- Pipeline orchestration using Dagster, Airflow, Prefect, or similar
Bonus points if you have:
- Practical experience with model serving infrastructure – batch and/or real-time inference at scale.
- Experience operating multi-tenant systems, particularly scaling infrastructure across multiple clients or business units.
- Practical experience building and serving production-ready, asynchronous APIs for embedding and/or other compute-intensive services.
- Experience setting up, and optimising vector databases, e.g. Qdrant, and integrating with other services
- Proficiency in Python for building high-performance data and model pipelines, with strong software engineering discipline (testing, versioning, CI/CD).
- Deep familiarity with the Azure ecosystem (Azure Kubernetes Service, Azure Container Registry, Azure DevOps, Azure Blob Storage, Azure Monitor, Azure Key Vault).
Location – This is a hybrid role based in our office in London (Borough). You would be expected to be able to work from the office at least 1-2 days per week. Some travel is also required for on-site client engagements as needed.
What can we offer you?
- Competitive salary reviewed annually
- Work for a passionate, mission-driven company solving society’s big problems
- Work flexible hours around life commitments with a focus on delivering company value rather than hours worked
- Ability to work remotely (excluding face-to-face Team Meetings and client meetings)
- Training and development opportunities
- 25 days annual leave (plus bank holidays)
- Company pension
- Private medical insurance
- Generous enhanced parental leave policies
- Cycle to work scheme
- Flu Vaccinations,
- Eye Test and contribution towards Glasses for VDU use
- Employee Assistance Programme
- Mental health and wellbeing support
- Remote GP access
- Counselling/therapy
- Physiotherapy
- Medical second opinions
About
At Xantura, we’re on a mission to reduce societal inequality by helping local authorities use data more effectively. Our AI-driven platform empowers frontline workers with the insights they need to prevent complex issues like homelessness or children being taken into care — before they happen. We make this possible by connecting siloed datasets, applying advanced machine learning to enrich the data, and using predictive analytics to identify those most at risk. Our platform then distills this into clear, actionable insights that help frontline staff intervene early and make a real difference. It’s an exciting time to join Xantura. We’re scaling quickly, bringing on new clients, strengthening our platform, and expanding into new areas. While we’re a technology company at heart, our true focus is on improving lives — and we’re looking for people who share that vision.
Similar Jobs
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
The role involves designing and implementing advanced analytics solutions for healthcare payers, analyzing complex problems, mentoring juniors, and building client relationships.
Top Skills:
AzureDatabricksGitMl-OpsPython
Digital Media
Build and operate MLOps platform components: pipelines, model registries, deployment, monitoring, drift detection, and retraining. Collaborate with Data Science and Data Engineering to establish reproducible, scalable ML lifecycles, governance, and CI/CD for production ML services, and mentor junior engineers.
Top Skills:
AirflowAWSCi/CdContainerizationDockerEcsEksFeastLambdaMlflowPrefectPythonSagemakerSagemaker Model RegistrySnowflakeStep FunctionsTectonTensorflow ServingTerraformTorchserveWeights & Biases
Edtech • Information Technology • Software
Seeking a Senior Machine Learning Platform/Ops Engineer to productionize ML systems with reliability, collaborating with teams to enhance ML lifecycle from training to deployment.
Top Skills:
AirflowBigQueryDatabricksDbtDockerKafkaKubernetesMlflowPythonSagemakerSparkSQLTectonTerraform
What you need to know about the London Tech Scene
London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.



