NexGen Cloud Logo

NexGen Cloud

Platform Operations Lead

Reposted 15 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in UK
Mid level
Remote
Hiring Remotely in UK
Mid level
As Platform Operations Lead, you will enhance the operational maturity of NexGen Cloud's infrastructure, focusing on automation, reliability, and incident response while improving processes across teams.
The summary above was generated by AI

Platform Operations Lead

Location: UK (Remote)
Department: Infrastructure
Reporting to: Head of Infrastructure

ABOUT NEXGEN CLOUD:

NexGen Cloud is the company behind Hyperstack, a full-stack AI cloud serving tens of thousands of customers from AI researchers to enterprises running the world's most compute-intensive workloads. We deliver on-demand and private GPU infrastructure to teams who treat performance as a requirement, not a feature.

We're a tight-knit, fast-moving team working at the cutting edge of AI cloud infrastructure. We practice what we preach, equipping our people with AI at every level so we can solve harder problems, ship faster, and keep raising the bar for what enterprise GPU infrastructure looks like.

THE ROLE: Platform Operations Lead

This role exists to help NexGen Cloud scale the operational maturity of its cloud infrastructure as demand grows across regions, customers and services.

You'll sit at the intersection of Infrastructure, DevOps, Engineering and Customer Experience, helping reduce operational load on engineering teams through automation, tooling, runbooks and clear support processes. You'll play a key role in improving reliability, observability, incident response and operational readiness across our platform.

This is a hands-on role for someone who enjoys building practical solutions, improving how teams work, and taking ownership of operational outcomes in a fast-moving cloud environment.

WHAT YOU'LL BE DOING:

Rather than a long checklist, here's what success in this role looks like:

  • Build and improve scalable infrastructure operations processes that support a growing cloud platform
  • Enable customer-facing and operational teams with secure automation, diagnostics, tooling and clear workflows
  • Reduce repeatable manual work by identifying operational pain points and turning them into automated or self-service solutions
  • Support the rollout and readiness of new infrastructure environments, working closely with Infrastructure, DevOps and Engineering teams
  • Improve observability, incident response and operational documentation across production environments
  • Design and maintain runbooks, escalation paths and ownership models between technical and customer-facing teams
  • Evaluate new tools, vendors or approaches that could improve operational efficiency, reliability or scale
  • Coach and enable teams to adopt automation-first ways of working
  • Act as a bridge between technical infrastructure teams and the teams supporting customers day to day
ABOUT YOU:

We're more interested in how you think and work than in a perfect CV. You'll likely bring a combination of the following:

Essential
  • Strong background in infrastructure, cloud operations, DevOps, platform engineering or operational engineering
  • Experience supporting production environments where reliability, incident response and operational discipline matter
  • Proven ability to design or implement automation that reduces manual operational workload
  • Strong scripting, tooling or workflow automation skills, with an emphasis on clarity, maintainability and security
  • Experience working across multiple technical and non-technical teams to improve processes and outcomes
  • Familiarity with observability, monitoring, infrastructure platforms or orchestration technologies
  • Ability to operate both hands-on and strategically in a fast-moving scale-up environment
  • Strong documentation, communication and problem-solving skills
  • A sense of ownership and accountability for operational outcomes
Nice to Have
  • Experience with GPU, high-performance compute, cloud infrastructure or managed infrastructure platforms
  • Exposure to Kubernetes, OpenStack, Grafana, Windmill automation platforms or infrastructure-as-code tooling
  • Experience helping build or mature 24/7 support, NOC, SRE or technical operations capabilities
  • Experience designing self-service tooling for support, operations or customer-facing teams
WHAT WE OFFER:
  • Competitive salary and annual discretionary bonus scheme
  • Employee wellbeing benefits
  • 25 days of holiday, plus public holidays
  • Flexible working arrangements (remote or hybrid, depending on role and location)
  • Real ownership and autonomy, with the trust to take initiative and experiment
  • The opportunity to make a visible, meaningful impact as we scale
  • Clear career progression and growth opportunities in a fast-growing company
  • A collaborative, international culture built on trust, transparency, and ownership
  • The chance to help shape NexGen Cloud's team, culture, and future alongside ambitious, mission-driven colleagues
MORE INFORMATION

Head over to our NexGen Cloud careers page to view current openings and follow us on LinkedIn and X to learn more about our journey, newest releases and hear exciting news in the neocloud space.


HQ

NexGen Cloud London, England Office

24 Greville St, London, United Kingdom, EC1N 8SS

Similar Jobs

17 Days Ago
In-Office or Remote
London, Greater London, England, GBR
Senior level
Senior level
Artificial Intelligence • Software • Automation
The role involves developing and supporting data processing pipelines, collaborating with ML Engineers, and improving infrastructure while mentoring team members.
Top Skills: DockerGkeGrafanaKubernetesLokiMessage QueuesNatsPrometheusPythonRabbitTempo
17 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
As a Senior Associate, you will implement Oracle HCM solutions, analyze problems, mentor junior staff, manage client relationships, and ensure quality deliverables.
Top Skills: Cc&BEbsHyperionOracle FusionOracle HcmPeoplesoftSiebel
19 Hours Ago
Remote
UK
Senior level
Senior level
Information Technology
As a Senior Data Scientist, you will lead high-complexity projects, develop ML and NLP solutions, and collaborate across teams to drive business impact through statistical modeling and data analysis.
Top Skills: BigQueryClickhouseDruidLlmsMachine LearningNatural Language ProcessingPower BIPythonRedshiftSQLTableau

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account