SolveAI Logo

SolveAI

Infrastructure Engineer

Reposted 6 Days Ago
Be an Early Applicant
In-Office
London, Greater London, England, GBR
Mid level
In-Office
London, Greater London, England, GBR
Mid level
The Infrastructure Engineer will manage AWS infrastructure, lead customer deployments, ensure production reliability, and shape compliance while engaging with customer architects.
The summary above was generated by AI
About SolveAI

Most enterprise problems never get solved in software, because the person who understands the problem and the person who can build the fix are never the same person. SolveAI closes that gap.

Our platform lets the people closest to the problem build real, production-grade software by describing what they need in plain English. It plugs into the enterprise stack as it actually exists, and what it ships clears the bar enterprises actually hold software to: secure, compliant, reliable, maintainable, and able to scale. The result is software tailored to each customer's environment, owned by them.

Unlike the wave of "vibe coding" tools squinting at the enterprise from the consumer world - hoping the hard parts can be bolted on later - we started at the hard end. For us, that bar is the foundation we build on, not a roadmap item.

We can do this because of who's in the room: a team that's spent decades operating inside the world's most complex organisations, not just observing them. Backed by GV and Accel, built by an ex-Palantir team, and already live with enterprise customers.

What We Offer
  • High-impact environment: Join a company shaping the future of enterprise software, working with a team that's redefining how AI meets real-world business needs

  • Ownership and visibility: In a small, high-performing team, the impact of your work isn't theoretical - it actively shapes the company's direction

  • Empowerment through innovation: Use cutting-edge AI tools internally. You'll be working with the same technology we deliver to enterprises

About the role

This is a high-leverage, high-ownership role and you'll set technical direction across reliability, observability, and customer deployability.

What makes it unusual is that you'll be the person enterprise security and infrastructure teams talk to when they have hard questions. As much as this is an internal platform role, it's also a customer-facing one. You'll be on calls with customer architects, leading on-prem rollouts, and navigating the compliance requirements of some demanding environments.

What you'll do
  • Own our AWS footprint, EKS clusters, and Terraform codebase: design, evolve, harden

  • Build and operate the observability stack (Datadog) so we catch problems before customers do: metrics, traces, logs, alerting, SLOs

  • Design for multi-tenancy: isolation, performance, cost attribution, noisy-neighbour mitigation

  • Lead our customer deployment story: make it straightforward for enterprises to run SolveAI in their own AWS / Azure / GCP accounts, or fully on-prem

  • Be the technical lead on customer security reviews, architecture deep-dives, and on-prem rollouts

  • Own CI/CD, secrets management, and the developer experience that lets engineers ship safely and fast

  • Own production reliability: incident response, post-mortems, capacity planning

  • Help shape our compliance posture (SOC 2, ISO 27001, financial-services requirements) as we grow

What we're looking for

Beyond technical depth, we're looking for:

Ownership. You treat the platform as yours. You don't wait to be told something is broken and you don't hand problems off.

Customer instinct. You're comfortable talking to enterprise security and infra teams. You can take a hard technical question and give a clear, honest answer.

Pragmatism. You know the difference between good enough to ship and good enough to last. You make that call correctly.

On the technical side:

  • Deep AWS expertise: VPC, IAM, EKS, networking

  • Strong Kubernetes knowledge — operator level, not just kubectl.

  • Terraform fluency: module design, state management, multi-account patterns

  • Strong observability instincts: alerting that doesn't burn people out, symptoms vs causes

  • Experience supporting enterprise customers running your software in their own environments (BYOC or on-prem), or a strong appetite to figure it out

  • Comfortable enough in Rust and Python to trace a problem from infrastructure all the way into application code

  • Strong written communication: you'll be talking to customer architects as much as our own engineers

Nice to have
  • Experience with air-gapped or on-prem distributions of cloud-native software

  • Helm, ArgoCD, or similar GitOps tooling

  • Background with high-throughput / low-latency systems

  • Cost optimisation experience at AWS scale

Why SolveAI

At SolveAI, you'll work alongside a team that's spent decades solving the hardest enterprise challenges, from operational inefficiencies to data fragmentation.

If you want to see real change in the world, care deeply about value add, and thrive when given ownership in unfamiliar environments, we'd love to hear from you.

Similar Jobs

19 Days Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
Responsible for operating and maintaining edge infrastructure, managing cloud and physical servers, troubleshooting systems, and automating tasks while collaborating with multiple teams.
Top Skills: CassandraHadoopKubernetesLinuxOraclePostgres
19 Days Ago
Hybrid
London, Greater London, England, GBR
Mid level
Mid level
Artificial Intelligence • Information Technology • Software
The role involves maintaining and optimizing Palantir software systems, developing automated workflows, and collaborating on infrastructure challenges for government solutions.
Top Skills: AIBashConfiguration ManagementGoJavaJavaScriptLlmPalantir ApolloPalantir FoundryPython
Yesterday
In-Office
Mid level
Mid level
Information Technology • Consulting
Design, build, and operate secure, compliant Azure Landing Zones and reusable infrastructure patterns. Implement Infrastructure as Code (Bicep/Terraform), automate deployments via Azure DevOps CI/CD, manage networking, governance, identity, and security controls, onboard workloads, harden VMs, and maintain image-building tooling to improve platform reliability and compliance.
Top Skills: Azure CliAzure DevopsAzure FirewallAzure Landing Zone AcceleratorAzure Landing ZonesAzure PolicyAzure Virtual Network (Vnet)BicepCi/CdCloud Adoption Framework (Caf)Defender For CloudDesired State ConfigurationDnsEntra Id (Azure Ad)Image BuilderLinuxAzurePackerPimPowershellPrivate EndpointsPythonRbacSource Control (Git)TerraformWindows

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account