Lead GPU-edge inference systems, ensuring compliance and performance through design and monitoring. Manage global infrastructure for AI deployment.
Full-time | Remote | Infrastructure | Reports to CTO
About Elloe
Elloe is the trust layer for AI.
We sit between the world’s most powerful language models and the institutions that can't afford to get it wrong: hospitals, banks, regulators. We trace and block failures in real time. That’s not marketing. We’re deployed at the European Commission, in NIH clinical trials, and inside a Top-5 EU bank catching GDPR violations live.
This is the enforcement layer GenAI has been missing. We're not visualizing problems — we're fixing them.
About the Role
You’ll lead our GPU-edge inference systems. From chaos-resilient deployment to SHAP-driven compliance metrics, you’ll own global infra that makes AI safe and performant.
What You’ll Own
1. Global Edge Routing
- Design zone routing that holds a <50 ms latency SLA across 10+ regions
- Build fallback orchestration to handle compliance-aware rollbacks
2. GPU Infra Ops
- Maximize utilization across 100K+ GPUs via mesh & load prediction
- Integrate compliance overlays with VaultChain and SHAP triggers
3. Reliability Telemetry
- Ship `/vault/audit`, `/inference/predict`, `/compliance/log` endpoints
- Trace every edge request across governance and model layers
Who You Are
- Senior systems engineer with GPU fleet experience (KubeRay, Istio, Envoy)
- Experienced operating real-time AI infra at 10M+ QPS
- Comfortable with compliance observability and infra governance
Why This Matters
Our competitive edge isn’t just AI — it’s defensible enforcement. This role turns that into product.
You’ll Leave This Role With
- Referenceable contributions to enforcement infra that’s live in EU and US institutions
- First-hand product work across legal, engineering, and GTM teams
- Influence over how regulatory primitives become systems people trust
Logistics & Application
- Start Date: Flexible (Q3–Q4 ideal)
- Location: Remote-first; timezone overlap with NY or EU preferred
- Compensation: Top of market salary + equity
- To Apply: Send your resume and a sentence on the hardest infra problem you'd want to own at scale.