Helsing Jobs

AI Research Engineer - Reinforcement Learning

Helsing

AI Research Engineer - Reinforcement Learning

Reposted 25 Days Ago

Be an Early Applicant

In-Office

London, Greater London, England, GBR

Mid level

In-Office

London, Greater London, England, GBR

Mid level

You will design, train, and deploy agents in multi-agent environments, enhancing our reinforcement learning capabilities and integrating cutting-edge AI into production systems.

The summary above was generated by AI

Who we are

Helsing is a defence AI company. Our mission is to protect our democracies. We aim to achieve technological leadership, so that open societies can continue to make sovereign decisions and control their ethical standards.

As democracies, we believe we have a special responsibility to be thoughtful about the development and deployment of powerful technologies like AI. We take this responsibility seriously.

We are an ambitious and committed team of engineers, AI specialists and customer-facing programme managers. We are looking for mission-driven people to join our European teams – and apply their skills to solve the most complex and impactful problems. We embrace an open and transparent culture that welcomes healthy debates on the use of technology in defence, its benefits, and its ethical implications.

The role

At Helsing we deliver AI-based capabilities and the enabling infrastructure that allow semi-autonomous platforms to localise, navigate, and perceive the world in real time. You will have the unique opportunity to shape the future of AI in one of the most challenging sectors, where performance needs to be paired with high generalisation capabilities and strong robustness against adversarial attacks.

You will build the autonomy brain for a cutting-edge autonomous aerial platform that will actually take flight. You will develop and integrate state-of-the-art reinforcement learning agents into the operational systems of our own Unmanned Combat Aerial Vehicle, the CA-1 Europa, as part of the groundbreaking Centaur project. This is a unique opportunity to take ownership of novel autonomous systems designed from the ground up, owning the full pipeline from large-scale simulation training through to real-time deployment on flight-ready hardware.

You should apply if you

Hold a MSc in Reinforcement Learning, Robotics, Automation and Control, or a closely related field, with a strong focus on sequential decision-making and autonomous systems.
Have hands-on experience building, training, and deploying reinforcement learning agents. You have iterated on a policy beyond simulation and understand what it takes to make learned behaviour reliable in a real operational system.
Are deeply familiar with modern RL and multi-agent RL techniques, including but not limited to: model-free methods (eg. PPO, SAC), population-based training, handling of partial observability and long horizons.
Have experience integrating RL policies into high-performance runtime systems, with a solid understanding of the latency and throughput constraints that come with real-time autonomous decision-making.
Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust or modern C++, and have experience deploying AI software to production including testing, QA, and monitoring.
Have excellent communication skills and the ability to report and present research findings clearly and efficiently, both internally and externally.
Are passionate about keeping up to date with current research and enjoy reimplementing and extending state-of-the-art approaches in deep reinforcement learning.

Note: We operate at an intersection where women, as well as other minority groups, are systematically under-represented. We encourage you to apply even if you don’t meet all the listed qualifications; ability and impact cannot be summarised in a few bullet points.

Nice to have

PhD in Reinforcement Learning, Multi-Agent Systems, Automation and Control, Robotics, or a related field, with publications in top-tier venues.
Experience with large-scale distributed RL training frameworks, the infrastructure challenges of running thousands of parallel simulation environments, and gpu-based simulators.
Experience modelling and training multi-agent controllers using state-of-the-art techniques, including emergent coordination, competitive self-play, or decentralised execution with centralised training.
Familiarity with flight dynamics, aerospace systems, or guidance, navigation, and control (GNC) concepts.
Experience deploying AI software to safety-critical production systems, including formal verification, testing pipelines, and runtime monitoring.

Join Helsing and work with world-leading experts in their fields

Helsing’s work is important. You’ll be directly contributing to the protection of democratic countries while balancing both ethical and geopolitical concerns.
The work is unique. We operate in a domain that has highly unusual technical requirements and constraints, and where robustness, safety, and ethical considerations are vital. You will face unique Engineering and AI challenges that make a meaningful impact in the world.
Our work frequently takes us right up to the state of the art in technical innovation, be it reinforcement learning, distributed systems, generative AI, or deployment infrastructure. The defence industry is entering the most exciting phase of the technological development curve. Advances in our field of world are not incremental: Helsing is part of, and often leading, historic leaps forward.
In our domain, success is a matter of order-of-magnitude improvements and novel capabilities. This means we take bets, aim high, and focus on big opportunities. Despite being a relatively young company, Helsing has already been selected for multiple significant government contracts.
We actively encourage healthy, proactive, and diverse debate internally about what we do and how we choose to do it. Teams and individual engineers are trusted (and encouraged) to practise responsible autonomy and critical thinking, and to focus on outcomes, not conformity. At Helsing you will have a say in how we (and you!) work, the opportunity to engage on what does and doesn’t work, and to take ownership of aspects of our culture that you care deeply about.

What we offer

Competitive salary and VSOP options
Relocation support: up to €2,500 and 4 weeks temporary accommodation
Learning: €500/£450 yearly allowance
Health & wellness: gym membership and mental health support (Nilo.health)
Social: regular company events and monthly social allowances
Enhanced parental leave: 22 weeks fully paid for primary caregivers & 6 weeks for secondary caregivers
Family support: 5 days of paid family emergency leave, 100% remote work option during pregnancy and phased return to work

These are the core benefits across all locations, there may be additional benefits in certain locations.

Helsing is an equal opportunities employer. We are committed to equal employment opportunity regardless of race, religion, sexual orientation, age, marital status, disability or gender identity. Please do not submit personal data revealing racial or ethnic origin, political opinions, religious or philosophical beliefs, trade union membership, data concerning your health, or data concerning your sexual orientation.

Helsing's Candidate Privacy and Confidentiality Regime can be found here.

London, United Kingdom

Similar Jobs

Superhuman

System Engineer

8 Minutes Ago

Hybrid

Senior level

Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI

Design and lead automation, security, and scalability for identity and endpoint management. Architect identity lifecycle, device provisioning, compliance automation, and access controls; build automation around Okta, Jamf Pro, Workspace ONE, and scripting; lead migrations, platform consolidations, and self-service tooling while collaborating with security, IT, and business partners.

Top Skills: BashCis BenchmarksDevice TrustGitIntuneJamf ProOktaPowershellPythonScimSsoUemWorkspace OneZero-Touch Deployment

Airwallex

Senior Associate, Revenue Strategy & Enablement, EMEA

18 Hours Ago

In-Office

London, England, GBR

Senior level

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI

Lead design and delivery of enablement programs across EMEA, translating GTM priorities into onboarding, product training, content, and AI-enabled tools. Partner with commercial leaders and Revenue Operations to improve sales execution, analyse funnel performance, and scale enablement to accelerate revenue and rep productivity.

Top Skills: Ai ToolsGongHighspotLms PlatformsOutreachSalesforce

Airwallex

Manager, Revenue Strategy & Enablement, EMEA

18 Hours Ago

Hybrid

London, England, GBR

Senior level

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI

Lead EMEA revenue enablement by designing and delivering onboarding, product training, enablement programs, AI-enabled tools, and scalable content. Partner with commercial leaders to translate GTM priorities, analyse funnel performance, and improve rep effectiveness to accelerate revenue.

Top Skills: Ai ToolsGongHighspotLms PlatformsOutreachSalesforce

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Helsing

AI Research Engineer - Reinforcement Learning

Helsing London, England Office

Similar Jobs

System Engineer

Senior Associate, Revenue Strategy & Enablement, EMEA

Manager, Revenue Strategy & Enablement, EMEA

What you need to know about the London Tech Scene