Helsing Logo

Helsing

AI Research Engineer - Reinforcement Learning

Reposted 6 Hours Ago
Be an Early Applicant
In-Office
London, Greater London, England, GBR
Mid level
In-Office
London, Greater London, England, GBR
Mid level
You will design, train, and deploy agents in multi-agent environments, enhancing our reinforcement learning capabilities and integrating cutting-edge AI into production systems.
The summary above was generated by AI
Who we are

Helsing is a defence AI company. Our mission is to protect our democracies. We aim to achieve technological leadership, so that open societies can continue to make sovereign decisions and control their ethical standards. 

As democracies, we believe we have a special responsibility to be thoughtful about the development and deployment of powerful technologies like AI. We take this responsibility seriously. 

We are an ambitious and committed team of engineers, AI specialists and customer-facing programme managers. We are looking for mission-driven people to join our European teams – and apply their skills to solve the most complex and impactful problems. We embrace an open and transparent culture that welcomes healthy debates on the use of technology in defence, its benefits, and its ethical implications. 

The role

At Helsing we deliver AI-based capabilities and the enabling infrastructure that allow semi-autonomous platforms to localise, navigate, and perceive the world in real time. You will have the unique opportunity to shape the future of AI in one of the most challenging sectors, where performance needs to be paired with high generalisation capabilities and strong robustness against adversarial attacks.

You will build the autonomy brain for a cutting-edge autonomous aerial platform that will actually take flight. You will develop and integrate state-of-the-art reinforcement learning agents into the operational systems of our own Unmanned Combat Aerial Vehicle, the CA-1 Europa, as part of the groundbreaking Centaur project. This is a unique opportunity to take ownership of novel autonomous systems designed from the ground up, owning the full pipeline from large-scale simulation training through to real-time deployment on flight-ready hardware.

You should apply if you
  • Hold a MSc in Reinforcement Learning, Robotics, Automation and Control, or a closely related field, with a strong focus on sequential decision-making and autonomous systems.

  • Have hands-on experience building, training, and deploying reinforcement learning agents. You have iterated on a policy beyond simulation and understand what it takes to make learned behaviour reliable in a real operational system.

  • Are deeply familiar with modern RL and multi-agent RL techniques, including but not limited to: model-free methods (eg. PPO, SAC), population-based training, handling of partial observability and long horizons.

  • Have experience integrating RL policies into high-performance runtime systems, with a solid understanding of the latency and throughput constraints that come with real-time autonomous decision-making.

  • Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust or modern C++, and have experience deploying AI software to production including testing, QA, and monitoring.

  • Have excellent communication skills and the ability to report and present research findings clearly and efficiently, both internally and externally.

  • Are passionate about keeping up to date with current research and enjoy reimplementing and extending state-of-the-art approaches in deep reinforcement learning.

Note: We operate at an intersection where women, as well as other minority groups, are systematically under-represented. We encourage you to apply even if you don’t meet all the listed qualifications; ability and impact cannot be summarised in a few bullet points.

Nice to have
  • PhD in Reinforcement Learning, Multi-Agent Systems, Automation and Control, Robotics, or a related field, with publications in top-tier venues.

  • Experience with large-scale distributed RL training frameworks, the infrastructure challenges of running thousands of parallel simulation environments, and gpu-based simulators.

  • Experience modelling and training multi-agent controllers using state-of-the-art techniques, including emergent coordination, competitive self-play, or decentralised execution with centralised training.

  • Familiarity with flight dynamics, aerospace systems, or guidance, navigation, and control (GNC) concepts.

  • Experience deploying AI software to safety-critical production systems, including formal verification, testing pipelines, and runtime monitoring.

Join Helsing and work with world-leading experts in their fields
  • Helsing’s work is important. You’ll be directly contributing to the protection of democratic countries while balancing both ethical and geopolitical concerns. 

  • The work is unique. We operate in a domain that has highly unusual technical requirements and constraints, and where robustness, safety, and ethical considerations are vital. You will face unique Engineering and AI challenges that make a meaningful impact in the world. 

  • Our work frequently takes us right up to the state of the art in technical innovation, be it reinforcement learning, distributed systems, generative AI, or deployment infrastructure. The defence industry is entering the most exciting phase of the technological development curve. Advances in our field of world are not incremental: Helsing is part of, and often leading, historic leaps forward. 

  • In our domain, success is a matter of order-of-magnitude improvements and novel capabilities. This means we take bets, aim high, and focus on big opportunities. Despite being a relatively young company, Helsing has already been selected for multiple significant government contracts.  

  • We actively encourage healthy, proactive, and diverse debate internally about what we do and how we choose to do it. Teams and individual engineers are trusted (and encouraged) to practise responsible autonomy and critical thinking, and to focus on outcomes, not conformity. At Helsing you will have a say in how we (and you!) work, the opportunity to engage on what does and doesn’t work, and to take ownership of aspects of our culture that you care deeply about. 

What we offer
  • Competitive salary and VSOP options

  • Relocation support: up to €2,500 and 4 weeks temporary accommodation

  • Learning: €500/£450 yearly allowance

  • Health & wellness: gym membership and mental health support (Nilo.health)

  • Social: regular company events and monthly social allowances

  • Enhanced parental leave: 22 weeks fully paid for primary caregivers & 6 weeks for secondary caregivers

  • Family support: 5 days of paid family emergency leave, 100% remote work option during pregnancy and phased return to work

These are the core benefits across all locations, there may be additional benefits in certain locations.

Helsing is an equal opportunities employer. We are committed to equal employment opportunity regardless of race, religion, sexual orientation, age, marital status, disability or gender identity. Please do not submit personal data revealing racial or ethnic origin, political opinions, religious or philosophical beliefs, trade union membership, data concerning your health, or data concerning your sexual orientation. 

 
Helsing's Candidate Privacy and Confidentiality Regime can be found here. 
 
 

Similar Jobs

2 Hours Ago
Easy Apply
In-Office or Remote
Easy Apply
Entry level
Entry level
Artificial Intelligence • Edtech • Mobile • Natural Language Processing • Productivity • Software
The Marketplace Designer will create a variety of templates for social media, promotional materials, and events, ensuring ease of customization. They should have a design portfolio and proficiency in relevant tools.
Top Skills: Adobe IllustratorCanvaFigmaPhotoshop
6 Hours Ago
In-Office or Remote
GB
Mid level
Mid level
Productivity • Software • App development • Automation
As a C++ Engineer, you will develop machine learning features, integrate models, maintain SDKs, and collaborate across teams for software performance and enhancements.
Top Skills: AWSC++CmakeDockerGCPGitJenkinsNumpyOpencvPython
6 Hours Ago
In-Office or Remote
United Kingdom
Senior level
Senior level
Productivity • Software • App development • Automation
The QA Automation Lead will design automated test frameworks, ensure product quality, mentor engineers, and collaborate with development teams.
Top Skills: C#C++Ci/CdLinuxOsxPythonSdksWindows

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account