Baton Corporation

Reinforcement Learning Engineer ($400k - $800k salary)

Reposted 10 Days Ago
In-Office
London, Greater London, England, GBR
Expert/Leader
As a Reinforcement Learning Engineer, you will own production trading systems, design reward functions, validate frameworks, and lead RL efforts to drive trading volume safely with real capital.
The summary above was generated by AI
Who We Are

Baton Corporation is the development company that builds and operates the entire technology stack behind pump.fun, the largest memecoin launchpad in production today. The systems are low latency, high throughput, live under constant load, and break if you get them wrong.

What You’ll Do

As our Reinforcement Learning Engineer, you will own a production trading system that directly deploys real capital. This is not a research role: it’s about building learning systems that are robust, measurable, and safe under real-world constraints.

  • Own and ship an RL-driven trading agent using real capital to increase trading volume and user participation in a memecoin ecosystem

  • Design reward functions and policies aligned with product goals while enforcing strict downside risk constraints

  • Build evaluation and validation frameworks (simulation, offline analysis) to minimize reliance on live sequential testing

  • Safely transition an existing heuristic-based production system toward learning-based approaches

  • Take end-to-end ownership and technical leadership as the sole RL expert, from data and modeling through deployment, monitoring, and safeguards

Who You Are:
  • You have previously put an autonomous learning system into production that directly controlled capital, pricing, traffic, or resources, and can explain what broke and how you fixed it

  • Have personally designed and enforced hard risk limits (capital caps, loss bounds, circuit breakers) in a live system, not just talked about “risk-aware objectives.”

  • Have built a policy evaluation loop from scratch (simulators, replay, counterfactuals, shadow deployments) before trusting live rollout.

  • Can make and defend uncomfortable tradeoffs (e.g. heuristic > RL, bandit > deep RL) based on empirical results instead of ideology

  • Have operated as the single owner of a complex ML system in a small team, with no safety net of research orgs, infra teams, or “ML platforms.”

What it's like to work here
  • We work in person

  • Hours can be long and unconventional

  • The pace is intense

  • Expectations are high, and impact is immediate

  • Working at Baton is not for everyone

Why Join Us?
  • Unmatched ownership and autonomy

  • Exposure to systems operating at the edge of crypto scale

  • The ability to ship fast and see real-world impact immediately

If you’re motivated by responsibility, speed, and building products used by massive audiences, you’ll feel at home here.
