G2i

Constraint Programming Software Engineer, AI

Sorry, this job was removed at 08:18 a.m. (GMT) on Friday, Jul 04, 2025
In-Office or Remote
207 Locations

Train large language models (LLMs) to write production-grade code:

  • Compare & rank multiple code snippets, explaining which is best and why.

  • Repair & refactor AI-generated code for correctness, efficiency, and style.

  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly.

End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line

Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.
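
To make the "convert that feedback into reward signals" step concrete, here is a minimal sketch of one common approach: expanding a reviewer's ranking into pairwise preferences, the kind of data often used to train a reward model. The posting does not describe the actual pipeline, so the type and function names below (RankedSnippet, PreferencePair, rankingToPairs) are hypothetical and for illustration only.

```typescript
// Hypothetical shapes for illustration; none of these names come from the posting.
interface RankedSnippet {
  id: string;
  rank: number; // 1 = best, as judged by the reviewer
}

interface PreferencePair {
  chosen: string;   // id of the snippet the reviewer preferred
  rejected: string; // id of the snippet ranked lower
}

// Expand one reviewer ranking into every (better, worse) pair.
function rankingToPairs(ranking: RankedSnippet[]): PreferencePair[] {
  // Copy before sorting so the caller's array is left untouched.
  const sorted = [...ranking].sort((a, b) => a.rank - b.rank);
  const pairs: PreferencePair[] = [];
  sorted.forEach((better, i) => {
    for (const worse of sorted.slice(i + 1)) {
      // Tied ranks carry no preference signal, so skip them.
      if (better.rank === worse.rank) continue;
      pairs.push({ chosen: better.id, rejected: worse.id });
    }
  });
  return pairs;
}
```

Under these assumptions, a ranking of three snippets with distinct ranks yields three pairs, each recording which snippet the reviewer preferred. The written justification would travel alongside each pair as separate metadata.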


What is Needed

  • 4+ years of professional software-engineering experience.

  • Extreme attention to detail and excellent writing skills—most of the job is explaining why one solution is better than another. This requirement cannot be overstated!

  • You actually enjoy reading documentation and specs.

  • Proven ability to thrive in a fully asynchronous, low-oversight remote environment.

  • Strong code-review instincts: can spot logic errors, performance traps, and security issues quickly.

What is Not Needed
  • No prior RLHF or AI-training experience required.

  • You don’t need deep machine-learning knowledge—if you can review code and explain your reasoning, we’ll teach you the RLHF bits.

Tech Stack

We are looking for full-stack JavaScript or TypeScript engineers:

  • React, Next.js

  • Node

  • Postgres


Logistics
  • Location: Fully remote (work from anywhere).

  • Hours: Minimum 15 hours per week, with the option to work up to 40.

  • Engagement: 1099 contract

Straightforward impact, zero fluff. If this fits your profile, apply here.
