Careerflow.ai Logo

Careerflow.ai

Math IMO Expert

Posted Yesterday
Be an Early Applicant
Remote
Hiring Remotely in VN
Senior level
Remote
Hiring Remotely in VN
Senior level
Design original IMO/AIME/HMMT-level math problems across algebra, number theory, combinatorics, and geometry; write rigorous LaTeX solutions; craft prompts to expose LLM reasoning failures; review and diagnose model outputs; label and classify prompts; contribute to evaluation benchmark design. Remote contractor role, 40 hrs/week with PST overlap.
The summary above was generated by AI

Total Number of Positions: 40

Role Overview

  • In this role, you will work on projects that improve and evaluate large language models by crafting challenging, competition-level mathematics problems and rigorously assessing model reasoning.

  • The ideal candidate has a strong foundation in competitive mathematics at the AIME, HMMT, and IMO (Olympiad) level across the four classic pillars: Algebra, Number Theory, Combinatorics, and Geometry.

  • You should be able to design novel, "Google-proof" problems intended to expose deep reasoning deficiencies in state-of-the-art models, and to diagnose precisely where and why a model's reasoning breaks down.

  • The role combines original problem authoring, rigorous solution writing, and detailed evaluation of model-generated responses.

  • This is your chance to future-proof your career in an AI-first world by working at the frontier of mathematical reasoning evaluation.

What does the day-to-day look like:

  • Design original, challenging mathematics problems at AIME, HMMT, and IMO difficulty that test the reasoning limits of large language models in multi-step, abstract settings, drawn strictly from Algebra, Number Theory, Combinatorics, or Geometry.

  • Author novel prompts that "break" evaluated models, meaning the model arrives at an incorrect final answer; ensure problems cannot be bypassed via brute-force or computationally intensive methods.

  • Solve problems independently and write detailed, logically structured, self-contained solutions with clear justifications, properly rendered in LaTeX.

  • Review model-generated solutions, identify mathematical errors, logical fallacies, or missing arguments, and diagnose the root cause using defined failure categories (Final Answer, Reasoning Steps, Instruction Following).

  • Contribute to defining new evaluation benchmarks across competition and Olympiad-level mathematics curricula.

  • Classify each prompt accurately by domain, sub-domain, topic, and proficiency level within the labeling tool.

Requirements

  • Mathematical Expertise: Strong command of competitive mathematics at the level of AIME, HMMT, and IMO across Algebra, Number Theory, Combinatorics, and Geometry.

  • Writing Proficiency: Excellent structured written communication, including fluency with standard LaTeX delimiters for all mathematical expressions.

  • Analytical Skills: Strong research and analytical skills, with the ability to construct rigorous, proof-based reasoning.

  • Creative Thinking: Creative and lateral thinking abilities to design novel problems that are not adapted from existing competitions or online repositories.

  • Feedback Skills: Ability to provide constructive feedback, precise annotations, and accurate error diagnosis on model outputs.

  • Independence: Self-motivated and able to work independently in a remote setting.

  • Technical Setup: Desktop/Laptop setup with a good internet connection.

Preferred Qualifications

  • Candidates pursuing or holding a Bachelor’s/Master's degree in Mathematics, Applied Mathematics, Statistics, Engineering, or a related field are eligible and encouraged to apply.

  • Prior experience in competitive mathematics (e.g., national or international Olympiads or equivalent competitive examinations) as a participant, coach, or problem setter is a bonus.

  • Ability to analyze and solve complex problems with a structured, logical approach and to express solutions clearly and rigorously.

Perks of Freelancing

  • Work in a fully remote environment.

  • Opportunity to work on cutting-edge AI projects with leading LLM companies.

  • Potential for contract extension based on performance and project needs.

Offer Details

  • Commitments Required: At least 8 hours per day and 40 hours per week, with 4 hours of overlap with PST.

  • Engagement type: Contractor assignment/freelancer (no medical/paid leave).

Similar Jobs

Mid level
eCommerce • Fashion • Retail • Sales • Wearables • Design
The Developer will support footwear production, manage development projects, coordinate sourcing, ensure accurate specifications, and communicate between teams.
Top Skills: ExcelOffice SoftwarePowerPointWord
Mid level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Senior Health Representative builds relationships with healthcare professionals, promotes Pfizer products, and ensures customer engagement through a consultative sales approach in a defined territory.
Top Skills: Digital Service OfferingsPharmaceutical Products PortfolioVirtual Health Representative Services
9 Days Ago
Remote or Hybrid
Entry level
Entry level
eCommerce • Fashion • Retail • Sales • Wearables • Design
As a Footwear Fitting Model, provide feedback on footwear fit, comfort, and quality, collaborate with design teams, and participate in fitting sessions and photoshoots.
Top Skills: ExcelMS OfficePowerPointWord

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account