The role involves post-training of LLMs, model alignment, server operation for checkpoint routing, and building evaluation pipelines.
Job Responsibilities:
1. Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining).
2. Aligning models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
5. Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations.
Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.
Top Skills
Deepspeed
Fsdp
Lora
Python
PyTorch
Qlora
TensorFlow
Triton
Vllm
Similar Jobs
Productivity • Software • App development • Automation
Develop features for the Xodo platform, engage in all aspects of development, collaborate with other engineers, and research new projects.
Top Skills:
AWSDockerJavaScriptMySQLNext.JsPostgresReactTypescript
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
The SDET will develop and maintain automated tests in a microservices architecture, ensuring quality through collaboration and adherence to best practices.
Top Skills:
AWSAzureC#CypressDockerGCPJavaJavaScriptJmeterK6KubernetesPythonRubySeleniumSQL ServerTypescript
Cloud • Security • Software • Cybersecurity • Automation
As a Deal Desk Analyst, you'll support Sales in structuring, quoting, and booking deals, ensuring accurate deal intent in Salesforce and working with finance for seamless revenue processes.
Top Skills:
Cpq ToolsExcelGoogle SheetsSalesforceZuora
What you need to know about the London Tech Scene
London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.


.png)
