Black Forest Labs Logo

Black Forest Labs

Multimodal VLM/LLM Researcher

Posted 23 Days Ago
Be an Early Applicant
Remote
3 Locations
Mid level
Remote
3 Locations
Mid level
The Multimodal VLM/LLM Researcher will develop and train vision-language and large language models, implement fine-tuning strategies, and innovate by integrating these models into media generation. The role involves conducting research, maintaining knowledge of AI developments, collaborating on model implementation, and sharing findings with teams.
The summary above was generated by AI

Join Black Forest Labs, the pioneering team behind Stable Diffusion, Stable Video Diffusion, and FLUX.1, as we push the boundaries of generative AI. We're seeking an exceptional researcher to run cutting-edge projects in multimodal vision-language and large language models.

Key Responsibilities

Model Development & Training

  • Run the development and training of state-of-the-art multimodal vision-language models within the FLUX technology stack
  • Design and implement specialized fine-tuning strategies for VLMs to address specific use cases and performance requirements
  • Develop and optimize LLM implementations for prompt enhancement, content moderation, and novel applications

Research & Innovation

  • Drive innovation by integrating VLM/LLM capabilities into our media generation pipeline
  • Conduct research to creatively combine vision and language models for enhanced generative capabilities
  • Maintain cutting-edge knowledge of the latest developments in multimodal AI and LLM research
  • Evaluate emerging models and architectures for potential integration into our technology stack

Technical Leadership

  • Collaborate with cross-functional teams to implement and deploy models at scale
  • Contribute to architectural decisions and technical roadmap planning
  • Document and share research findings with the broader team

Required Qualifications

  • Demonstrated expertise in training and fine-tuning large-scale vision-language models
  • Strong publication record or practical experience with relevant projects in multimodal AI research
  • Proficiency in PyTorch or similar deep learning frameworks
  • Experience with distributed training systems and large-scale model optimization
  • Track record of implementing and scaling AI models in production environments

Nice to have

  • Experience with diffusion models and generative AI architectures alongside autoregressive modelling 
  • Background in computer vision 
  • Contributions to open-source AI projects
  • Experience working in fast-paced startup environments
  • Strong software engineering practices and system design skills
  • Experience in open-source VLM inference frameworks such as vLLM

What We Offer

  • Opportunity to work with the strong technical team at Black Forest Labs
  • Access to state-of-the-art computing resources
  • Collaborative, research-focused environment
  • Competitive compensation package
  • Flexible work arrangements
  • Chance to shape the future of generative AI

We're looking for self-motivated individuals who are passionate about advancing the field of AI and can thrive in a fast-paced, research-driven environment. If you're excited about pushing the boundaries of what's possible in generative AI, we want to hear from you.

Top Skills

PyTorch

Similar Jobs

17 Hours Ago
Remote
United Kingdom
Senior level
Senior level
Enterprise Web • HR Tech • Information Technology • Software • Cybersecurity
As a Senior Ruby Developer at Immersive Labs, you'll architect and deliver significant updates to their learning platform, ensuring high-quality code through design, testing, and maintenance. You'll collaborate within a multi-disciplinary agile team, mentor peers, and engage in innovative projects while participating in a robust on-call scheme for system incidents.
Top Skills: Ruby
17 Hours Ago
Remote
Hybrid
United Kingdom
Senior level
Senior level
Enterprise Web • HR Tech • Information Technology • Software • Cybersecurity
As a Full Stack Developer at Immersive Labs, you will work on building and improving our learning platform, collaborating with a multi-disciplinary team. Your responsibilities include architecting solutions, maintaining code quality, writing test automation, and mentoring other engineers while contributing to technical discussions and initiatives.
Top Skills: JavaScriptPythonRuby
2 Days Ago
Remote
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
The Senior Software Engineer will contribute to developing and maintaining backend applications using Java and Kotlin, work on high-availability systems, support product requirements, and ensure system performance. Responsibilities include designing new applications, improving existing systems, coding, and monitoring system performance, all within an agile environment.
Top Skills: JavaKotlin

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account