The Multimodal VLM/LLM Researcher will develop and train vision-language and large language models, implement fine-tuning strategies, and innovate by integrating these models into media generation. The role involves conducting research, maintaining knowledge of AI developments, collaborating on model implementation, and sharing findings with teams.
Join Black Forest Labs, the pioneering team behind Stable Diffusion, Stable Video Diffusion, and FLUX.1, as we push the boundaries of generative AI. We're seeking an exceptional researcher to run cutting-edge projects in multimodal vision-language and large language models.
Key Responsibilities
Model Development & Training
- Run the development and training of state-of-the-art multimodal vision-language models within the FLUX technology stack
- Design and implement specialized fine-tuning strategies for VLMs to address specific use cases and performance requirements
- Develop and optimize LLM implementations for prompt enhancement, content moderation, and novel applications
Research & Innovation
- Drive innovation by integrating VLM/LLM capabilities into our media generation pipeline
- Conduct research to creatively combine vision and language models for enhanced generative capabilities
- Maintain cutting-edge knowledge of the latest developments in multimodal AI and LLM research
- Evaluate emerging models and architectures for potential integration into our technology stack
Technical Leadership
- Collaborate with cross-functional teams to implement and deploy models at scale
- Contribute to architectural decisions and technical roadmap planning
- Document and share research findings with the broader team
Required Qualifications
- Demonstrated expertise in training and fine-tuning large-scale vision-language models
- Strong publication record or practical experience with relevant projects in multimodal AI research
- Proficiency in PyTorch or similar deep learning frameworks
- Experience with distributed training systems and large-scale model optimization
- Track record of implementing and scaling AI models in production environments
Nice to have
- Experience with diffusion models and generative AI architectures alongside autoregressive modelling
- Background in computer vision
- Contributions to open-source AI projects
- Experience working in fast-paced startup environments
- Strong software engineering practices and system design skills
- Experience in open-source VLM inference frameworks such as vLLM
What We Offer
- Opportunity to work with the strong technical team at Black Forest Labs
- Access to state-of-the-art computing resources
- Collaborative, research-focused environment
- Competitive compensation package
- Flexible work arrangements
- Chance to shape the future of generative AI
We're looking for self-motivated individuals who are passionate about advancing the field of AI and can thrive in a fast-paced, research-driven environment. If you're excited about pushing the boundaries of what's possible in generative AI, we want to hear from you.
Top Skills
PyTorch
Similar Jobs
Enterprise Web • HR Tech • Information Technology • Software • Cybersecurity
As a Senior Ruby Developer at Immersive Labs, you'll architect and deliver significant updates to their learning platform, ensuring high-quality code through design, testing, and maintenance. You'll collaborate within a multi-disciplinary agile team, mentor peers, and engage in innovative projects while participating in a robust on-call scheme for system incidents.
Top Skills:
Ruby
Enterprise Web • HR Tech • Information Technology • Software • Cybersecurity
As a Full Stack Developer at Immersive Labs, you will work on building and improving our learning platform, collaborating with a multi-disciplinary team. Your responsibilities include architecting solutions, maintaining code quality, writing test automation, and mentoring other engineers while contributing to technical discussions and initiatives.
Top Skills:
JavaScriptPythonRuby
Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
The Senior Software Engineer will contribute to developing and maintaining backend applications using Java and Kotlin, work on high-availability systems, support product requirements, and ensure system performance. Responsibilities include designing new applications, improving existing systems, coding, and monitoring system performance, all within an agile environment.
Top Skills:
JavaKotlin
What you need to know about the London Tech Scene
London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.