Tencent Logo

Tencent

Video Generation Content Understanding and Feedback Research Intern

Posted 12 Hours Ago
Be an Early Applicant
In-Office
London, Greater London, England, GBR
Internship
In-Office
London, Greater London, England, GBR
Internship
Support research on video generation and understanding by implementing state tracking, evaluating consistency and causal/physical constraints, exploring unified generation-understanding and hybrid low-latency architectures, validating interactive capabilities in simulated/game scenarios, building evaluation metrics, and tracking relevant academic and open-source work.
The summary above was generated by AI
Business UnitLIGHTSPEED STUDIOS is made up of passionate players who advance the art & science of game development through great stories, great gameplay, and advanced technology. We are focused on bringing next generation experiences to gamers who want to enjoy them anywhere, anytime, across multiple genres and devices.

About the Hiring TeamLightspeed Tech Center is a R&D department under Lightspeed Studios which develop PUBG Mobile and other high-quality games. Our Tech Center leads the research, exploration, and discovery of innovative technologies and provides technical services for all games during all phases of life cycle, including engine, audio, QA, AI, next generation game, technical cooperation, etc.What the Role Entails

1. Assist in researching the model's ability to understand generated content, including parsing semantics, objects, relationships, and spatial structures.

2. Help implement state tracking and evaluate consistency modeling for generated videos.

3. Participate in exploring "unified generation-understanding" model architectures.

Assist in evaluating causal consistency control and physical constraints for "action input → video output" pipelines.

4. Contribute to researching hybrid architectures aimed at achieving low-latency feedback for real-time interactive generation.

5. Work with the team to test and refine the end-to-end closed loop of "generation → understanding → control → feedback."

6. Assist in validating interactive capabilities in simulated/game scenarios and help build evaluation metrics.

7. Track the latest industry academic papers and open-source projects related to video understanding and generation.

Who We Look For

1. Currently pursuing a Ph.D. or Master's degree in AI-related fields (video understanding, video prediction, reinforcement learning, or multimodal generation).

2. Good understanding of the internal mechanisms of video generation/VLM models and diffusion model principles.

3. Academic or project experience in interactive video generation, controllable generation, or multimodal understanding.

4. Solid coding and engineering skills; able to assist in building and debugging model training pipelines.

5. Proficient in Python/PyTorch; publications in related fields are a strong plus.

6. Familiarity with Game AI, simulation environments, or reinforcement learning frameworks is preferred.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Similar Jobs

An Hour Ago
Remote or Hybrid
United Kingdom
Mid level
Mid level
Information Technology • Machine Learning • Software • Conversational AI • Generative AI • Manufacturing
The Professional Services Consultant leads customer implementations, configures software solutions, conducts training, and collaborates with internal teams to enhance customer success in various industries.
Top Skills: Active DirectoryAlmEngineering SystemsEnterprise ApplicationsLdapPlmRest ApisSQL
An Hour Ago
Hybrid
City of London, City and County of the City of London, England, GBR
Senior level
Senior level
Fintech • Financial Services
The role involves implementing ALM models in Python, collaborating with quant teams, analyzing performance, and contributing to software development in an Agile environment.
Top Skills: C++ConfluenceGitJIRAPython
An Hour Ago
Remote or Hybrid
United Kingdom
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Design and build a canonical knowledge graph and ingestion framework: implement connectors, parsers, schema-to-model transforms, code generators, and entity-resolution systems. Lead cross-functional teams, ensure data quality and validation across a multi-backend graph architecture, mentor engineers and taxonomists, and support broader CTO organization with briefings and product demonstrations.
Top Skills: APIsGoGraphRust

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account