Tencent
Palo Alto, California, US

Research Intern – Video World Models

Onsite$80,168 – $124,800/yrPosted 2 weeks agoLinkedIn

Skip the busywork

ApplyBolt rewrites your resume for this exact role and hits submit. You just pick the jobs.

Resume tailored to this roleApplied in secondsTrack every application
Download the app

About this role

Business Unit

What the Role Entails

About the Position

We are seeking an exceptional Research Intern to join our team in building the next generation of video world models. While traditional generative models focus on creating passive video (text-to-video), our mission is to build "World Models"—foundation models that understand physics, causality, and dynamics directly from large-scale data, and can be explored and interacted in real-time. You will work at the frontier of generative AI research, enabling the model to "dream" and interact with complex virtual worlds.

Who We Look For

Requirements:

  • Currently pursuing a PhD (or Master’s degree with strong research track record) in Computer Science, Machine Learning, or a related field.
  • Strong proficiency in Python and a deep learning framework (PyTorch or JAX). Experience in large-scale machine learning systems is a great plus.
  • Deep understanding of Generative Models (Diffusion, Transformers, VAEs, Auto-regressive models).
  • Publication Record: First-author publications in top AI venues (CVPR, ICCV, NeurIPS, ICML, ICLR, etc.).