腾讯(tencent)招聘Research Intern – Video World Models (Research & ML Systems) 106904
招聘职位:
Research Intern – Video World Models (Research & ML Systems) 106904 搜索同类职位
岗位职责:
Business Unit
What the Role Entails
About the Position
We are seeking an exceptional Research Intern to join our core team in building the next generation of interactive Video World Models. While traditional generative AI focuses on generating passive pixels (e.g., text-to-video), our mission is fundamentally more ambitious: we are building foundational "World Models" that inherently understand physics, causality, action spaces, and complex dynamics directly from internet-scale data. Our goal is to train models that can simulate and "dream" complex virtual worlds, allowing users and agents to explore and interact with them in real time.
This is not a purely theoretical role.
Training interactive world models at this scale requires pushing the absolute limits of modern GPUs. We operate at the intersection of cutting-edge generative AI research and high-performance machine learning systems. We are looking for "full-stack" hacker-researchers—visionary thinkers who are also elite engineers, capable of co-designing novel neural architectures and engineering the highly optimized infrastructure required to train them across thousands of GPUs.What You Will DoArchitect & Scale Foundation Models: Design, train, and scale state-of-the-art interactive world models (combining Diffusion, Autoregressive Transformers, VAEs, LLMs, VLMs) on massive video datasets.
Push the Boundaries of ML Systems: Architect highly scalable distributed training pipelines, utilizing advanced model and data parallelism to train massive models efficiently on large-scale GPU clusters.
Optimize for Efficiency: Profile and optimize model architectures to break through memory and compute bottlenecks. Write high-performance, custom hardware kernels to maximize Model FLOPs Utilization (MFU) and enable real-time, low-latency inference.
Who We Look For
RequirementsAcademic Excellence: Currently pursuing a PhD (or Master’s degree with a truly exceptional research/engineering track record) in Computer Science, Machine Learning, Computer Architecture, or a related field.
Engineering Skills: Exceptional, production-level coding proficiency in Python or other languages. Background in competitive programming is a great plus.
AI Infrastructure & Scaling: Experience with modern AI infrastructure stack and large-scale machine learning systems, such as PyTorch FSDP, Megatron, etc. Experience with GPU kernels using CUDA and/or Triton is a great plus.
Deep Generative Expertise: Thorough theoretical and practical understanding of modern generative paradigms (Diffusion, Vision Transformers, Autoregressive sequence modeling, discrete tokenization/VAEs).
Top-Tier Publication Record: First-author publications in top-tier AI venues (NeurIPS, ICLR, ICML, CVPR, ICCV) OR premier ML Systems venues (MLSys, OSDI, ASPLOS).
Location State(s)
US-California-Palo Alto
The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience.
This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Work Location: US-California-Palo Alto
岗位要求:
岗位要求详情请见上方岗位职责内说明
Business Unit
What the Role Entails
About the Position
We are seeking an exceptional Research Intern to join our core team in building the next generation of interactive Video World Models. While traditional generative AI focuses on generating passive pixels (e.g., text-to-video), our mission is fundamentally more ambitious: we are building foundational "World Models" that inherently understand physics, causality, action spaces, and complex dynamics directly from internet-scale data. Our goal is to train models that can simulate and "dream" complex virtual worlds, allowing users and agents to explore and interact with them in real time.
This is not a purely theoretical role.
Training interactive world models at this scale requires pushing the absolute limits of modern GPUs. We operate at the intersection of cutting-edge generative AI research and high-performance machine learning systems. We are looking for "full-stack" hacker-researchers—visionary thinkers who are also elite engineers, capable of co-designing novel neural architectures and engineering the highly optimized infrastructure required to train them across thousands of GPUs.What You Will DoArchitect & Scale Foundation Models: Design, train, and scale state-of-the-art interactive world models (combining Diffusion, Autoregressive Transformers, VAEs, LLMs, VLMs) on massive video datasets.
Push the Boundaries of ML Systems: Architect highly scalable distributed training pipelines, utilizing advanced model and data parallelism to train massive models efficiently on large-scale GPU clusters.
Optimize for Efficiency: Profile and optimize model architectures to break through memory and compute bottlenecks. Write high-performance, custom hardware kernels to maximize Model FLOPs Utilization (MFU) and enable real-time, low-latency inference.
Who We Look For
RequirementsAcademic Excellence: Currently pursuing a PhD (or Master’s degree with a truly exceptional research/engineering track record) in Computer Science, Machine Learning, Computer Architecture, or a related field.
Engineering Skills: Exceptional, production-level coding proficiency in Python or other languages. Background in competitive programming is a great plus.
AI Infrastructure & Scaling: Experience with modern AI infrastructure stack and large-scale machine learning systems, such as PyTorch FSDP, Megatron, etc. Experience with GPU kernels using CUDA and/or Triton is a great plus.
Deep Generative Expertise: Thorough theoretical and practical understanding of modern generative paradigms (Diffusion, Vision Transformers, Autoregressive sequence modeling, discrete tokenization/VAEs).
Top-Tier Publication Record: First-author publications in top-tier AI venues (NeurIPS, ICLR, ICML, CVPR, ICCV) OR premier ML Systems venues (MLSys, OSDI, ASPLOS).
Location State(s)
US-California-Palo Alto
The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience.
This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Work Location: US-California-Palo Alto
岗位要求:
岗位要求详情请见上方岗位职责内说明
免责声明:
此信息由腾讯官网 (查看来源)审核并发布,我们转载该信息,仅出于传递更多就业招聘资讯、促进大学生及广大求职者就业之目的。该招聘职位信息的真实性、准确性、时效性及合法性均由原始发布方“腾讯官网”负责。我们作为信息转载平台,不构成求职建议,不涉及任何职业中介服务,不对其内容承担任何形式的保证责任。请用户在使用转载信息时保持审慎,自行判断并承担相应风险,求职请认准企业官方渠道!