Article: How to generate text: using different decoding methods for language generation with Transformers • By patrickvonplaten • Mar 1, 2020 • 237
Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Paper • 2507.07982 • Published Jul 10 • 32
Playing with Transformer at 30+ FPS via Next-Frame Diffusion Paper • 2506.01380 • Published Jun 2 • 1
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published Apr 11 • 40
VidTok: A Versatile and Open-Source Video Tokenizer Paper • 2412.13061 • Published Dec 17, 2024 • 8
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement Paper • 2406.08096 • Published Jun 12, 2024
IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI Paper • 2411.00785 • Published Oct 17, 2024 • 8
Memories are One-to-Many Mapping Alleviators in Talking Face Generation Paper • 2212.05005 • Published Dec 9, 2022
End-to-End Rate-Distortion Optimized 3D Gaussian Representation Paper • 2406.01597 • Published Apr 9, 2024
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder Paper • 2303.17550 • Published Mar 30, 2023