Luca Zhang

ZHHJemotion

AI & ML interests

VLM and MLLM

Recent Activity

upvoted a paper about 2 hours ago

DINOv3

upvoted a paper 19 days ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

upvoted a paper about 1 month ago

MMaDA: Multimodal Large Diffusion Language Models

View all activity

Organizations

None yet

upvoted a paper about 2 hours ago

DINOv3

Paper • 2508.10104 • Published 7 days ago • 117

upvoted a paper 19 days ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published 22 days ago • 124

upvoted a paper about 1 month ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 95

upvoted an article about 1 month ago

Article

SigLIP 2: A better multilingual vision language encoder

and 2 others •

Feb 21

• 178

upvoted 3 papers about 2 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Paper • 2507.02813 • Published Jul 3 • 60

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 129

upvoted an article about 2 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

and 3 others •

Dec 9, 2022

• 320

upvoted an article 2 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 208

upvoted 2 papers 3 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 177

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 182

upvoted 4 papers 4 months ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 63

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Paper • 2504.13820 • Published Apr 18 • 17

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 134

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published Mar 16 • 36

upvoted a paper 9 months ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 47

Luca Zhang

AI & ML interests

Recent Activity

Organizations

ZHHJemotion's activity

SigLIP 2: A better multilingual vision language encoder

Illustrating Reinforcement Learning from Human Feedback (RLHF)

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge