Joya Chen PRO

chenjoya

https://chenjoya.github.io/

AI & ML interests

Video LLM

chenjoya's activity

upvoted 4 papers 8 days ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published 10 days ago • 39

D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published 9 days ago • 34

SWE-bench Goes Live!

Paper • 2505.23419 • Published 9 days ago • 20

UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning

Paper • 2505.23380 • Published 9 days ago • 23

upvoted a paper 10 days ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published 11 days ago • 91

upvoted a paper 15 days ago

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Paper • 2505.16854 • Published 16 days ago • 11

upvoted a paper 18 days ago

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Paper • 2505.11896 • Published 21 days ago • 57

upvoted a paper 19 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 24 days ago • 182

upvoted a paper 25 days ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 27 days ago • 143

upvoted 2 papers about 1 month ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5 • 82

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 60

upvoted a paper about 2 months ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper • 2504.16030 • Published Apr 22 • 34

upvoted a collection about 2 months ago

LiveCC

Collection

Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) • 8 items • Updated Apr 23 • 4

upvoted a paper 2 months ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

upvoted 6 papers 3 months ago

DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles

Paper • 2503.03651 • Published Mar 5 • 16

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published Mar 3 • 44