Yuansheng Ni

yuanshengni

https://yuanshengni.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper about 4 hours ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

upvoted a paper 1 day ago

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

authored a paper 1 day ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

yuanshengni's activity

upvoted a paper about 4 hours ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published 2 days ago • 20

upvoted a paper 1 day ago

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published 3 days ago • 16

upvoted a collection 1 day ago

VisCoder

Collection

6 items • Updated 1 day ago • 1

upvoted a paper 10 days ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Paper • 2505.15929 • Published 16 days ago • 48

upvoted 2 papers 15 days ago

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published 16 days ago • 39

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published 16 days ago • 51

upvoted a paper 25 days ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published 25 days ago • 26

upvoted 2 papers 2 months ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 43

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 134

upvoted 3 papers 3 months ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published Mar 14 • 20

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 66

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26 • 49

upvoted 4 papers 4 months ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 59

upvoted a paper 6 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 28

upvoted a paper 7 months ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11, 2024 • 50

upvoted a paper 8 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 39

upvoted a paper 9 months ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 31