Zhen Dong

zhendongucb

https://dong-zhen.com/

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

nvidia/Llama-Nemotron-VLM-Dataset-v1

liked a dataset 13 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

liked a model 13 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

upvoted 2 collections 13 days ago

NexusRaven V2

11 items • Updated Dec 14, 2023 • 3

Llama Nemotron

Open, Production-ready Enterprise Models • 11 items • Updated 6 days ago • 65

upvoted 4 papers about 1 month ago

NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers

Paper • 2211.16056 • Published Nov 29, 2022 • 4

PB-LLM: Partially Binarized Large Language Models

Paper • 2310.00034 • Published Sep 29, 2023 • 2

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Paper • 2505.24133 • Published May 30 • 1

DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15 • 31

upvoted a paper 8 months ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 44

upvoted 2 papers 10 months ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 53

An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control

Paper • 2403.04880 • Published Mar 7, 2024 • 6

upvoted 3 papers 12 months ago

LLM Inference Unveiled: Survey and Roofline Model Insights

Paper • 2402.16363 • Published Feb 26, 2024 • 2

Q-Diffusion: Quantizing Diffusion Models

Paper • 2302.04304 • Published Feb 8, 2023 • 4

K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences

Paper • 2408.14468 • Published Aug 26, 2024 • 38

upvoted a paper about 1 year ago

Integrating View Conditions for Image Synthesis

Paper • 2310.16002 • Published Oct 24, 2023 • 3

upvoted a paper over 1 year ago

Magic-Me: Identity-Specific Video Customized Diffusion

Paper • 2402.09368 • Published Feb 14, 2024 • 31

upvoted 2 papers almost 2 years ago

SqueezeLLM: Dense-and-Sparse Quantization

Paper • 2306.07629 • Published Jun 13, 2023 • 4

QD-BEV: Quantization-aware View-guided Distillation for Multi-view 3D Object Detection

Paper • 2308.10515 • Published Aug 21, 2023 • 2