Llama Nemotron Collection Open, Production-ready Enterprise Models • 11 items • Updated 6 days ago • 65
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers Paper • 2211.16056 • Published Nov 29, 2022 • 4
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models Paper • 2505.24133 • Published May 30 • 1
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Paper • 2507.11527 • Published Jul 15 • 31
No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published Dec 16, 2024 • 44
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Paper • 2410.08261 • Published Oct 10, 2024 • 53
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control Paper • 2403.04880 • Published Mar 7, 2024 • 6
LLM Inference Unveiled: Survey and Roofline Model Insights Paper • 2402.16363 • Published Feb 26, 2024 • 2
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences Paper • 2408.14468 • Published Aug 26, 2024 • 38
Magic-Me: Identity-Specific Video Customized Diffusion Paper • 2402.09368 • Published Feb 14, 2024 • 31
QD-BEV: Quantization-aware View-guided Distillation for Multi-view 3D Object Detection Paper • 2308.10515 • Published Aug 21, 2023 • 2