AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis Paper • 2504.13157 • Published Apr 17 • 21
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published Apr 11 • 125
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper • 2504.07866 • Published Apr 10 • 11
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published Apr 7 • 132
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15 • 28
AgentRxiv: Towards Collaborative Autonomous Research Paper • 2503.18102 • Published Mar 23 • 23
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning Paper • 2503.18769 • Published Mar 24 • 10
V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms Paper • 2503.17422 • Published Mar 21 • 6
Gemini Robotics: Bringing AI into the Physical World Paper • 2503.20020 • Published Mar 25 • 25
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Paper • 2503.21758 • Published Mar 27 • 22
Challenges and Paths Towards AI for Software Engineering Paper • 2503.22625 • Published Mar 28 • 4
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Paper • 2503.19901 • Published Mar 25 • 41
TransMamba: Flexibly Switching between Transformer and Mamba Paper • 2503.24067 • Published Mar 31 • 21
Real-is-Sim: Bridging the Sim-to-Real Gap with a Dynamic Digital Twin for Real-World Robot Policy Evaluation Paper • 2504.03597 • Published Apr 4 • 5
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 188