AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Paper • 2505.10468 • Published 23 days ago • 9
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper • 2505.08581 • Published 25 days ago • 9
Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning Paper • 2505.09738 • Published 24 days ago • 9
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper • 2505.10558 • Published 23 days ago • 15
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper • 2505.09990 • Published 23 days ago • 11
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning Paper • 2505.10320 • Published 23 days ago • 22
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published 23 days ago • 118
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 26 days ago • 418
Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated about 15 hours ago • 50
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 329
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 232
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 170
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Paper • 2410.23320 • Published Oct 30, 2024 • 8
Image / Video Gen Collection Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 37 items • Updated May 4 • 9