忍者

byteprobe

AI & ML interests

RL | NLP | LLM | multimodal | evaluations | agents

Recent Activity

liked a model about 2 months ago

moonshotai/Kimi-VL-A3B-Thinking-2506

upvoted a changelog about 2 months ago

Organization and User profiles now include repository listing pages

liked a dataset about 2 months ago

nvidia/OpenScience

View all activity

Organizations

upvoted a changelog about 2 months ago

Changelog

Organization and User profiles now include repository listing pages

Jun 20

• 124

upvoted 8 papers 2 months ago

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12 • 74

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 98

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 261

Magistral

Paper • 2506.10910 • Published Jun 12 • 63

upvoted 4 changelogs 2 months ago

Changelog

Add MCP-Compatible Spaces to Your Tools

Jun 17

• 80

Changelog

New Model Filtering Options on the Hub

Jun 16

• 72

Changelog

New Inference Providers Dashboard

Jun 5

• 62

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6

• 105

upvoted 7 papers 3 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 55

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published May 16 • 76

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20 • 76

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 89

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26 • 88

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 122

忍者

AI & ML interests

Recent Activity

Organizations

byteprobe's activity

Organization and User profiles now include repository listing pages

Add MCP-Compatible Spaces to Your Tools

New Model Filtering Options on the Hub

New Inference Providers Dashboard

Connect Your MCP Client to the Hugging Face Hub