Sugato Ray's picture

Sugato Ray PRO

sugatoray

·

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 7 hours ago

updated a collection about 7 hours ago

RLMs (Reasoning Language Models)

liked a model about 7 hours ago

Haoz0206/Omni-R1

View all activity

Organizations

sugatoray's activity

upvoted a paper about 7 hours ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 157

upvoted a collection about 7 hours ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 17 days ago • 141

upvoted a collection about 19 hours ago

Stokmark-2

3 items • Updated 4 days ago • 1

upvoted 2 collections 1 day ago

Table-R1 Datasets

5 items • Updated 8 days ago • 2

Table-R1 Models

5 items • Updated 8 days ago • 1

upvoted a paper 2 days ago

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published 9 days ago • 88

upvoted a collection 3 days ago

SmolVLA

Small, efficient and light-weight VLAs pretrained on community datasets • 1 item • Updated 6 days ago • 19

upvoted an article 4 days ago

Article

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

By

and 1 other •

5 days ago

• 23

upvoted a paper 8 days ago

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Paper • 2505.15778 • Published 17 days ago • 15

upvoted 4 papers 9 days ago

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Paper • 2505.20289 • Published 12 days ago • 10

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Paper • 2505.19000 • Published 13 days ago • 42

Asymptotics of Language Model Alignment

Paper • 2404.01730 • Published Apr 2, 2024 • 1

RL with KL penalties is better viewed as Bayesian inference

Paper • 2205.11275 • Published May 23, 2022 • 1

upvoted a collection 10 days ago

Gemma 3n Preview

2 items • Updated 8 days ago • 110

upvoted a collection 12 days ago

One-RL-to-See-Them-All

https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated 12 days ago • 12

upvoted a paper 12 days ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published 15 days ago • 59

upvoted a paper 14 days ago

Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!

Paper • 2505.15656 • Published 17 days ago • 14

upvoted an article 14 days ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

By

and 3 others •

15 days ago

• 122

upvoted 2 papers 16 days ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published 17 days ago • 85

Reward Reasoning Model

Paper • 2505.14674 • Published 18 days ago • 34