2 174 83

Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

Recent Activity

updated a dataset 6 days ago

rbiswasfc/zotero-answer-ai-texts

updated a dataset 6 days ago

rbiswasfc/zotero-answer-ai-images

upvoted a paper 8 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

View all activity

Organizations

liked a model 15 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated 21 days ago • 18.4k • 178

liked 2 Spaces about 1 month ago

KVPress Leaderboard

🥇

KVPress leaderboard: benchmark KV Cache compression methods

DeepResearch Bench

🔍

Display a leaderboard for DeepResearch Bench

liked 2 models 3 months ago

PleIAs/Pleias-350m-Preview

0.4B • Updated Feb 14 • 878 • 24

lightonai/Reason-ModernColBERT

liked a dataset 3 months ago

mohsenfayyaz/ColDeR

Viewer • Updated 25 days ago • 2k • 274 • 6

liked 3 models 4 months ago

liked a model 5 months ago

nvidia/GR00T-N1-2B

Robotics • 2B • Updated Jul 8 • 1.15k • 327

liked 2 datasets 5 months ago

qihoo360/Light-R1-SFTData

Viewer • Updated Mar 17 • 79.4k • 299 • 52

open-r1/codeforces

Viewer • Updated May 19 • 34.8k • 11.3k • 65

liked a model 6 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Feb 24 • 1.69M • • 696

liked a Space 6 months ago

3.08k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 3 datasets 6 months ago

PygmalionAI/PIPPA

Updated Sep 7, 2023 • 156 • 223

lmarena-ai/arena-human-preference-100k

Viewer • Updated Feb 11 • 106k • 628 • 48

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15 • 25k • 23.4k • 39

liked a model 6 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24 • 712k • • 1.31k

liked 2 datasets 6 months ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 3.13k • 156

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 11.1k • 635