Rajkumar rawal's picture

31 119

Rajkumar rawal PRO

rawalraj

·

https://rajkumarrawal.com.np/

AI & ML interests

AI & Blockchain

Recent Activity

upvoted a collection 1 day ago

Qwen3-Embedding

upvoted a collection 1 day ago

upvoted an article 3 days ago

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

View all activity

Organizations

rawalraj's activity

upvoted 2 collections 1 day ago

Qwen3-Embedding

6 items • Updated 1 day ago • 58

Qwen3-Reranker

3 items • Updated 1 day ago • 34

upvoted an article 3 days ago

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

By

and 1 other •

4 days ago

• 60

upvoted an article 4 days ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

By

and 6 others •

27 days ago

• 57

upvoted a collection 8 days ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 8 days ago • 237

upvoted an article 11 days ago

Article

How to Build an MCP Server with Gradio

By

and 1 other •

Apr 30

• 162

upvoted a collection 21 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 17 days ago • 141

upvoted an article 24 days ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

By

and 5 others •

25 days ago

• 67

upvoted a collection 26 days ago

Qwen3

40 items • Updated 17 days ago • 739

upvoted 2 papers about 1 month ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5 • 82

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 188

upvoted 2 articles about 1 month ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

Jan 23

• 68

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

By

and 4 others •

Mar 18

• 41

upvoted 4 collections about 1 month ago

Mellum

Series of code models by JetBrains • 5 items • Updated 16 days ago • 25

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated May 1 • 154

DeepSeek-Prover

DeepSeek-Prover-Series • 10 items • Updated Apr 30 • 54

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Apr 28 • 317

upvoted a paper about 1 month ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

upvoted a collection about 1 month ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65

upvoted a paper about 1 month ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 160