34 327

Shawon Ashraf

shawon

https://shawonashraf.github.io

AI & ML interests

Multi-Modal NLP, LLM and RAG

Recent Activity

liked a model about 14 hours ago

meta-llama/Llama-4-Scout-17B-16E

liked a model 1 day ago

distilbert/distilbert-base-uncased-finetuned-sst-2-english

upvoted a collection 6 days ago

Any-to-Any Models, Datasets, Spaces

shawon's activity

upvoted a collection 6 days ago

Any-to-Any Models, Datasets, Spaces

16 items • Updated 6 days ago • 19

upvoted 8 papers 21 days ago

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge

Paper • 2505.10468 • Published 23 days ago • 9

ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking

Paper • 2505.08581 • Published 25 days ago • 9

Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning

Paper • 2505.09738 • Published 24 days ago • 9

Style Customization of Text-to-Vector Generation with Image Diffusion Priors

Paper • 2505.10558 • Published 23 days ago • 15

Depth Anything with Any Prior

Paper • 2505.10565 • Published 23 days ago • 11

PointArena: Probing Multimodal Grounding Through Language-Guided Pointing

Paper • 2505.09990 • Published 23 days ago • 11

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published 23 days ago • 22

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 23 days ago • 118

upvoted an article 22 days ago

Vision Language Models (Better, Faster, Stronger)

Article • 26 days ago • 418

upvoted 2 collections about 1 month ago

Pleias-RAG

New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated about 15 hours ago • 50

upvoted an article about 2 months ago

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Article • Jul 29, 2024 • 329

upvoted a collection 4 months ago

SigLIP2

36 items • Updated 8 days ago • 74

upvoted a paper 4 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 232

upvoted a collection 6 months ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 170

upvoted a paper 7 months ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30, 2024 • 8

upvoted a collection 7 months ago

LongVU

7 items • Updated Oct 31, 2024 • 33

upvoted a paper 8 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 179

upvoted a collection 8 months ago

Image / Video Gen

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 37 items • Updated May 4 • 9