15 660 263

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper about 9 hours ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper 4 days ago

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

View all activity

Organizations

taufiqdp's activity

upvoted a paper about 9 hours ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published 1 day ago • 22

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 4 days ago • 127

upvoted a paper 4 days ago

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

Paper • 2505.24878 • Published 7 days ago • 21

liked a model 9 days ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • Updated 8 days ago • 74.7k • • 1.8k

upvoted a paper 10 days ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published 10 days ago • 91

upvoted 3 papers 11 days ago

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Paper • 2505.17873 • Published 14 days ago • 30

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published 17 days ago • 73

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published 14 days ago • 85

upvoted 2 papers 15 days ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published 16 days ago • 85

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published 17 days ago • 72

upvoted a paper 16 days ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 17 days ago • 129

upvoted a collection 17 days ago

Gemma 3n Preview

Collection

2 items • Updated 7 days ago • 110

upvoted a paper 18 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

upvoted 2 papers 21 days ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published 23 days ago • 69

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 22 days ago • 118

upvoted 2 papers 22 days ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published 25 days ago • 124

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 23 days ago • 63

upvoted an article 24 days ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

26 days ago

• 417

upvoted 2 papers 25 days ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published 25 days ago • 26

DanceGRPO: Unleashing GRPO on Visual Generation

Paper • 2505.07818 • Published 25 days ago • 29