Quentin Tardif's picture

Quentin Tardif

ntnq

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Qwen/Qwen3-Embedding-0.6B-GGUF

upvoted a paper 19 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

upvoted a paper 19 days ago

System Prompt Optimization with Meta-Learning

View all activity

Organizations

ntnq's activity

upvoted 5 papers 19 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 24 days ago • 63

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published 24 days ago • 69

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 23 days ago • 78

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 23 days ago • 118

Qwen3 Technical Report

Paper • 2505.09388 • Published 24 days ago • 182

upvoted an article 25 days ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

26 days ago

• 418

upvoted a paper about 1 month ago

Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family

Paper • 2504.18225 • Published Apr 25 • 12

upvoted a collection about 1 month ago

Qwen3

40 items • Updated 17 days ago • 739

upvoted a paper about 1 month ago

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Paper • 2504.17789 • Published Apr 24 • 23

upvoted a paper about 2 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 74

upvoted 2 papers 2 months ago

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Paper • 2503.19693 • Published Mar 25 • 75

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 44

upvoted a collection 2 months ago

Llama Nemotron

Open, Production-ready Enterprise Models • 8 items • Updated about 14 hours ago • 59

upvoted a paper 2 months ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 51

upvoted 2 articles 3 months ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

By

and 3 others •

Mar 10

• 144

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 292

upvoted 2 collections 3 months ago

EuroBERT

Scaling Multilingual Encoders for European Languages • 4 items • Updated Mar 10 • 11

QwQ

Qwen with Questions • 6 items • Updated Apr 28 • 95

upvoted 2 papers 3 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 87