
Samuel Arcadinho

SSamDav

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published 19 days ago • 215

upvoted 2 papers about 1 month ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15 • 25

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

Paper • 2507.07955 • Published Jul 10 • 23

commented on a paper about 1 month ago

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

Paper • 2507.07955 • Published Jul 10 • 23

upvoted a paper about 1 month ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1 • 77

commented on a paper about 1 month ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 60

upvoted a paper about 1 month ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 60

commented on a paper about 2 months ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 60

upvoted a paper 2 months ago

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17 • 45

upvoted 6 papers 4 months ago

commented on a paper 4 months ago

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Paper • 2504.03601 • Published Apr 4 • 17

upvoted 4 papers 5 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 165

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 153

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 168

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 47
