4 244 51

Charles I Niswander II

charlesniswander

dhar174

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

upvoted a paper 4 days ago

Large Language Models are Locally Linear Mappings

upvoted a paper 13 days ago

RLVR-World: Training World Models with Reinforcement Learning

View all activity

Organizations

None yet

charlesniswander's activity

upvoted a paper 3 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 7 days ago • 112

upvoted a paper 4 days ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published 8 days ago • 11

upvoted a paper 13 days ago

RLVR-World: Training World Models with Reinforcement Learning

Paper • 2505.13934 • Published 18 days ago • 14

upvoted 2 papers 17 days ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published 18 days ago • 78

Simple Semi-supervised Knowledge Distillation from Vision-Language Models via texttt{D}ual-texttt{H}ead texttt{O}ptimization

Paper • 2505.07675 • Published 25 days ago • 19

upvoted a paper 20 days ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 22 days ago • 78

upvoted a paper 21 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 22 days ago • 118

upvoted a paper 24 days ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published 25 days ago • 78

upvoted a paper 27 days ago

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Paper • 2505.02847 • Published May 1 • 27

upvoted a paper 30 days ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3 • 35

upvoted 4 papers about 1 month ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28 • 36

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 92

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 168

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Paper • 2504.15785 • Published Apr 22 • 19

upvoted 4 papers about 2 months ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7 • 25

upvoted 2 papers 2 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 38

Effectively Controlling Reasoning Models through Thinking Intervention

Paper • 2503.24370 • Published Mar 31 • 19