Ziqi wang

wzq016

https://wzq016.github.io

AI & ML interests

NLP

Recent Activity

upvoted a paper about 1 month ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10 • 69

upvoted a paper about 2 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 86

upvoted a paper 2 months ago

Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance

Paper • 2506.06444 • Published Jun 6 • 74

upvoted a paper 3 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 135

upvoted a paper 4 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 78

upvoted a paper 6 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84

upvoted a paper 11 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 55

upvoted a paper about 1 year ago

Eliminating Position Bias of Language Models: A Mechanistic Approach

Paper • 2407.01100 • Published Jul 1, 2024 • 9

upvoted a collection about 1 year ago

Model Extrapolation Expedites Alignment

Collection

Better aligned models obtained by model extrapolation (ExPO) • 25 items • Updated May 27 • 17

upvoted a paper about 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11
