Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
charlesniswander 's Collections
Reasoning systems

Reasoning systems

updated Mar 18
Upvote
1

  • Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

    Paper • 2412.18319 • Published Dec 24, 2024 • 40

  • Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling

    Paper • 2412.14860 • Published Dec 19, 2024 • 2

  • Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

    Paper • 2411.04282 • Published Nov 6, 2024 • 36

  • Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

    Paper • 2412.15797 • Published Dec 20, 2024 • 18

  • Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

    Paper • 2412.09078 • Published Dec 12, 2024

  • Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

    Paper • 2502.14768 • Published Feb 20 • 48

  • R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

    Paper • 2503.12937 • Published Mar 17 • 29
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs