monte_carlo_24_best - a u-brixton Collection

u-brixton 's Collections

math

foundation_models

alignment_24_best

monte_carlo_24_best

monte_carlo_24_best

updated Jan 2

Planning Like Human: A Dual-process Framework for Dialogue Planning

Paper • 2406.05374 • Published Jun 8, 2024
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents

Paper • 2311.00262 • Published Nov 1, 2023
Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning

Paper • 2403.06769 • Published Mar 11, 2024
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning

Paper • 2305.13660 • Published May 23, 2023
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Paper • 2410.06508 • Published Oct 9, 2024 • 10
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13, 2024 • 21
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Paper • 2405.00451 • Published May 1, 2024