-
Natural Language Reinforcement Learning
Paper • 2411.14251 • Published • 31 -
Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value
Feature Extraction • Updated • 7 -
Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy
Feature Extraction • Updated • 9 -
Waterhorse/Llama-3.1-8B-Instruct-NLRL-Breakthrough-Value
Feature Extraction • Updated • 19
Bo Liu
Benjamin-eecs
AI & ML interests
Reinforcement Learning, Reasoning, Machine Learning Systems
Recent Activity
updated
a model
5 days ago
the-acorn-ai/Qwen3-4B-Base-4K-SimpleNegotiation-Self-Role-0531-Benjamin-step160
updated
a model
6 days ago
the-acorn-ai/Qwen3-4B-Base-4K-SimpleNegotiation-Self-Role-0531-Benjamin-step512
published
a model
6 days ago
the-acorn-ai/Qwen3-4B-Base-4K-SimpleNegotiation-Self-Role-0531-Benjamin-step512
Organizations
Collections
1
models
2
datasets
0
None public yet