-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 5 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1
Hung Le
neurocoder
AI & ML interests
None yet
Organizations
None yet