sft_24_best - a u-brixton Collection

u-brixton 's Collections

math

foundation_models

alignment_24_best

monte_carlo_24_best

sft_24_best

updated Sep 23, 2024

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

Paper • 2402.04833 • Published Feb 7, 2024 • 5
A Closer Look at the Limitations of Instruction Tuning

Paper • 2402.05119 • Published Feb 3, 2024 • 5
STaR-GATE: Teaching Language Models to Ask Clarifying Questions

Paper • 2403.19154 • Published Mar 28, 2024
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Paper • 2407.10457 • Published Jul 15, 2024 • 25