code_rlcef - a u-brixton Collection

code_rlcef

updated 6 days ago

OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

Paper • 2504.04030 • Published Apr 5

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published Mar 4 • 31

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 82

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

Paper • 2407.21077 • Published Jul 29, 2024 • 1

Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning

Paper • 2502.13820 • Published Feb 19

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 25

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Paper • 2505.23387 • Published 8 days ago • 7