Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
u-brixton 's Collections
code_rlcef
emnlp 2023
emlnp 2023 tbd
math
foundation_models
alignment_24_best
monte_carlo_24_best
sft_24_best

code_rlcef

updated 6 days ago
Upvote
-

  • OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

    Paper • 2504.04030 • Published Apr 5

  • KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

    Paper • 2503.02951 • Published Mar 4 • 31

  • BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

    Paper • 2406.15877 • Published Jun 22, 2024 • 48

  • Magicoder: Source Code Is All You Need

    Paper • 2312.02120 • Published Dec 4, 2023 • 82

  • Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

    Paper • 2407.21077 • Published Jul 29, 2024 • 1

  • Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning

    Paper • 2502.13820 • Published Feb 19

  • SelfCodeAlign: Self-Alignment for Code Generation

    Paper • 2410.24198 • Published Oct 31, 2024 • 25

  • Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

    Paper • 2505.23387 • Published 8 days ago • 7
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs