H4-colab

community

AI & ML interests

None defined yet.

Edward Beeching (edbeeching) authored a paper 5 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 47
edbeeching authored 2 papers over 1 year ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15, 2024 • 21

Godot Reinforcement Learning Agents

Paper • 2112.03636 • Published Dec 7, 2021 • 1
edbeeching authored a paper almost 2 years ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122