Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
Luckeciano Carvalho Melo
luckeciano
Follow
0 followers
·
2 following
https://luckeciano.github.io
LuckecianoMelo
luckeciano
AI & ML interests
Reinforcement Learning
Recent Activity
published
a model
about 12 hours ago
luckeciano/Qwen-2.5-7B-GRPO-Base_2293
published
a model
about 12 hours ago
luckeciano/Qwen-2.5-7B-GRPO-Base_4831
updated
a model
about 16 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_441
View all activity
Organizations
Papers
1
arxiv:
2206.06614
models
201
Sort: Recently updated
luckeciano/Qwen-2.5-7B-GRPO-Base_2293
Updated
about 12 hours ago
luckeciano/Qwen-2.5-7B-GRPO-Base_4831
Updated
about 12 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_441
Text Generation
•
Updated
about 16 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_889
Text Generation
•
Updated
about 19 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_996
Text Generation
•
Updated
about 21 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_638
Text Generation
•
Updated
about 23 hours ago
•
2
luckeciano/Qwen-2.5-7B-GRPO-Base_902
Text Generation
•
Updated
1 day ago
•
2
luckeciano/Qwen-2.5-7B-GRPO-Base-NoAdvNorm_666
Text Generation
•
Updated
1 day ago
•
6
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_117
Text Generation
•
Updated
1 day ago
•
2
luckeciano/Qwen-2.5-7B-GRPO-Base_881
Text Generation
•
Updated
1 day ago
•
4
Expand 201 models
datasets
10
Sort: Recently updated
luckeciano/mistral8x22b-reddit-post-features
Viewer
•
Updated
May 10, 2024
•
92.9k
•
349
luckeciano/llama370b-reddit-post-features
Viewer
•
Updated
May 10, 2024
•
82.5k
•
288
luckeciano/llama370b-features-reddit
Viewer
•
Updated
May 7, 2024
•
150k
•
20
luckeciano/mistral8x22b-features-reddit
Viewer
•
Updated
Apr 22, 2024
•
166k
•
24
luckeciano/hermes-reddit-post-features
Viewer
•
Updated
Apr 18, 2024
•
92.7k
•
824
luckeciano/llama27b-features-reddit
Viewer
•
Updated
Apr 13, 2024
•
189k
•
29
luckeciano/falcon7b-features-reddit
Viewer
•
Updated
Apr 13, 2024
•
159k
•
26
luckeciano/hermes-features-ultrafeedback
Viewer
•
Updated
Mar 7, 2024
•
63.8k
•
28
luckeciano/reddit-features-hermes
Viewer
•
Updated
Feb 13, 2024
•
169k
•
25
luckeciano/learning-to-summarize
Viewer
•
Updated
Jan 17, 2024
•
426k
•
54