minju
iaminju
7 followers · 4 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper 10 days ago: Distilling LLM Agent into Small Models with Retrieval and Code Tools
Upvoted a paper 16 days ago: FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Upvoted a paper 16 days ago: Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction
Organizations
Papers
1
arxiv:2504.17192
models
13
iaminju/rlpvr_pref_only • Updated Mar 28 • 3
iaminju/rlpvr_math_only • Updated Mar 28 • 1
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_3 • Updated Feb 28
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_2 • Updated Feb 28 • 1
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k • Updated Feb 27 • 4
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_10k • Text Generation • Updated Feb 26 • 7
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_1k • Text Generation • Updated Feb 26 • 9
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s_pref • Text Generation • Updated Feb 25 • 11
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref • Text Generation • Updated Feb 25 • 7
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_nq_s • Updated Feb 25
datasets
1
iaminju/paper2code • Viewer • Updated May 1 • 90 • 120 • 1