minju's picture

1 17

minju

iaminju

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

upvoted a paper 16 days ago

FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA

upvoted a paper 16 days ago

Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction

View all activity

Organizations

Papers 1

arxiv:2504.17192

models 13

iaminju/rlpvr_pref_only

Updated Mar 28 • 3

iaminju/rlpvr_math_only

Updated Mar 28 • 1

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_2

Updated Feb 28 • 1

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k

Updated Feb 27 • 4

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_10k

Text Generation • Updated Feb 26 • 7

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_1k

Text Generation • Updated Feb 26 • 9

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s_pref

Text Generation • Updated Feb 25 • 11

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref

Text Generation • Updated Feb 25 • 7

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_nq_s

datasets 1

iaminju/paper2code

Viewer • Updated May 1 • 90 • 120 • 1