1 2 7

Abhranil Chandra

abhranil14

AI & ML interests

Reinforcement Learning, Deep Unsupervised Learning, NLP and Bayesian Deep Learning

Recent Activity

updated a model 3 days ago

abhranil14/gemma2_2B_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_2048

published a model 3 days ago

abhranil14/gemma2_2B_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_2048

updated a model 3 days ago

abhranil14/llama_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_1024

View all activity

Organizations

Find the current local time in any timezone

models 53

abhranil14/gemma2_2B_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_2048

Updated 3 days ago

abhranil14/llama_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_1024

Updated 3 days ago

abhranil14/llama_FF_self_distill_gold_batch256_lr10e-6_warmup0.1_correct_1perqs_new

Updated 4 days ago

abhranil14/Gemma2B_FF_on_qwen14B_wrong_2130_batch256_lr10e-6_warmup0.1_30_epoch_linear_lr

Updated 28 days ago

abhranil14/Qwen1.5B_FF_on_human_gold_7500_batch256_lr10e-6_warmup0.1_linear_lr

Updated 28 days ago

abhranil14/Gemma2B_FF_on_human_gold_7500_batch256_lr10e-6_warmup0.1

Updated Jul 2

abhranil14/Gemma_FF_on_Gemma27B_wrong_soln_wrt_human_1_soln_per_qs_6076_batch256_lr10e-6_warmup0.1

Updated Jul 2

abhranil14/Gemma2B_FF_on_qwen14B_gold_6158_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr

Updated Jul 2

abhranil14/Gemma2B_FF_on_gemma2B_self_distill_wrong_7044_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr

Updated Jun 26

abhranil14/Gemma2B_FF_on_gemma2B_self_distill_gold_1295_batch256_lr10e-6_warmup0.1_54_epoch_linear_lr

Updated Jun 26

View 53 models

datasets 5

abhranil14/VideoAgent_Data

Preview • Updated Jul 17 • 8

abhranil14/syn_qs_and_soln_cleaned_0_and_less20_multiple_soln_per_qs_1937545

Viewer • Updated May 12 • 1.94M • 2

abhranil14/syn_qs_and_soln_cleaned_0_and_less20_1_soln_per_qs_131845

Viewer • Updated May 12 • 132k

abhranil14/instruct-human-assistant-prompt-clean-105k

Viewer • Updated Sep 18, 2024 • 105k • 7

abhranil14/first-instruct-human-assistant-prompt-clean-33k

Viewer • Updated Sep 18, 2024 • 33.1k • 4

Abhranil Chandra

AI & ML interests

Recent Activity

Organizations

Collections 8

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Offline Reinforcement Learning for LLM Multi-Step Reasoning

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Offline Reinforcement Learning for LLM Multi-Step Reasoning

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Papers 5

spaces 1

First Agent Template

models 53

abhranil14/gemma2_2B_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_2048

abhranil14/llama_FF_gemini_flash_gold_7114_batch256_lr10e-6_warmup0.1_max_tokens_1024

abhranil14/llama_FF_self_distill_gold_batch256_lr10e-6_warmup0.1_correct_1perqs_new

abhranil14/Gemma2B_FF_on_qwen14B_wrong_2130_batch256_lr10e-6_warmup0.1_30_epoch_linear_lr

abhranil14/Qwen1.5B_FF_on_human_gold_7500_batch256_lr10e-6_warmup0.1_linear_lr

abhranil14/Gemma2B_FF_on_human_gold_7500_batch256_lr10e-6_warmup0.1

abhranil14/Gemma_FF_on_Gemma27B_wrong_soln_wrt_human_1_soln_per_qs_6076_batch256_lr10e-6_warmup0.1

abhranil14/Gemma2B_FF_on_qwen14B_gold_6158_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr

abhranil14/Gemma2B_FF_on_gemma2B_self_distill_wrong_7044_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr

abhranil14/Gemma2B_FF_on_gemma2B_self_distill_gold_1295_batch256_lr10e-6_warmup0.1_54_epoch_linear_lr

datasets 5

abhranil14/VideoAgent_Data

abhranil14/syn_qs_and_soln_cleaned_0_and_less20_multiple_soln_per_qs_1937545

abhranil14/syn_qs_and_soln_cleaned_0_and_less20_1_soln_per_qs_131845

abhranil14/instruct-human-assistant-prompt-clean-105k

abhranil14/first-instruct-human-assistant-prompt-clean-33k

Abhranil Chandra

AI & ML interests

Recent Activity

Organizations

Collections 8

Papers 5

spaces 1

First Agent Template

models 53 Sort: Recently updated

datasets 5 Sort: Recently updated

models 53

datasets 5