Ritvik Rastogi's picture

10 1

Ritvik Rastogi

Ritvik19

·

https://ritvik19.github.io

AI & ML interests

Machine Learning Deep Learning, Natural Language Processing, Computer Vision

Organizations

Ritvik19's activity

commented 2 papers 18 days ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published 28 days ago • 30 •

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published 28 days ago • 30 •

commented 8 papers about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94 •

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94 •

Process Reward Models That Think

Paper • 2504.16828 • Published Apr 23 • 16 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127 •

commented 3 papers about 2 months ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15 • 13 •

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15 • 13 •

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models

Paper • 2504.06214 • Published Apr 8 •

commented 2 papers 2 months ago

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Paper • 2503.20641 • Published Mar 26 • 8 •

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Paper • 2503.20641 • Published Mar 26 • 8 •

New activity in open-acc/README 7 months ago

[24/ 11] What are you working on this week! 💪

#2 opened 7 months ago by

New activity in Ritvik19/openhermes-danube2-sft-qlora about 1 year ago

Adding Evaluation Results

#1 opened about 1 year ago by

leaderboard-pr-bot

New activity in Ritvik19/Sudoku-Dataset over 1 year ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by

parquet-converter