1 31 9

Yan Varakin

ZDPLI

https://www.researchgate.net/profile/Yan-Varakin

ZDPLI

AI & ML interests

All areas of NLP, computational mathematics, reinforcement learning, robotics.

Recent Activity

updated a Space 11 days ago

ZDPLI/SkinLesionClassifierHAM10K

upvoted an article 27 days ago

StackLLaMA: A hands-on guide to train LLaMA with RLHF

upvoted an article 27 days ago

Fine-tune Llama 2 with DPO

View all activity

Organizations

ZDPLI's activity

upvoted 2 articles 27 days ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

and 6 others •

Apr 5, 2023

• 37

Article

Fine-tune Llama 2 with DPO

and 2 others •

Aug 8, 2023

• 54

upvoted a paper 27 days ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 48

upvoted an article 28 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 148

upvoted 4 papers 29 days ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 37

upvoted a paper about 1 month ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29 • 91

upvoted 3 papers 4 months ago

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21 • 7

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 75

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28

upvoted 2 papers 5 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 92

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 28

upvoted 2 papers 6 months ago

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Paper • 2412.01822 • Published Dec 2, 2024 • 15

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Paper • 2411.15139 • Published Nov 22, 2024 • 15

upvoted a paper 7 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 125

upvoted an article 7 months ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

•

Nov 19, 2024

• 12

upvoted a collection 7 months ago

Medical QA Datasets

Collection

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 41

upvoted a paper 7 months ago

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Paper • 2410.21845 • Published Oct 29, 2024 • 14