AK's picture

AK PRO

akhaliq

·

_akhaliq

AI & ML interests

None yet

Recent Activity

liked a model about 4 hours ago

fluxions/vui

upvoted a collection 1 day ago

ReasonFLux-Coder

liked a model 3 days ago

lerobot/smolvla_base

View all activity

Organizations

akhaliq's activity

upvoted a collection 1 day ago

ReasonFLux-Coder

Coding LLMs excel at both writing code and generating unit tests. • 9 items • Updated 11 days ago • 6

upvoted 2 collections 10 days ago

Enigmata

Resources for the Enigmata Project: https://seed-enigmata.github.io. • 4 items • Updated 10 days ago • 1

ARM

8 items • Updated 10 days ago • 3

upvoted a collection 13 days ago

Deepseek Papers

Deepseek papers collection • 24 items • Updated 8 days ago • 252

upvoted a collection 15 days ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

7 items • Updated 15 days ago • 3

upvoted a paper 16 days ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 16 days ago • 129

upvoted a paper 21 days ago

LightLab: Controlling Light Sources in Images with Diffusion Models

Paper • 2505.09608 • Published 22 days ago • 31

upvoted a paper 22 days ago

Fast Text-to-Audio Generation with Adversarial Post-Training

Paper • 2505.08175 • Published 24 days ago • 22

upvoted a collection 29 days ago

ZeroSearch_google

8 items • Updated 10 days ago • 28

upvoted 2 collections about 1 month ago

LLaMA-Omni

13 items • Updated 20 days ago • 16

LiveCC

Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) • 8 items • Updated Apr 23 • 4

upvoted an article about 2 months ago

Article

17 Reasons Why Gradio Isn't Just Another UI Library

By

and 1 other •

Apr 16

• 37

upvoted a paper about 2 months ago

Towards Learning to Complete Anything in Lidar

Paper • 2504.12264 • Published Apr 16 • 10

upvoted a collection about 2 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65

upvoted 5 collections 2 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 522

LeX-Art

8 items • Updated Apr 1 • 4

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 16 days ago • 139

PP-VCtrl

10 items • Updated Mar 17 • 2

Open-RS

Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated Mar 21 • 12

upvoted a collection 3 months ago

JARVIS-VLA-v1

Vision-Language-Action Models in Minecraft. • 4 items • Updated Mar 22 • 11