37 31 230

Kaizhao Liang PRO

kz919

https://kyleliang919.github.io/

AI & ML interests

Search = AGI?

Recent Activity

liked a model 3 days ago

ByteDance-Seed/BAGEL-7B-MoT

liked a model 5 days ago

apple/coreml-stable-diffusion-mixed-bit-palettization

liked a Space 9 days ago

hansyan/perflow-triposr

View all activity

Organizations

kz919's activity

upvoted a paper 16 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 123

upvoted an article 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.25k

upvoted 3 papers 4 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 50

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 142

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 30

upvoted 2 articles 4 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 862

Article

Welcome to Inference Providers on the Hub 🔥

and 6 others •

Jan 28

• 483

upvoted 2 papers 5 months ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 8

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 77

upvoted 2 papers 6 months ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 22

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 21

upvoted 2 papers 9 months ago

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 98

Memory-Efficient LLM Training with Online Subspace Descent

Paper • 2408.12857 • Published Aug 23, 2024 • 14

upvoted an article 10 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

and 2 others •

Apr 15, 2024

• 180

upvoted a paper 11 months ago

Longhorn: State Space Models are Amortized Online Learners

Paper • 2407.14207 • Published Jul 19, 2024 • 18

upvoted a paper 12 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 94

upvoted an article 12 months ago

Article

Putting RL back in RLHF

and 1 other •

Jun 12, 2024

• 92

upvoted 3 papers about 1 year ago

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

Paper • 2402.04347 • Published Feb 6, 2024 • 15

Towards Modular LLMs by Building and Reusing a Library of LoRAs

Paper • 2405.11157 • Published May 18, 2024 • 31

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

Paper • 2405.07518 • Published May 13, 2024 • 28