Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shiyi Cao's picture
1 9 1

Shiyi Cao

eva98
pestafford's profile picture Ligeng-Zhu's profile picture tudorizer's profile picture
·

AI & ML interests

None yet

Organizations

Efficient-Large-Model's profile picture LLaVA Internal's profile picture NovaSky's profile picture

upvoted 2 collections 6 months ago

NovaSky Papers

Collection
2 items • Updated Feb 21 • 3

Sky-T1-7B

Collection
A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated Feb 14 • 7
upvoted 2 papers 6 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 41
upvoted 2 papers 7 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 283
upvoted a paper 9 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60
upvoted a paper 12 months ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 53
upvoted a paper about 1 year ago

VILA^2: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 42
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs