Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ruisi Cai's picture
1 4

Ruisi Cai

CCCCRS
·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
authored a paper 5 months ago
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
upvoted a paper 5 months ago
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
View all activity

Organizations

DeepMamba's profile picture

CCCCRS's activity

upvoted a paper about 2 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 92
upvoted a paper 5 months ago

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Paper • 2501.00658 • Published Dec 31, 2024 • 7
upvoted a paper 7 months ago

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Paper • 2410.19123 • Published Oct 24, 2024 • 15
upvoted a paper 11 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 40
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs