Hugging Face
Meng Li
ml-pku
AI & ML interests
None yet
Recent Activity
upvoted a paper 30 days ago
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
authored a paper about 2 months ago
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
authored a paper about 2 months ago
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
Organizations
None yet
Papers
9
arxiv:2504.05897
arxiv:2501.06807
arxiv:2408.10284
arxiv:2402.13485
models: 0 (none public yet)
datasets: 0 (none public yet)