
peng

superpeng

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago
xl-zhao/PromptCoT-QwQ-Dataset
liked a dataset 3 months ago
Flmc/DISC-Med-SFT
liked a dataset 3 months ago
simplescaling/s1K-1.1

Organizations

None yet

Collections 5

LLM Pretrain
  • How to Train Data-Efficient LLMs

    Paper • 2402.09668 • Published Feb 15, 2024 • 43
  • Adapting Large Language Models via Reading Comprehension

    Paper • 2309.09530 • Published Sep 18, 2023 • 78
  • GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

    Paper • 2403.03507 • Published Mar 6, 2024 • 189
  • MathScale: Scaling Instruction Tuning for Mathematical Reasoning

    Paper • 2403.02884 • Published Mar 5, 2024 • 17
LLM Fine-Tune
  • BitDelta: Your Fine-Tune May Only Be Worth One Bit

    Paper • 2402.10193 • Published Feb 15, 2024 • 23
  • StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

    Paper • 2402.16671 • Published Feb 26, 2024 • 30
  • LoRA Learns Less and Forgets Less

    Paper • 2405.09673 • Published May 15, 2024 • 89
  • NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

    Paper • 2405.17428 • Published May 27, 2024 • 20

models 0

None public yet

datasets 0

None public yet