Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

distily
/
distily_bitnet_gpt2

TensorBoard
Safetensors
Distily
gpt2
bitnet
1.58b
Generated from Trainer
Model card Files Files and versions Metrics Training metrics Community
distily_bitnet_gpt2 / logs
Ctrl+K
Ctrl+K
  • 1 contributor
History: 10 commits

This model has 1 file scanned as unsafe.

lapp0's picture
lapp0
End of training
f7be570 verified 12 months ago
  • attn_layer_mapper=last, attn_loss_fn=mse, attn_weight=1.0, lr_scheduler_type=cosine, warmup_ratio=0.5
    Training in progress, step 61875 12 months ago
  • attn_layer_mapper=layer-2, attn_loss_fn=cos, attn_weight=1.0, lr_scheduler_type=cosine, warmup_ratio=0.5
    End of training 12 months ago
  • attn_layer_mapper=layer-2, attn_loss_fn=mse, attn_weight=1.0, lr_scheduler_type=cosine, warmup_ratio=0.5
    Training in progress, step 61875 12 months ago
  • dataset_sample_size=1000000
    End of training 12 months ago
  • lr_scheduler_type=cosine, warmup_ratio=0.5
    Training in progress, step 61875 12 months ago
  • lr_scheduler_type=linear, warmup_ratio=0.5
    Training in progress, step 61875 12 months ago
  • completed.flag
    0 Bytes
    Training in progress, step 61875 12 months ago
  • events.out.tfevents.1724138424.5f530b1cf724
    29.7 MB
    LFS
    End of training 12 months ago
  • events.out.tfevents.1724152434.5f530b1cf724
    588 Bytes
    LFS
    Training in progress, step 61875 12 months ago