Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pico-lm
/
pico-decoder-large
like
0
Follow
Pico Language Model
44
Text Generation
Safetensors
pico-lm/pretokenized-dolma
English
pico_decoder
custom_code
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
pico-decoder-large
/
fabric_state
/
checkpoint
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
rdiehlmartinez
pico-decoder-large-1 trained to 100k steps
752b9ff
3 days ago
bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.utils.tensor_fragment.fragment_address"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"deepspeed.utils.tensor_fragment.fragment_address"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"torch._utils._rebuild_tensor_v2"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.utils.tensor_fragment.fragment_address"
,
"collections.OrderedDict"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"torch.FloatStorage"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.utils.tensor_fragment.fragment_address"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"deepspeed.utils.tensor_fragment.fragment_address"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"deepspeed.utils.tensor_fragment.fragment_address"
,
"torch.FloatStorage"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt
pickle
Detected Pickle imports (6)
"deepspeed.utils.tensor_fragment.fragment_address"
,
"deepspeed.runtime.fp16.loss_scaler.LossScaler"
,
"torch._utils._rebuild_tensor_v2"
,
"deepspeed.runtime.zero.config.ZeroStageEnum"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
How to fix it?
427 MB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago
mp_rank_00_model_states.pt
pickle
Detected Pickle imports (5)
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch.Size"
,
"__builtin__.set"
How to fix it?
1.14 GB
LFS
pico-decoder-large-1 trained to 100k steps
3 days ago