Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LocalDoc
/
az-en-unigram-tokenizer-50k
like
1
Follow
LocalDoc
20
English
Azerbaijani
sentencepiece
unigram
tokenizer
azerbaijani
english
bilingual
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
az-en-unigram-tokenizer-50k
/
special_tokens_map.json
vrashad
Upload bilingual AZ-EN unigram tokenizer (50k vocab)
1630ba8
verified
3 months ago
raw
Copy download link
history
blame
contribute
delete
Safe
125 Bytes
{
"cls_token"
:
"[CLS]"
,
"mask_token"
:
"[MASK]"
,
"pad_token"
:
"[PAD]"
,
"sep_token"
:
"[SEP]"
,
"unk_token"
:
"[UNK]"
}