tokenizers: Fast State-of-the-Art Tokenizers optimized for Research and Production1
Provides an implementation of today's most used tokenizers, with a focus on
performance and versatility.
... part of T2,
get it here
URL: https://github.com/huggingface/tokenizers
Author: Anthony MOI <m [dot] anthony [dot] moi [at] gmail [dot] com>
Maintainer: The T2 Project <t2 [at] t2-project [dot] org>
License: APL
Status: Stable
Version: 0.22.0
Download: https://github.com/huggingface/tokenizers/ tokenizers-0.22.0.tar.gz
T2 source: tokenizers.cache
T2 source: tokenizers.desc
Build time (on reference hardware): 120% (relative to binutils)2
Installed size (on reference hardware): 9.66 MB, 43 files
Dependencies (build time detected):
bash
coreutils
curl
diffutils
gawk
grep
gzip
linux-header
openssl
pyrequests
python
python-gpep517
python-maturin
rustc
sed
tar
Installed files (on reference hardware):
[show]
usr/lib64/python3.12/site-packages/tokenizers
usr/lib64/python3.12/site-packages/tokenizers-0.21.0.dist-info
usr/lib64/python3.12/site-packages/tokenizers-0.21.0.dist-info/METADATA
usr/lib64/python3.12/site-packages/tokenizers-0.21.0.dist-info/RECORD
usr/lib64/python3.12/site-packages/tokenizers-0.21.0.dist-info/WHEEL
usr/lib64/python3.12/site-packages/tokenizers-0.22.0.dist-info
usr/lib64/python3.12/site-packages/tokenizers-0.22.0.dist-info/METADATA
usr/lib64/python3.12/site-packages/tokenizers-0.22.0.dist-info/RECORD
usr/lib64/python3.12/site-packages/tokenizers-0.22.0.dist-info/WHEEL
usr/lib64/python3.12/site-packages/tokenizers/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/__init__.pyi
usr/lib64/python3.12/site-packages/tokenizers/decoders
usr/lib64/python3.12/site-packages/tokenizers/decoders/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/decoders/__init__.pyi
usr/lib64/python3.12/site-packages/tokenizers/implementations
usr/lib64/python3.12/site-packages/tokenizers/implementations/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/implementations/base_tokenizer.py
usr/lib64/python3.12/site-packages/tokenizers/implementations/bert_wordpiece.py
usr/lib64/python3.12/site-packages/tokenizers/implementations/byte_level_bpe.py
usr/lib64/python3.12/site-packages/tokenizers/implementations/char_level_bpe.py
usr/lib64/python3.12/site-packages/tokenizers/implementations/sentencepiece_bpe.py
usr/lib64/python3.12/site-packages/tokenizers/implementations/sentencepiece_unigram.py
usr/lib64/python3.12/site-packages/tokenizers/models
usr/lib64/python3.12/site-packages/tokenizers/models/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/models/__init__.pyi
usr/lib64/python3.12/site-packages/tokenizers/normalizers
usr/lib64/python3.12/site-packages/tokenizers/normalizers/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/normalizers/__init__.pyi
usr/lib64/python3.12/site-packages/tokenizers/pre_tokenizers
usr/lib64/python3.12/site-packages/tokenizers/pre_tokenizers/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/pre_tokenizers/__init__.pyi
usr/lib64/python3.12/site-packages/tokenizers/processors
usr/lib64/python3.12/site-packages/tokenizers/processors/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/processors/__init__.pyi
usr/lib64/python3.12/site-packages/tokenizers/tokenizers.abi3.so
usr/lib64/python3.12/site-packages/tokenizers/tools
usr/lib64/python3.12/site-packages/tokenizers/tools/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/tools/visualizer-styles.css
usr/lib64/python3.12/site-packages/tokenizers/tools/visualizer.py
usr/lib64/python3.12/site-packages/tokenizers/trainers
usr/lib64/python3.12/site-packages/tokenizers/trainers/__init__.py
usr/lib64/python3.12/site-packages/tokenizers/trainers/__init__.pyi
usr/share/doc/tokenizers
var/adm/dependencies/tokenizers
var/adm/descs/tokenizers
var/adm/flists/tokenizers
var/adm/md5sums/tokenizers
var/adm/packages/tokenizers
1) This page was automatically generated from the T2
package source. Corrections, such as dead links, URL changes or typos
need to be performed directly on that source.
2) Compatible with Linux From Scratch's
"Standard Build Unit" (SBU).