Comparing changes
base repository: huggingface/tokenizers
base: v0.21.4
head repository: huggingface/tokenizers
compare: v0.22.1
- 20 commits
- 26 files changed
- 14 contributors
Commits on Jul 22, 2025
- Bump on-headers and compression (#1827)
  ---
  updated-dependencies:
  - dependency-name: on-headers
    dependency-version: 1.1.0
    dependency-type: indirect
  - dependency-name: compression
    dependency-version: 1.8.1
    dependency-type: indirect
  ...
  Signed-off-by: dependabot[bot] <support@github.com>
  Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
  Commit 9164247
Commits on Jul 29, 2025
- Implement from_bytes and read_bytes Methods in WordPiece Tokenizer for WebAssembly Compatibility (#1758)
  * Add from_bytes and read_bytes method to WordPiece
  * Change wordpiece method return value
  ---------
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  Commit ed2cda5
Commits on Aug 27, 2025
- Commit 95b882a
Commits on Aug 29, 2025
- Commit abee958
  * update
  * update
  * updates
  * up
  * oikay
  * use stream input
  * nice all test pass?
  * fmt
  * dev
  * rename
  * simplify a hell lot
  * proper testing
  * fix inti
  * fix test
  * nits
  * make clippy happy now
  * fmt fml
  * remove the prints
  * fix gate
- Commit 49a0907
- Commit b0464b2
- Update quicktour.mdx re: Issue #1625 (#1846)
  Update broken wikitext-103 and tokenizers-pipeline links
  Commit ec54228
- Commit c5eb93f
- Commit 7c01a4f
- RUSTSEC-2024-0436 - replace paste with pastey (#1834)
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  Commit b43d8d7
- Tokenizer: Add native async bindings, via py03-async-runtimes. (#1843)
  * add async bindings
  * update based on review!
  * us hf internal testing for testing
  * reduce burden for the CI
  * asyn is not necessarily fast
  * remove comments
  ---------
  Co-authored-by: Arthur <arthur.zucker@gmail.com>
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  Commit bd1149c
- Revert "RUSTSEC-2024-0436 - replace paste with pastey (#1834)"
  This reverts commit b43d8d7.
  Commit 9bafd82
- Commit da1cc3b
Commits on Aug 30, 2025
- Commit 57eb8d7
- Commit 7b02178
- Commit c91d76a
Commits on Sep 16, 2025
- chore(trainer): add and improve trainer signature (#1838)
  * chore(trainers): add __init__ to fix python type check errors
  * restore
  * chore(trainer): add and improve trainer signature
  * clean fix
  * chore(fmt): fix cargo fmt error
  ---------
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  Commit c0d3697
Commits on Sep 19, 2025
- Bump huggingface_hub upper version (#1866)
  * Test hfh 1.0 rc0
  * Tokenizers works on both 0.x and 1.x versions
  Commit 972e7fc
- Commit 6cbd461
- Commit afaae08
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v0.21.4...v0.22.1