-
Notifications
You must be signed in to change notification settings - Fork 29.4k
🚨🚨🚨 [SPM
] Finish fix spm models 🚨🚨🚨
#25224
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
ArthurZucker
merged 36 commits into
huggingface:main
from
ArthurZucker:finish-fix-spm-models
Aug 17, 2023
Merged
Changes from all commits
Commits
Show all changes
36 commits
Select commit
Hold shift + click to select a range
c99c130
fix EVERYTHING
ArthurZucker acf31e2
more fixes
ArthurZucker 7305aff
⚗️⚗️ Tokenizer magic ⚗️⚗️
ArthurZucker 01b8347
wrong value but test passes for the TODO
ArthurZucker b9ddbbb
update
ArthurZucker 83af718
updat
ArthurZucker 0babe38
safe protobuf import?
ArthurZucker 0fdf51e
style
ArthurZucker 2d197a1
non gated repo
ArthurZucker e9c7a72
update
ArthurZucker 94964cd
fixup
ArthurZucker cc9ddcf
Merge branch 'main' of https://github.com/huggingface/transformers in…
ArthurZucker 45cae43
Update src/transformers/models/llama/tokenization_llama.py
ArthurZucker 53557a9
Update src/transformers/models/llama/tokenization_llama.py
ArthurZucker e049d11
Update tests/models/t5/test_tokenization_t5.py
ArthurZucker b64b2d2
nits
ArthurZucker cb95361
fix t5 too
ArthurZucker a86bf78
use assert equal
ArthurZucker 913cd1d
fix llama decoding
ArthurZucker ef28574
nits on t5
ArthurZucker 4f65261
fixup
ArthurZucker ad7f8c6
only remove the prefix space, not other spaces
ArthurZucker 76d00cc
more deconding tests and more todos
ArthurZucker 9cb92b6
fix CI as well
ArthurZucker 204153f
fixup
ArthurZucker 9f37103
skip failing test on CI (its tf its ok)
ArthurZucker 700ee64
Merge branch 'main' of https://github.com/huggingface/transformers in…
ArthurZucker 4b5315b
skip test_subword_regularization_tokenizer that is also crashing on t…
ArthurZucker a4ed16f
Merge branch 'main' of https://github.com/huggingface/transformers in…
ArthurZucker e7906c2
update llama
ArthurZucker ad33c97
revert good fixes
ArthurZucker f890882
fixup
ArthurZucker b7f98bc
empty
ArthurZucker bb79083
explain why we need to encode with an additional token
ArthurZucker 3f8ac96
better warning?
ArthurZucker 4249986
nits
ArthurZucker File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.