Skip to content

Issues: explosion/curated-transformers

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Convert QKV projection splitting methods into Torch modules feat/layers Feature: Layers type/maintenance Type: Maintenance
#343 opened Oct 3, 2023 by danieldk updated Oct 3, 2023 v2.0.0
Make QkvMode ADT-like feat/layers Feature: Layers type/maintenance Type: Maintenance
#344 opened Oct 3, 2023 by danieldk updated Oct 3, 2023 v2.0.0
Expose more outputs through the Generator interface feat/generation Feature: Generation type/feature Type: Feature
#345 opened Oct 3, 2023 by danieldk updated Oct 3, 2023 v2.0.0
Add a an extras/contrib package type/maintenance Type: Maintenance
#347 opened Oct 3, 2023 by danieldk updated Oct 3, 2023 Undecided
Option to only return the last hidden layer output from models feat/model Feature: models type/feature Type: Feature
#342 opened Oct 3, 2023 by danieldk updated Oct 4, 2023 v2.0.0
Add support for attention sinks feat/layers Feature: Layers feat/model Feature: models type/feature Type: Feature
#350 opened Oct 4, 2023 by danieldk updated Oct 4, 2023 Undecided
Support for Encoder-Decoder-style architectures feat/model Feature: models type/feature Type: Feature
#340 opened Oct 2, 2023 by bilelomrani1 updated Oct 5, 2023 Undecided
Add support for Mistral feat/model Feature: models type/feature Type: Feature
#341 opened Oct 3, 2023 by danieldk updated Oct 19, 2023 v2.1.0
Support DeBERTa v2/3 feat/model Feature: models type/feature Type: Feature
#348 opened Oct 3, 2023 by danieldk updated Oct 19, 2023 Undecided
Move the old Falcon architecuture to the extras/addons pacakage type/maintenance Type: Maintenance
#355 opened Oct 19, 2023 by shadeMe updated Oct 19, 2023 Undecided
Thoughts on jaxtyping feat/misc Feature: Miscellaneous type/feature Type: Feature
#246 opened Jul 14, 2023 by Ryu1845 updated Oct 19, 2023 Undecided
Add Low-Rank Adapters injection into base models feat/training Feature: Training/Fine-tuning type/feature Type: Feature
#312 opened Aug 28, 2023 by bilelomrani1 updated Oct 19, 2023 Undecided
Optimal Qlora settings feat/training Feature: Training/Fine-tuning type/feature Type: Feature
#316 opened Sep 2, 2023 by KnutJaegersberg updated Oct 19, 2023 Undecided
Output logits for generation feat/generation Feature: Generation type/feature Type: Feature
#311 opened Aug 24, 2023 by mayankjobanputra updated Oct 19, 2023 v2.0.0
Add suggested PyTorch LLM optimizations feat/generation Feature: Generation feat/model Feature: models
#356 opened Dec 1, 2023 by danieldk updated Dec 1, 2023
Truncation of sequences that are beyond the model's maximum length feat/tokenization Feature: Tokenization/piecer type/bug Type: Bug type/feature Type: Feature
#359 opened Jan 14, 2024 by MootezSaaD updated Jan 31, 2024
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.