Releases: OpenNMT/CTranslate2

CTranslate2 4.5.0

22 Oct 11:23

minhthuc2502

Note: The CTranslate2 Python package now supports cuDNN 9 and is no longer compatible with cuDNN 8.

New features

Support Phi3 (#1800)
Support Mistral Nemo (#1785)
Support Wav2Vec2Bert ASR (#1778)

Fixes and improvements

Upgrade to cuDNN 9 (#1803)
Fix logits vocab (#1786 + #1791)
Update the AWQ documentation (#1795)

CTranslate2 4.4.0

09 Sep 09:21

minhthuc2502

Removed: Flash Attention support in the Python package due to significant package size increase with minimal performance gain.
Note: Flash Attention remains supported in the C++ package with the WITH_FLASH_ATTN option.
Flash Attention may be re-added in the future if substantial improvements are made.

New features

Support Llama3 (#1751)
Support Gemma2 (#1772)
Add log probs for all tokens in vocab (#1755)
Grouped conv1d (#1749 + #1758)
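
To illustrate what "log probs for all tokens in vocab" means in practice, here is a minimal sketch that turns one step's logits into a log-probability for every vocabulary entry using a numerically stable log-softmax. It is not CTranslate2's implementation or API; the `log_softmax` helper, the four-token vocabulary, and the logit values are all hypothetical.

```python
import math

def log_softmax(logits):
    """Return a log-probability for every entry of a logits vector."""
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return [x - log_norm for x in logits]

# Hypothetical vocabulary and made-up logits for a single decoding step.
vocab = ["<unk>", "hello", "world", "</s>"]
logits = [0.1, 2.0, 1.0, -1.0]

log_probs = log_softmax(logits)
for token, lp in zip(vocab, log_probs):
    print(f"{token}\t{lp:.4f}")
```

Exponentiating the returned values gives probabilities that sum to 1, so the full distribution is available rather than only the score of the chosen token.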

Fixes and improvements

Fix pipeline (#1723 + #1747)
Some improvements in flash attention (#1732)
Fix crash when using return_alternative on CUDA (#1733)
Quantization AWQ GEMM + GEMV (#1727)

CTranslate2 4.3.1

11 Jun 09:16

minhthuc2502

Note: The v4.3.0 release could not be pushed to PyPI because the project exceeded its size limit on PyPI (> 20 GB).

Fixes and improvements

Improve the compilation (#1706 and #1705)
Fix position bias in tensor parallel mode (#1714)

CTranslate2 4.3.0

17 May 08:20

minhthuc2502

New features

Support phi-3 (8k and 128k) (#1700 and #1680)

Fixes and improvements

Fix regression Flash Attention (#1695)

CTranslate2 4.2.1

24 Apr 10:04

minhthuc2502

Note: The v4.2.0 release could not be pushed because the package size had grown too large (> 100 MB).

New features

Support load/unload for generator/Whisper Attention (#1670)

Fixes and improvements

Fix Llama 3 (#1671)

CTranslate2 4.2.0

10 Apr 11:41

minhthuc2502

New features

Support Flash Attention (#1651)
Implement GEMM for the FLOAT32 compute type with the RUY backend (#1598)
Conv1D quantization on CPU only (the DNNL and CUDA backends are not supported) (#1601)

Fixes and improvements

Fix a bug in tensor parallel mode (#1643)
Use BestSampler when temperature is 0 (#1659)
Fix a bug with Gemma (#1660)
Optimize loading/unloading time for Translator with cache (#1645)
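
The "Use BestSampler when temperature is 0" item can be pictured with a short sketch. Temperature sampling divides the logits by the temperature, which is undefined at 0, so a sampler can instead return the highest-scoring token. This assumes BestSampler means greedy (argmax) selection; the `sample_token` function below is hypothetical and is not CTranslate2's code.

```python
import math
import random

def sample_token(logits, temperature, rng=random):
    """Pick a token id from logits; a temperature of 0 means greedy argmax."""
    if temperature == 0:
        # Dividing by zero is undefined, so fall back to the best-scoring token.
        return max(range(len(logits)), key=lambda i: logits[i])
    # Otherwise scale the logits and sample from the resulting softmax weights.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]

# Made-up logits for a three-token vocabulary.
logits = [0.1, 3.0, 1.0]
greedy_id = sample_token(logits, temperature=0)
sampled_id = sample_token(logits, temperature=0.7, rng=random.Random(0))
print(greedy_id, sampled_id)
```

With temperature 0 the result is deterministic, while any positive temperature still draws from the distribution.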

CTranslate2 4.1.1

12 Mar 08:59

minhthuc2502

Fixes and improvements

Fix classifiers in setup.py so the package can be pushed to PyPI

CTranslate2 4.1.0

11 Mar 16:15

minhthuc2502

New features

Support Gemma Model (#1631)
Support Tensor Parallelism (#1599)

Fixes and improvements

Avoid initializing unused GPU (#1633)
Read very large tensors in chunks when the size exceeds the maximum value of int (#1636)
Update Readme
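
The chunked read mentioned above can be sketched as follows: when a single read request cannot exceed the maximum value of a 32-bit int, a larger tensor is read through repeated, capped requests. This is only an illustration of the idea in Python, not the library's C++ loader; `read_in_chunks`, `INT_MAX`, and the `max_chunk` parameter are hypothetical names.

```python
import io

INT_MAX = 2**31 - 1  # largest value of a 32-bit signed int

def read_in_chunks(stream, total_size, max_chunk=INT_MAX):
    """Read total_size bytes, never requesting more than max_chunk at once."""
    buffer = bytearray()
    remaining = total_size
    while remaining > 0:
        chunk = stream.read(min(remaining, max_chunk))
        if not chunk:
            raise EOFError("stream ended before the tensor was fully read")
        buffer += chunk
        remaining -= len(chunk)
    return bytes(buffer)

# Tiny demo: a 10-byte "tensor" read with an artificially small chunk limit.
data = bytes(range(10))
result = read_in_chunks(io.BytesIO(data), len(data), max_chunk=4)
print(result == data)
```

The demo uses a 4-byte limit so the loop runs three times; with the default limit the same loop only splits reads larger than `INT_MAX` bytes.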

CTranslate2 4.0.0

15 Feb 12:51

minhthuc2502

This major version introduces a breaking change: the update to CUDA 12.

Breaking changes

Python

Support CUDA 12

New features

Add the to_device() method to the StorageView class in Python to move data between host and device

Fixes and improvements

Implement Conv1D with im2col and GEMM to improve performance
Get tokens in the range of the vocab size for Llama models
Fix loss of performance
Update cibuildwheel to 2.16.5
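
For readers unfamiliar with the im2col and GEMM approach named in the Conv1D item, the sketch below shows the general technique in pure Python: the input is unfolded into rows of sliding windows, and the convolution then becomes a single matrix product. It assumes stride 1 and no padding, and it is not the library's implementation; `im2col_1d` and `conv1d_gemm` are hypothetical helper names.

```python
def im2col_1d(x, kernel_size):
    """Unfold x (channels x length) into rows of flattened sliding windows."""
    channels, length = len(x), len(x[0])
    out_len = length - kernel_size + 1
    return [
        [x[c][t + k] for c in range(channels) for k in range(kernel_size)]
        for t in range(out_len)
    ]

def conv1d_gemm(x, weight):
    """Conv1D (stride 1, no padding) expressed as a matrix product.

    weight has shape out_channels x in_channels x kernel_size.
    """
    kernel_size = len(weight[0][0])
    cols = im2col_1d(x, kernel_size)  # out_len x (in_channels * kernel_size)
    # Flatten each filter in the same (channel, tap) order used by im2col_1d.
    flat_w = [[w for ch in filt for w in ch] for filt in weight]
    # GEMM: output[o][t] = sum_j flat_w[o][j] * cols[t][j]
    return [
        [sum(a * b for a, b in zip(row_w, col)) for col in cols]
        for row_w in flat_w
    ]

x = [[1.0, 2.0, 3.0, 4.0]]      # 1 input channel, length 4
weight = [[[1.0, 0.0, -1.0]]]   # 1 filter, kernel size 3
print(conv1d_gemm(x, weight))   # [[-2.0, -2.0]]
```

The benefit of this formulation is that the inner loop becomes one large matrix multiplication, which optimized GEMM routines handle efficiently.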

CTranslate2 3.24.0

09 Jan 09:17

minhthuc2502

New features

Support a new option offset to ignore the token scores of special tokens
