This GitHub organization belongs to the Barcelona Supercomputing Center (BSC). It is not affiliated with, sponsored by, or endorsed by LangTec. No association between BSC and the LangTec trademark is intended.
Language Technologies Laboratory - BSC
Popular repositories
- Wikiextractor-V2 Public
Enhanced version of Wikiextractor: a Wikipedia dump extractor.
- AnonymizationPipeline Public
Anonymization pipeline for ingesting data from outside BSC that contains GDPR-protected data.
- mt-evaluation Public Forked from EleutherAI/lm-evaluation-harness
A framework for evaluating Machine Translation models.
- vocos Public Forked from gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Repositories
- DocAlign Public
Pipeline for preprocessing, aligning, and rebuilding parallel text at document level.
- gorilla Public Forked from ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
- mt-wrapper Public
Compatibility wrapper for using SalamandraTA as a replacement for Google Translate or DeepL.
- bsc-lt_tokenizers Public
Language Technologies Unit's repository for tokenizer training and evaluation scripts.
People
This organization has no public members.