Skip to content

jdvelasquez-git/multilingual-glossary-builder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

9 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

⚑ AI-Assisted Glossary Builder

If you work in Localization or Globalization your in organization, you likely rely on glossaries to ensure brand/product message consistency, especially for technical specifications. This handy Python script will help you leverage content from existing translation memories to create an almost-perfect bilingual glossary leveraging Perplexity's Sonar model. You can read more about why I made this script by visiting Localization Times (blog).

πŸ“„ Requirements

  1. A list of terms in Excel format.
  2. A translation memory in Excel format. Check your TMS documentation for details: Phrase, MemoQ, Trados (via GC).
  3. Perplexity API key. You will need to use a valid payment method. Check their API pricing (Sonar) for more details.

πŸ’Ύ Libraries Used

  1. OpenAI
  2. Pandas; openpyxl; requests

πŸ“„ General Instructions

  1. Edit the script and add your API key (line 40).
  2. Run the script.
  3. Drag and drop your Excel files.
  4. Specify the column header names for each Excel file.
  5. Enjoy your new glossary.

πŸ’­ Future Improvements

  1. Ability to use TMX or Excel files interchangeably via lxml and some conditionals).
  2. Reverse matching of extracted terms for validation purposes.
  3. Code optimizations for reduced friction during the wizard.

About

AI-Assisted Multilingual Glossary Builder

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages