
Performance improvements #429

Merged: 12 commits into main from performance_improvements on Nov 13, 2024

Conversation

art-from-the-machine (Owner)

  • For both Piper and XTTS, voice model loading functions are now called as soon as the first NPC is ready, and run asynchronously while the LLM generates a response (see the preloading sketch after this list)
  • Added an option to the Mantella UI to enable/disable lip file generation (or select "Lazy" to skip lip generation for the first returned sentence and improve response times)
  • Added the tiktoken encoding model locally to the data/ folder to avoid connecting to the internet (see the local-cache sketch below)
  • Async OpenAI client is now created on init to improve response times at the start of a conversation (see the client sketch below)
  • Minor performance improvements to conversation summary loading, NPC searching in character_df, and "trust" calculation
  • Added checks for whether context values exist in the JSON input before trying to load them
  • Added timing logs to many functions for better performance monitoring (see the timing sketch below)
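The overlap between voice model loading and LLM generation can be pictured with a minimal asyncio sketch. The coroutine names (`load_voice_model`, `generate_response`, `run_turn`) are hypothetical stand-ins, not Mantella's actual functions:

```python
import asyncio

async def load_voice_model(npc_name: str) -> None:
    """Stand-in for Piper/XTTS voice model loading (hypothetical)."""
    await asyncio.sleep(1.0)  # placeholder for slow model/disk I/O

async def generate_response(prompt: str) -> str:
    """Stand-in for the LLM call (hypothetical)."""
    await asyncio.sleep(2.0)
    return f"Response to: {prompt}"

async def run_turn(npc_name: str, prompt: str) -> str:
    # Start loading the voice model as soon as the NPC is known...
    load_task = asyncio.create_task(load_voice_model(npc_name))
    # ...and generate the LLM response concurrently.
    response = await generate_response(prompt)
    # Make sure the voice model is ready before synthesizing audio.
    await load_task
    return response

print(asyncio.run(run_turn("Lydia", "Hello!")))
```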
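One common way to keep tiktoken offline is to point its cache at a local folder before an encoding is first requested; this is a hedged sketch of that approach, not necessarily how the PR wires it up. It assumes the BPE file already sits in data/ under the hashed filename tiktoken's cache layout expects:

```python
import os

# Assumption: the cl100k_base BPE file has already been placed in data/
# under the filename tiktoken's cache layout expects. With the cache dir
# set, get_encoding() reads from disk instead of downloading.
os.environ["TIKTOKEN_CACHE_DIR"] = "data"

import tiktoken

encoding = tiktoken.get_encoding("cl100k_base")
print(len(encoding.encode("Hello, Skyrim!")))
```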
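Creating the async OpenAI client once at init, rather than per request, looks roughly like this (the `LLMService` wrapper and model name are hypothetical; the real setup may differ):

```python
from openai import AsyncOpenAI

class LLMService:
    """Hypothetical wrapper; Mantella's own class layout may differ."""

    def __init__(self) -> None:
        # Build the client once at startup so the first conversation
        # turn does not pay the client construction cost.
        self.client = AsyncOpenAI()  # reads OPENAI_API_KEY from the environment

    async def respond(self, prompt: str) -> str:
        completion = await self.client.chat.completions.create(
            model="gpt-4o-mini",  # placeholder model name
            messages=[{"role": "user", "content": prompt}],
        )
        return completion.choices[0].message.content
```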
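Timing logs of the kind described in the last bullet are often added with a small decorator; a generic sketch follows (the `timed` helper is illustrative, not the PR's code):

```python
import functools
import logging
import time

logging.basicConfig(level=logging.INFO)

def timed(func):
    """Log the wrapped function's wall-clock duration."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:
            elapsed = time.perf_counter() - start
            logging.info("%s took %.3f s", func.__name__, elapsed)
    return wrapper

@timed
def load_conversation_summaries() -> None:
    time.sleep(0.1)  # stands in for real work

load_conversation_summaries()
```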

@art-from-the-machine merged commit 3b307d2 into main on Nov 13, 2024
@art-from-the-machine deleted the performance_improvements branch on November 13, 2024 at 20:23