Improved output parsing, small refactoring of LLM client #504

Leidtier · 2025-02-08T16:34:55Z

Improvements for output parsing:

handling of consecutive sentence terminators (.!? etc. this includes ...)
respect max number of sentences again
improved fusing of sentences that are too short for the TTS
narration parsing now can deal with sentences marked as speech by quotes ("")
created clearly readable enums to indicate if a sentence is speech or narration

Small refactoring of the LLM client

created an AIClient interface that is now used throughout the software instead of the LLMClient
consolidated the 5 different methods to measure text by tokens into 2 (get_count_tokens and is_too_long)
added an alternate implementation of the AIClient interface called LLMTestClient that can be initiated with specific responses to test the parsing logic

Fixed dropping of duplicate summary paragraphs in context.
Fixed several typing issues.

Improvements for output parsing: - handling of consecutive sentence terminators (.!? etc.) - respect max number of sentences again - improved fusing of sentences that are too short for the TTS - narration parsing now can deal with sentences marked as speech by quotes ("") - created clearly readable enums to indicate if a sentence is speech or narration Small refactoring of the LLM client - created an `AIClient` interface that is now used throughout the software instead of the `LLMClient` - consolidated the 5 different methods to measure text by tokens into 2 (`get_count_tokens` and `is_too_long`) - added an alternate implementation of the `AIClient` interface called `LLMTestClient` that can be initiated with specific responses to test the parsing logic Fixed dropping of duplicate summary paragraphs in context. Fixed several typing issues.

…new changes

This reverts commit 99708ad.

…next sentence

Leidtier added 7 commits February 1, 2025 18:43

Add handling of alternating voice line files

99708ad

Merge remote-tracking branch 'upstream/main' into improve-output-parsing

bb42bd6

Removed fixes for '...' and max sentences as they are handled by the …

1d3b157

…new changes

Removed max sentence check from radiant conversations

fcd6acb

Fixed a few more things that got lost in the merge

238d765

Revert "Add handling of alternating voice line files"

b011d24

This reverts commit 99708ad.

Leidtier mentioned this pull request Feb 9, 2025

Fix duplicate voice lines by using alternating voice lines and topic info #507

Merged

art-from-the-machine added 5 commits February 12, 2025 13:36

Rename interim variables to parsed_sentence and pending_sentence

fd3d9ee

Process first sentence as soon as it is ready instead of waiting for …

6ff44eb

…next sentence

Time narration parsing

1756896

Allow image LLMs to make request call

5d5a8d1

Merge branch 'main' into pr/Leidtier/504

1356537

art-from-the-machine approved these changes Feb 12, 2025

View reviewed changes

art-from-the-machine merged commit 806a333 into art-from-the-machine:main Feb 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved output parsing, small refactoring of LLM client #504

Improved output parsing, small refactoring of LLM client #504

Leidtier commented Feb 8, 2025 •

edited

Loading

Improved output parsing, small refactoring of LLM client #504

Improved output parsing, small refactoring of LLM client #504

Conversation

Leidtier commented Feb 8, 2025 • edited Loading

Leidtier commented Feb 8, 2025 •

edited

Loading