
[IMPROVEMENT] Rescue of PR 2106: Intelligent Teletext sentence merging #2190

Closed
Atul-Chahar wants to merge 1 commit into CCExtractor:master from Atul-Chahar:improvement/teletext-sentence-merging

@Atul-Chahar
Contributor

In raising this pull request, I confirm the following (please check boxes):

My familiarity with the project is as follows (check one):

  • I am an active contributor to CCExtractor.

Summary

This PR rescues the Teletext sentence-merging feature proposed by @Apprentice2907 in #2106 and addresses the reviewer feedback asking for it to be split out from unrelated commits.

I have:

  1. Created a clean branch from master containing only the telxcc.c changes that delay flushing subtitles until a sentence completes.
  2. Fixed a critical buffer bug in the original PR, which mistakenly used page_buffer_add_string to append the current fragment to itself; that would zero out the buffer and lose the fragment. I replaced this with manual reallocation and concatenation of ctx->page_buffer_prev and ctx->ucs2_buffer_prev so that the fragments actually merge.
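
The two changes listed above can be illustrated with a minimal, standalone sketch. This is not the code from telxcc.c, which is not shown in this thread: the helper names `ends_sentence` and `append_fragment` are hypothetical, and a plain `char *` stands in for the pending buffer. The real change would need analogous handling for both ctx->page_buffer_prev and ctx->ucs2_buffer_prev.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: returns true when the text ends in sentence-final
 * punctuation, ignoring trailing spaces and line breaks. A caller would
 * hold back the flush while this returns false. */
static bool ends_sentence(const char *text)
{
    size_t len = strlen(text);
    while (len > 0 && (text[len - 1] == ' ' || text[len - 1] == '\n' ||
                       text[len - 1] == '\r'))
        len--;
    if (len == 0)
        return false;
    char last = text[len - 1];
    return last == '.' || last == '!' || last == '?';
}

/* Hypothetical helper: grows the heap string *pending and appends
 * `fragment` to it, separated by one space when *pending is non-empty.
 * `fragment` must not point into *pending, because realloc may move it.
 * Returns 0 on success, or -1 on allocation failure, in which case
 * *pending is left untouched. */
static int append_fragment(char **pending, const char *fragment)
{
    assert(pending != NULL && fragment != NULL);

    size_t old_len = *pending ? strlen(*pending) : 0;
    size_t add_len = strlen(fragment);
    size_t sep = old_len > 0 ? 1 : 0;

    /* realloc(NULL, n) behaves like malloc(n), so the first fragment
     * needs no special case. */
    char *grown = realloc(*pending, old_len + sep + add_len + 1);
    if (grown == NULL)
        return -1;

    if (sep)
        grown[old_len] = ' ';
    /* Copy the fragment together with its terminating NUL byte. */
    memcpy(grown + old_len + sep, fragment, add_len + 1);

    *pending = grown;
    return 0;
}
```

A caller would start with a NULL pending pointer, call `append_fragment` for each incoming fragment, and emit (then free) the merged text only once `ends_sentence` returns true. Assigning the result of `realloc` to a temporary first keeps the existing text intact if the allocation fails.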

All C and Rust tests pass locally.

Addresses the request in #2106 for a clean, focused PR.

@ccextractor-bot
Collaborator

CCExtractor CI platform finished running the test files on linux. Below is a summary of the test results, when compared to test for commit 90128d8...:
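| Report Name | Tests Passed |
|-------------|--------------|
| Broken      | 10/13        |
| CEA-708     | 2/14         |
| DVB         | 2/7          |
| DVD         | 3/3          |
| DVR-MS      | 2/2          |
| General     | 27/27        |
| Hardsubx    | 1/1          |
| Hauppage    | 3/3          |
| MP4         | 3/3          |
| NoCC        | 10/10        |
| Options     | 76/86        |
| Teletext    | 0/21         |
| WTV         | 13/13        |
| XDS         | 34/34        |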

Your PR breaks these cases:

NOTE: The following tests have been failing on the master branch as well as the PR:

  • ccextractor --startcreditsforatmost 2 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Test 8730

Congratulations: Merging this PR would fix the following tests:

  • ccextractor --autoprogram --out=srt --latin1 --quant 0 85271be4d2..., Last passed: Never
  • ccextractor --autoprogram --out=ttxt --latin1 --ucla dab1c1bd65..., Last passed: Never
  • ccextractor --out=srt --latin1 --autoprogram 29e5ffd34b..., Last passed: Never
  • ccextractor --out=spupng c83f765c66..., Last passed: Never
  • ccextractor --parsePAT --out=srt c83f765c66..., Last passed: Never
  • ccextractor --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --startcreditsnotbefore 1 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --startcreditsforatleast 1 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --endcreditsforatleast 3 --endcreditstext "CCextractor Ends crdit Testing" addf5e2fc9..., Last passed: Never

It seems that not all tests were passed completely. This is an indication that the output of some files is not as expected (but might be according to you).

Check the result page for more info.

@Atul-Chahar closed this on Mar 8, 2026
@Atul-Chahar deleted the improvement/teletext-sentence-merging branch on March 8, 2026 00:15
@ccextractor-bot
Collaborator

CCExtractor CI platform finished running the test files on windows. Below is a summary of the test results, when compared to test for commit e4bcade...:
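| Report Name | Tests Passed |
|-------------|--------------|
| Broken      | 10/13        |
| CEA-708     | 2/14         |
| DVB         | 2/7          |
| DVD         | 3/3          |
| DVR-MS      | 2/2          |
| General     | 27/27        |
| Hardsubx    | 1/1          |
| Hauppage    | 3/3          |
| MP4         | 3/3          |
| NoCC        | 10/10        |
| Options     | 78/86        |
| Teletext    | 0/21         |
| WTV         | 13/13        |
| XDS         | 34/34        |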

Your PR breaks these cases:

NOTE: The following tests have been failing on the master branch as well as the PR:

Congratulations: Merging this PR would fix the following tests:

  • ccextractor --autoprogram --out=srt --latin1 --quant 0 85271be4d2..., Last passed: Never
  • ccextractor --autoprogram --out=ttxt --latin1 --ucla dab1c1bd65..., Last passed: Never
  • ccextractor --out=srt --latin1 --autoprogram 29e5ffd34b..., Last passed: Never
  • ccextractor --out=spupng c83f765c66..., Last passed: Never
  • ccextractor --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --startcreditsnotbefore 1 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --startcreditsnotafter 2 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --startcreditsforatleast 1 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never
  • ccextractor --startcreditsforatmost 2 --startcreditstext "CCextractor Start crdit Testing" c4dd893cb9..., Last passed: Never

It seems that not all tests were passed completely. This is an indication that the output of some files is not as expected (but might be according to you).

Check the result page for more info.
