Potential UTF-8 / Latin-1 regression #60

UsernamesLame · 2024-09-20T13:33:57Z

[2024-09-20 09:12:13,861] {model.py:132} INFO - Transcribing ...

Traceback (most recent call last):
  File "/Users/user/Desktop/whisper-metal/__main__.py", line 4, in <module>
    segments = model.transcribe('file.mp3')
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/user/Desktop/whisper-metal/.venv/lib/python3.12/site-packages/pywhispercpp/model.py", line 133, in transcribe
    res = self._transcribe(audio, n_processors=n_processors)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/user/Desktop/whisper-metal/.venv/lib/python3.12/site-packages/pywhispercpp/model.py", line 249, in _transcribe
    res = Model._get_segments(self._ctx, 0, n)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/user/Desktop/whisper-metal/.venv/lib/python3.12/site-packages/pywhispercpp/model.py", line 154, in _get_segments
    text = pw.whisper_full_get_segment_text(ctx, i)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 57-58: invalid continuation byte

I think we have a regression!

Originally posted in #59 (comment)

Just generating a separate issue so we don't disrupt that thread. @abdeladim-s I'm assuming this has something to do with dropping pydub. Are we not normalizing values anymore?
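
For reference, the same class of error can be reproduced in plain Python without pywhispercpp. The sketch below is illustrative only: `decode_segment` is a hypothetical helper, not part of the pywhispercpp API, and it assumes the raw segment bytes were available to Python, which the traceback does not confirm. It shows how a multi-byte UTF-8 sequence that is cut short triggers "invalid continuation byte", and how a tolerant decode would avoid the crash regardless of the root cause.

```python
def decode_segment(raw: bytes) -> str:
    """Hypothetical helper: decode as UTF-8, falling back to U+FFFD on bad bytes."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # errors="replace" substitutes U+FFFD for undecodable bytes instead of raising
        return raw.decode("utf-8", errors="replace")


# The euro sign is three bytes in UTF-8 (E2 82 AC); keep only the first two,
# then continue with ASCII, so the third byte is not a continuation byte.
truncated = "€".encode("utf-8")[:2] + b" rest"

reason = None
try:
    truncated.decode("utf-8")
except UnicodeDecodeError as exc:
    reason = exc.reason  # the decoder's explanation of the failure
    print(exc)

print(decode_segment(truncated))
```

Replacing bad bytes hides the symptom rather than fixing it, so this is only useful for confirming where the invalid bytes come from.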

UsernamesLame · 2024-09-20T13:38:54Z

I uninstalled the pywhispercpp build I had installed from git and reinstalled it from pip. The regression is gone, but so is CoreML.

Also, CoreML is a lot slower than CPU inference on an M1 Pro running macOS Sequoia.

UsernamesLame · 2024-10-07T16:49:08Z

@abdeladim-s Want to follow up on this, or should I consider it a one-off?

abdeladim-s · 2024-10-08T00:49:18Z

@UsernamesLame, I thought you were following #59. The issue was that the dylib files were not included in the wheel.
I think the new build resolved the issue!
