Suggestion: MarkItDown [audio-transcription] should be able to use a local Whisper model to transcribe MP3 and MP4 files #1860

@zhangweildlh

I am unable to connect to the OpenAI Whisper API, so I cannot use it to transcribe MP3 and MP4 files. My only option is to install ffmpeg, torch, and openai-whisper and run the transcription locally. However, MarkItDown does not support invoking a local Whisper model for transcription. I hope this feature will be supported in a future version.
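
To make the request concrete, here is a minimal sketch of the workflow I have in mind. It runs outside MarkItDown and is not MarkItDown's API: the helper names `whisper_transcriber` and `audio_to_markdown`, the `"base"` model size, and the output layout are my own assumptions for illustration. It only uses `whisper.load_model` and `model.transcribe` from the openai-whisper package, and it assumes ffmpeg is on the PATH so that both MP3 and MP4 inputs can be decoded.

```python
from pathlib import Path
from typing import Callable


def whisper_transcriber(model_name: str = "base") -> Callable[[str], str]:
    """Return a function that transcribes a media file with a local Whisper model.

    The import is done lazily so the rest of this sketch can be used
    (and tested) on machines where openai-whisper is not installed.
    """
    import whisper  # provided by the openai-whisper package

    model = whisper.load_model(model_name)  # model size is an assumption

    def transcribe(path: str) -> str:
        # transcribe() returns a dict; the full transcript is under "text".
        return model.transcribe(path)["text"].strip()

    return transcribe


def audio_to_markdown(path: str, transcribe: Callable[[str], str]) -> str:
    """Wrap a transcript in a small Markdown document (hypothetical layout)."""
    name = Path(path).name
    text = transcribe(path)
    return f"# {name}\n\n{text}\n"


if __name__ == "__main__":
    import sys

    if len(sys.argv) != 2:
        print("usage: python local_whisper.py <file.mp3|file.mp4>")
    else:
        print(audio_to_markdown(sys.argv[1], whisper_transcriber()))
```

Passing the transcriber in as a plain function keeps the Markdown step independent of where the transcript comes from, so the same wrapper could work with either a local model or a remote API.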
