southpark.de extractor broken by a site structure change; the ComedyCentral extractor should work #26763

Closed
@okh-mzny

Description

Checklist

  • I'm reporting a broken site support
  • I've verified that I'm running youtube-dl version 2020.09.20
  • I've checked that all provided URLs are alive and playable in a browser
  • I've checked that all URLs and arguments with special characters are properly quoted or escaped
  • I've searched the bugtracker for similar issues including closed ones

Verbose log

A:\>youtube-dl -v https://www.southpark.de/alle-episoden/s20e02-skankhunt
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', 'https://www.southpark.de/alle-episoden/s20e02-skankhunt']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2020.09.20
[debug] Python version 3.4.4 (CPython) - Windows-10-10.0.14393
[debug] exe versions: ffmpeg git-2020-07-13-7772666, ffprobe git-2020-07-13-7772666
[debug] Proxy map: {}
[southpark.de] s20e02-skankhunt: Downloading webpage
Traceback (most recent call last):
  File "__main__.py", line 19, in <module>
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\__init__.py", line 474, in main
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\__init__.py", line 464, in _real_main
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\YoutubeDL.py", line 2019, in download
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\YoutubeDL.py", line 797, in extract_info
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\extractor\common.py", line 532, in extract
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\extractor\mtv.py", line 287, in _real_extract
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\extractor\mtv.py", line 213, in _get_videos_info
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\extractor\mtv.py", line 39, in _id_from_uri
AttributeError: 'NoneType' object has no attribute 'split'

A:\>youtube-dl -v https://www.southpark.de/folgen/fi4nmu/south-park-mexikanischer-joker-staffel-23-ep-1
[debug] System config: []
[debug] User config: []
[debug] Custom config: []
[debug] Command-line args: ['-v', 'https://www.southpark.de/folgen/fi4nmu/south-park-mexikanischer-joker-staffel-23-ep-1']
[debug] Encodings: locale cp1252, fs mbcs, out cp850, pref cp1252
[debug] youtube-dl version 2020.09.20
[debug] Python version 3.4.4 (CPython) - Windows-10-10.0.14393
[debug] exe versions: ffmpeg git-2020-07-13-7772666, ffprobe git-2020-07-13-7772666
[debug] Proxy map: {}
[generic] south-park-mexikanischer-joker-staffel-23-ep-1: Requesting header
WARNING: Falling back on generic information extractor.
[generic] south-park-mexikanischer-joker-staffel-23-ep-1: Downloading webpage
[generic] south-park-mexikanischer-joker-staffel-23-ep-1: Extracting information
ERROR: Unsupported URL: https://www.southpark.de/folgen/fi4nmu/south-park-mexikanischer-joker-staffel-23-ep-1
Traceback (most recent call last):
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\YoutubeDL.py", line 797, in extract_info
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\extractor\common.py", line 532, in extract
  File "C:\Users\dst\AppData\Roaming\Build archive\youtube-dl\ytdl-org\tmpksi3o1r1\build\youtube_dl\extractor\generic.py", line 3382, in _real_extract
youtube_dl.utils.UnsupportedError: Unsupported URL: https://www.southpark.de/folgen/fi4nmu/south-park-mexikanischer-joker-staffel-23-ep-1

Description

Southpark.de recently changed its site, breaking the southpark.de extractor. The first command was run with the old URL format, which the southpark.de extractor recognizes, but it fails because the old URL now redirects to the new one. The second command uses the new southpark.de URL format and fails because no extractor is implemented for it.
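To make the difference between the two URL formats concrete, here is a minimal sketch that tells them apart. The regular expressions and the `classify` helper are hypothetical, written only from the two URLs in the log above; they are not youtube-dl's actual `_VALID_URL` patterns.

```python
import re

# Hypothetical patterns, derived only from the two URLs in this report.
# They are NOT the patterns youtube-dl actually uses.
OLD_FORMAT = re.compile(
    r'https?://(?:www\.)?southpark\.de/alle-episoden/(?P<id>s\d+e\d+-[^/?#]+)')
NEW_FORMAT = re.compile(
    r'https?://(?:www\.)?southpark\.de/folgen/(?P<id>[0-9a-z]+)/(?P<slug>[^/?#]+)')


def classify(url):
    """Return (format_name, id) for a southpark.de URL, or ('unknown', None)."""
    m = OLD_FORMAT.match(url)
    if m:
        return 'old', m.group('id')
    m = NEW_FORMAT.match(url)
    if m:
        return 'new', m.group('id')
    return 'unknown', None


if __name__ == '__main__':
    for url in (
        'https://www.southpark.de/alle-episoden/s20e02-skankhunt',
        'https://www.southpark.de/folgen/fi4nmu/south-park-mexikanischer-joker-staffel-23-ep-1',
    ):
        print(classify(url), url)
```

An extractor whose URL pattern only covers the old format would never be selected for the second URL, which would be consistent with the fallback to the generic extractor seen in the second log.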

The new southpark.de website looks and works much like cc.com, so using the ComedyCentral extractor for southpark.de should probably work. I have not been able to test this because there is no way to force a specific extractor for an unknown URL.

Hope this will be fixed.

Thank you.
