[BrainPOP] Add new extractor (Closes #10000) #10025

nehalvpatel · 2016-07-07T04:59:12Z

Before submitting a pull request make sure you have:

At least skimmed through adding new extractor tutorial and youtube-dl coding conventions sections
Searched the bugtracker for similar pull requests

What is the purpose of your pull request?

Bug fix
New extractor
New feature

Description of your pull request and other information

Added a new extractor for BrainPOP. In addition to the video, it extracts the description and the thumbnail. It should work on any video on the site, even if it's in a different language.

TRox1972 · 2016-07-08T16:42:57Z

youtube_dl/extractor/brainpop.py

+
+        self.report_extraction(video_id)
+
+        ec_token = self._html_search_regex(r'ec_token : \'(.+)\'', webpage, 'token')


It's easier to parse the JSON instead of searching through it manually.

@TRox1972 Right, but the JSON is mixed in with the HTML. How should I go about extracting it?

EDIT: I think I found a way.
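For reference, a minimal sketch of the approach suggested above, using only the standard library. The sample page, the `var settings = ...` assignment, and the helper name `extract_embedded_json` are assumptions made up for illustration; the actual BrainPOP markup is not shown in this thread.

```python
import json
import re


def extract_embedded_json(webpage, var_name):
    """Return the JSON object assigned to `var_name` inside an HTML page."""
    # Locate the start of the object literal (the variable name is hypothetical).
    match = re.search(r'\b%s\s*=\s*(?=\{)' % re.escape(var_name), webpage)
    if match is None:
        raise ValueError('%s not found in webpage' % var_name)
    # raw_decode parses exactly one JSON value starting at the given index
    # and ignores whatever follows, so no regex has to guess where it ends.
    obj, _ = json.JSONDecoder().raw_decode(webpage, match.end())
    return obj


# Made-up page source standing in for the real webpage.
SAMPLE_PAGE = '''<html><script>
var settings = {"title": "Moon - BrainPOP", "ec_token": "abc123"};
</script></html>'''

settings = extract_embedded_json(SAMPLE_PAGE, 'settings')
print(settings['ec_token'])
```

This only works if the embedded data is strict JSON. A JavaScript object literal with unquoted keys or single-quoted strings would need to be normalized first.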


TRox1972 · 2016-07-11T19:17:58Z

youtube_dl/extractor/brainpop.py

+
+        self.report_extraction(video_id)
+
+        ec_token = self._html_search_regex(r"ec_token : '([^']*)'", webpage, 'token')


Is this needed? Does it work without it?

The token? Yeah, you can't access the videos without it.
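The regex in this hunk also differs from the first revision: the capture group changed from `(.+)` to `([^']*)`. A small sketch of why that matters, run against a made-up line of page source (the second key is an assumption, since the real page is not shown here):

```python
import re

# Hypothetical line of page source with a second quoted value after the token.
SAMPLE = "ec_token : 'abc123', other_key : 'xyz'"

GREEDY = r'ec_token : \'(.+)\''      # first revision
BOUNDED = r"ec_token : '([^']*)'"    # revised pattern

# `.+` is greedy, so it runs to the last quote on the line.
greedy_token = re.search(GREEDY, SAMPLE).group(1)
# `[^']*` cannot cross a quote, so it stops at the token's closing quote.
bounded_token = re.search(BOUNDED, SAMPLE).group(1)

print(greedy_token)
print(bounded_token)
```

The bounded pattern also avoids the backtracking that the greedy one needs in order to find the final quote.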


TRox1972 · 2016-07-13T00:29:09Z

youtube_dl/extractor/brainpop.py

+            'id': content['category']['unit']['topic']['EntryID'],
+            'display_id': display_id,
+            'title': remove_end(settings['title'], ' - BrainPOP'),
+            'description': settings['description'],


This should be non-fatal
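A minimal sketch of what non-fatal means for an optional field, assuming `settings` is a plain dict as in the hunk above. The function name `build_optional_metadata` is hypothetical.

```python
def build_optional_metadata(settings):
    """Collect optional fields without failing when one is absent."""
    return {
        # dict.get returns None for a missing key, whereas
        # settings['description'] would raise KeyError and abort extraction.
        'description': settings.get('description'),
    }


print(build_optional_metadata({'title': 'Moon - BrainPOP'}))
```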

TRox1972 · 2016-07-13T02:31:42Z

youtube_dl/extractor/brainpop.py

            'display_id': display_id,
-            'title': remove_end(settings['title'], ' - BrainPOP'),
-            'description': settings['description'],
+            'title': remove_end(settings.get('title', display_id), ' - BrainPOP'),


The title should be mandatory

Such a pattern is OK if title is missing in settings. @nehalvpatel Do you have an example?

title can be extracted from <title>(.*)</title> also. One of them should be fatal, right?

Either way is fine. In general, <title> is better than extracting from embedded data, as the former is less likely to change. If there's an example where <title> or settings['title'] is missing, a fallback should be provided; otherwise the extraction should be fatal.
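One way the rule discussed here could look, as a standalone sketch rather than the extractor's actual code: try `<title>` first, fall back to `settings['title']`, and fail only when both are missing. `ExtractionError`, `strip_suffix`, and `extract_title` are hypothetical names; `strip_suffix` is a local stand-in for the `remove_end` call shown in the diff.

```python
import re


class ExtractionError(Exception):
    """Stand-in for a fatal extraction error (hypothetical name)."""


def strip_suffix(text, suffix):
    # Local stand-in for remove_end: drop the suffix only if it is present.
    return text[:-len(suffix)] if suffix and text.endswith(suffix) else text


def extract_title(webpage, settings):
    """Return the title, preferring <title> and falling back to settings."""
    match = re.search(r'<title>(.*?)</title>', webpage, re.DOTALL)
    title = match.group(1).strip() if match else None
    if not title:
        # Fallback source; missing here is still not an error by itself.
        title = settings.get('title')
    if not title:
        # Both sources failed, so the mandatory field makes extraction fatal.
        raise ExtractionError('Unable to extract title')
    return strip_suffix(title, ' - BrainPOP')


print(extract_title('<title>Moon - BrainPOP</title>', {}))
```

Unlike `settings.get('title', display_id)`, this never silently substitutes the display ID for a missing title.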

EpicCodeWizard · 2021-09-21T04:53:52Z

Does this thing still work?


nehalvpatel added 2 commits July 6, 2016 23:36

[BrainPOP] Add new extractor

45abe20

[BrainPOP] Clean up code and account for non-mandatory fields

f56a9db

TRox1972 reviewed Jul 8, 2016

[BrainPOP] Switch from regex to parsing JSON and include both resolutions

b00d17e

TRox1972 reviewed Jul 11, 2016

[BrainPOP] Optimize regex and extractor, improve metadata, and add subscription video detection

7022e24

TRox1972 reviewed Jul 13, 2016

[BrainPOP] Trim code and make optional metadata less brittle

f02b57d

TRox1972 reviewed Jul 13, 2016

dstftw added the pending-fixes label Aug 27, 2016

dstftw force-pushed the master branch from fa77986 to 0c7a631 Compare June 24, 2017 22:03

dstftw force-pushed the master branch from 4991699 to 1141e91 Compare August 5, 2017 00:42

dstftw force-pushed the master branch from 293617b to af0f742 Compare October 11, 2017 16:48

dstftw force-pushed the master branch from 37318e1 to 65220c3 Compare January 27, 2018 22:49

dstftw force-pushed the master branch from 8d14fa1 to 5399ab3 Compare February 4, 2018 00:55

dstftw force-pushed the master branch from c486aa9 to 5ee7ae5 Compare December 9, 2018 15:38

dstftw force-pushed the master branch from 8cd780c to de0359c Compare January 4, 2019 20:44

dstftw force-pushed the master branch from d99bab0 to e118a87 Compare January 23, 2019 18:40

dstftw force-pushed the master branch from 5e26784 to da2069f Compare September 13, 2020 13:52

cypheron mentioned this pull request Feb 3, 2021

Evaluation / overview of new proposed extractors / sites #28054

Open

dirkf force-pushed the master branch from 01bf89e to 4c6fba3 Compare August 26, 2022 07:51

MinePlayersPE mentioned this pull request Jan 28, 2023

[BrainPOP] Add extractors yt-dlp/yt-dlp#6106

Merged

9 tasks

pukkandan pushed a commit to yt-dlp/yt-dlp that referenced this pull request Apr 12, 2023

[extractor/BrainPOP] Add extractors (#6106)

979568f

Authored by: MinePlayersPE Based on ytdl-org/youtube-dl#10025

dirkf closed this Aug 1, 2023

dirkf added the defunct PR source branch is not accessible label Oct 2, 2023

aalsuwaidi pushed a commit to aalsuwaidi/yt-dlp that referenced this pull request Apr 21, 2024

[extractor/BrainPOP] Add extractors (yt-dlp#6106)

41a1ed2

Authored by: MinePlayersPE Based on ytdl-org/youtube-dl#10025
