Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Article full-text and meta-data extraction #5

Open
nikhilkumarsingh opened this issue Nov 11, 2017 · 5 comments
Open

Article full-text and meta-data extraction #5

nikhilkumarsingh opened this issue Nov 11, 2017 · 5 comments
Labels

Comments

@nikhilkumarsingh
Copy link
Owner

Check out newspaper.

This task can be divided into 2 parts:

  1. Instead of creating a dictionary, make an Article object for each fetched article.
  2. Use newspaper module to fetch meta data and text of the article as well.
@CodeVINCI
Copy link
Contributor

CodeVINCI commented Dec 2, 2017

task1 means - client.get_news() must return a list of Article objects?

@CodeVINCI
Copy link
Contributor

#10

@ghost
Copy link

ghost commented Oct 4, 2018

is it possible to retrieve the date and time of the news? is there any modification possible at a custom level?

@nikhilkumarsingh
Copy link
Owner Author

You can try to modify this function to scrape any extra data according to your needs.

@ghost
Copy link

ghost commented Oct 6, 2018

I was thinking is there a possible way to get the news result till page five or more results than just 10?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants