PIB articles for the year - 2022
Dec, 2022
Nov, 2022
Oct, 2022
Sep, 2022
Aug, 2022
Jul, 2022
Jun, 2022
May, 2022
Apr, 2022
Mar, 2022
Feb, 2022
Jan, 2022
PIB articles for the year - 2021
Dec, 2021
Nov, 2021
Oct, 2021
Sep, 2021
Aug, 2021
Jul, 2021
Jun, 2021
May, 2021
Apr, 2021
Mar, 2021
Feb, 2021
Jan, 2021
PIB articles for the year - 2020
Dec, 2020
Nov, 2020
Oct, 2020
Sep, 2020
Aug, 2020
Jul, 2020
Jun, 2020
May, 2020
Apr, 2020
Mar, 2020
Feb, 2020
Jan, 2020
Download articles from Press Information Bureau, India. This might be helpful for candidates preparing for different govt examinations.
You can either use the spider by cloning this repo and following the instructions given below
or
You can download the articles direcly from the release section or by clicking on the badges above for the year and month.
There are 4 different kind of zips in the release section for every month
- Day wise PIB_Daily | MMM_YYYY : These zips contain the PIB articles for the date DD/MMM/YYYY
- Month wise PIB_Monthly | MMM_YYYY : These zips contain the PIB articles for the whole month MMM/YYYY
- Text files PIB_Text | MMM_YYYY : These zips contain the text extracted from pdf of the whole month MMM/YYYY
- Article Link files PIB_LINKS | MMM_YYYY : These text files contains links for the article for the date DD/MMM/YYYY
#Clone the repo with:
git clone https://github.com/nit-in/pib
#cd to the cloned repo
cd pib
#installing required packages
pip install -r requirements.txt
#when these steps are done,you are ready to run the spider and download the articles.
#source the env file
source .env
#if you are using shell other than bash then
bash --init-file .env
#run the spider
pib start_date end_date #(date format: yyyy-mm-dd)
#example => to download the articles from June 1st, 2021 to June 15th, 2021; use
pib 2021-06-01 2021-06-15
For an Entire Month
#For the month of Jan, 2021
pib_month Jan 2021
#For the month of Dec, 2020
pib_month Dec 2020
#For present day
pib_today
#For last day
pib_last_day
Any suggestions and improvements are welcome.