AbhishikthAleti97/praw_seq2seq


praw_seq2seq

A program that scrapes Reddit data using the Reddit API.

Create an API application with your Reddit account to obtain credentials, enter those credentials in a new Python file named credentials.py, then run the program (change the limits and other parameters if necessary).
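As a rough illustration of the credentials step, here is a minimal sketch of what credentials.py could look like. The variable names `CLIENT_ID`, `CLIENT_SECRET`, and `USER_AGENT` and the helper `make_reddit` are assumptions made for this example, so check the program's own imports for the names it actually expects. The `praw.Reddit` keyword arguments shown are the ones PRAW uses for read-only access.

```python
# credentials.py (hypothetical layout; the names below are assumptions,
# not necessarily what the program in this repository imports)
CLIENT_ID = "your-client-id"          # shown under the app name on Reddit's apps page
CLIENT_SECRET = "your-client-secret"  # the app's "secret" field
USER_AGENT = "praw_seq2seq scraper by u/your-username"


def make_reddit():
    """Build a read-only PRAW client from the values above (hypothetical helper)."""
    import praw  # imported lazily so this file loads even if PRAW is not installed

    return praw.Reddit(
        client_id=CLIENT_ID,
        client_secret=CLIENT_SECRET,
        user_agent=USER_AGENT,
    )
```

Keeping the secrets in their own file makes it easier to exclude them from version control.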

Voila, you now have a dataset from your favorite subreddit. The program writes the data in a fixed format.

The 1st line is a comment, and the 2nd line is a reply to the 1st.

Each line holds a single comment. This format is well suited to training seq2seq chatbots.
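The comment/reply format described above can be sketched as follows. This is not the repository's actual code: the helper names `one_line`, `iter_pairs`, `format_pairs`, and `parse_pairs` are hypothetical, and the sketch only assumes that each comment object exposes a `body` string and an iterable `replies`, as PRAW comments do once "load more" placeholders have been resolved (for example with `submission.comments.replace_more(limit=0)`).

```python
from types import SimpleNamespace


def one_line(text):
    """Collapse newlines and repeated whitespace so a comment fits on one line."""
    return " ".join(text.split())


def iter_pairs(comments):
    """Yield (comment, reply) text pairs from a tree of comment-like objects.

    Assumes every object has `.body` and an iterable `.replies`.
    """
    for comment in comments:
        for reply in comment.replies:
            yield one_line(comment.body), one_line(reply.body)
            # A reply can itself be the "comment" for its own replies.
            yield from iter_pairs([reply])


def format_pairs(pairs):
    """Render pairs as alternating lines: comment, then its reply."""
    return "".join(f"{comment}\n{reply}\n" for comment, reply in pairs)


def parse_pairs(text):
    """Read the alternating-line format back into (source, target) pairs."""
    lines = text.splitlines()
    return list(zip(lines[0::2], lines[1::2]))


# Stand-in objects for demonstration; real input would come from PRAW.
thread = SimpleNamespace(
    body="What is your favorite\nlanguage?",
    replies=[SimpleNamespace(body="Python, easily.", replies=[])],
)
dataset = format_pairs(iter_pairs([thread]))
```

Reading the file back with `parse_pairs` gives (source, target) tuples, which is the shape most seq2seq training loops expect.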

Plans:

Clean up the program by rewriting it using OOP concepts. Make it installable with pip. Make it part of ChatterBot's features (similar to how ChatterBot scrapes Twitter data).
