GitHub - prasunkantidey/AmazonProductRecommendation

AmazonReview contains the entire project, including the source code.

Folder structure:

AmazonReview/data/
AmazonReview/src/com/analysis/
AmazonReview/src/com/preprocess/

"data" stores all outputs (the graph PDF and the reduced dataset). Input files (for dataset reduction or for ReviewNetwork) must also be stored in this folder.

"analysis" has two Python files.

ARClass defines the structure of each product and stores information from the Amazon data.

ReviewNetwork performs all the computations: it builds the graph and produces the recommendations.

"preprocess" has only one Python file, ReduceDataset.py.

As the name suggests, it reduces the dataset and saves the output as a .txt file in the "data" folder.

Running the project:

If you are starting from the original dataset (amazon-meta.txt), you must first reduce it with ReduceDataset.py.
You can set the name of the reduced file there. By default it generates "amazon_reduced_data_100.txt", containing 100 items. The item count is set by the "max_count" variable in this file.
The script expects the input file to be in the "data" folder and saves the output file in the same folder.
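
The reduction step can be pictured with the minimal sketch below. It is not the code in ReduceDataset.py: the function names reduce_records and reduce_file are hypothetical, and it assumes that every product record in amazon-meta.txt begins with a line starting with "Id:". Only the "max_count" variable, the "data" folder, and the output filename pattern come from this README.

```python
from pathlib import Path
from typing import Iterable, Iterator


def reduce_records(lines: Iterable[str], max_count: int) -> Iterator[str]:
    """Yield the lines that belong to the first max_count product records."""
    seen = 0
    for line in lines:
        # Assumption: a new product record starts with a line beginning "Id:".
        if line.startswith("Id:"):
            seen += 1
            if seen > max_count:
                break
        # Lines before the first record (e.g. a file header) are skipped.
        if seen > 0:
            yield line


def reduce_file(data_dir: Path, max_count: int,
                source_name: str = "amazon-meta.txt") -> Path:
    """Read the source file from data_dir and write the reduced file next to it."""
    source = data_dir / source_name
    target = data_dir / f"amazon_reduced_data_{max_count}.txt"
    with open(source, encoding="utf-8", errors="replace") as src, \
            open(target, "w", encoding="utf-8") as dst:
        dst.writelines(reduce_records(src, max_count))
    return target
```

Under these assumptions, calling reduce_file(Path("data"), 100) from the AmazonReview folder would write "data/amazon_reduced_data_100.txt". Streaming the file line by line avoids loading the full dataset into memory.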

Note: Two sample reduced datasets, of size 100 and 4000, are already provided. The full dataset is available at https://snap.stanford.edu/data/web-Amazon.html

To see the recommendations only, run "ReviewNetwork.py".
It expects an input file named "amazon_reduced_data_100.txt". To use a different size, change the value of the "input_size" variable in the file.
The number of recommendations per product is set by the "number_of_recommendation" variable in the same file; it is currently 5.
If "input_size" is less than 200 items, the script shows the generated graph (nodes and edges) and saves it as a PDF in the "data" folder.
The output (the recommended products) is printed to the terminal but is not saved to a file.
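
How these settings fit together can be sketched as follows. This is an illustration only, not the code in ReviewNetwork.py: the helper names input_filename and should_draw_graph and the constant GRAPH_DISPLAY_LIMIT are hypothetical, and the relative "data" path assumes the script is run from the AmazonReview folder. The variable names "input_size" and "number_of_recommendation", the filename pattern, and the 200-item threshold are taken from this README.

```python
from pathlib import Path

# Threshold from the README: the graph is shown and saved only below 200 items.
GRAPH_DISPLAY_LIMIT = 200


def input_filename(input_size: int) -> str:
    """Build the expected input filename for a given dataset size."""
    return f"amazon_reduced_data_{input_size}.txt"


def should_draw_graph(input_size: int) -> bool:
    """Return True when the dataset is small enough to draw the graph."""
    return input_size < GRAPH_DISPLAY_LIMIT


# Settings named in the README, with their current values.
input_size = 100
number_of_recommendation = 5

# Assumed location of the input file, relative to the AmazonReview folder.
input_path = Path("data") / input_filename(input_size)
```

With the provided sample of 4000 items, setting input_size = 4000 selects "amazon_reduced_data_4000.txt", and should_draw_graph(4000) is False, so no graph would be shown or saved.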
