Python notebook to explore the power of local clustering coefficient to approximate the Salesrank

Sproc01/Lfn_AmazonBookAnalysis

Lfn_AmazonBookAnalysis

The dataset used is available at the Stanford site

The project's main idea is explained in the Project Proposal

The project's mid-term report is the Mid-Term Report

The project's final report is the Report

Motivation

This project explores the potential of the local clustering coefficient as a measure of popularity for books and genres. The clustering coefficient gives a strong metric of the correlation between books bought together, so analysing how often a book appears in a triangle yields a possible ordering of the books. Given the values of the local clustering coefficient, we investigate whether this ordering can approximate the salesrank order of the books. We also compare the orderings of the genres obtained from the salesrank and from the clustering coefficients of the books within each genre. Finally, we try to classify each book into four categories based on a new joint definition of popularity that we introduce, with the objective of capturing the nature of the books.
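As a rough illustration of the quantity discussed above, the following minimal sketch computes the local clustering coefficient on a toy co-purchase graph and orders the nodes by it. It assumes an undirected graph stored as a dictionary of neighbour sets. The function name `local_clustering`, the node labels and the edges are hypothetical and are not taken from the project notebook or the dataset.

```python
from itertools import combinations


def local_clustering(adj):
    """Return {node: local clustering coefficient} for an undirected graph.

    adj maps each node to the set of its neighbours (assumed symmetric).
    """
    coeffs = {}
    for node, neighbours in adj.items():
        k = len(neighbours)
        if k < 2:
            # Fewer than two neighbours: the node cannot be part of a triangle.
            coeffs[node] = 0.0
            continue
        # Count pairs of neighbours that are themselves connected,
        # i.e. the triangles that pass through this node.
        links = sum(1 for u, v in combinations(neighbours, 2) if v in adj[u])
        coeffs[node] = 2.0 * links / (k * (k - 1))
    return coeffs


# Hypothetical toy graph: an edge means "bought together".
edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
adj = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)
    adj.setdefault(v, set()).add(u)

coeffs = local_clustering(adj)

# Order nodes by decreasing coefficient, breaking ties by label.
ranking = sorted(coeffs, key=lambda n: (-coeffs[n], n))
print(ranking)
```

An ordering like `ranking` could then be compared with the salesrank order, for instance through a rank correlation. How ties are broken here is an arbitrary choice made for the sketch.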
