Skip to content

Basic recommendation engine#57

Merged
vishalbollu merged 5 commits into
masterfrom
recommendations-example
Apr 26, 2019
Merged

Basic recommendation engine#57
vishalbollu merged 5 commits into
masterfrom
recommendations-example

Conversation

@vishalbollu
Copy link
Copy Markdown
Contributor

@vishalbollu vishalbollu commented Apr 18, 2019

Closes #58


Checklist:

  • Run make test and make lint
  • Test end to end manually (e.g. build/push all images, restart local operator, and run cx refresh in an example folder)
  • Update documentation
  • Update examples and cx init
  • Alert team if dev environment changed
  • Cherry-pick into release branches if it's a bugfix
  • Delete the branch once it's merged

@vishalbollu
Copy link
Copy Markdown
Contributor Author

vishalbollu commented Apr 22, 2019

  • showcases a collaborative filtering approach to recommendation system using deep learning
  • uses movielens 100k dataset dataset as input
  • dataset is a zip file containing ratings.csv, movies.csv and other information
  • our pipeline will use ratings.csv to group similar users to each other and similar movies to eachother and predict the rating that a user may give to a movie
  • dataset contains the reviews of 600 users for 9000 movies with the schema: userid,movieid,rating,timestamp
  • the userid range is [1, 610] and movieid range from [1, 193609]
  • movieid range is large compared to the number of unique movies in dataset so shrink the range by creating a new index of only the movies that are present in the dataset
  • index userid and movieid columns by treating them as string and using index_string
  • embed indexed userid and movieid columns
  • train a regression model to minimize RMSE for rating predictions
  • predict the rating of a user with a userid that is mentioned in the input dataset to an unrated movie with movieid that is in the input dataset

@vishalbollu vishalbollu changed the title [WIP] Basic recommendation engine Basic recommendation engine Apr 22, 2019
- user_id_index
- movie_id_index
hparams:
embedding_size: 10
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you split embedding_size for each of the features?

@vishalbollu vishalbollu merged commit 15a2f7a into master Apr 26, 2019
@vishalbollu vishalbollu deleted the recommendations-example branch April 26, 2019 23:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add recommendation example

3 participants