Skip to content

Basic recommendation engine #57

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 5 commits into from
Apr 26, 2019
Merged

Basic recommendation engine #57

merged 5 commits into from
Apr 26, 2019

Conversation

vishalbollu
Copy link
Contributor

@vishalbollu vishalbollu commented Apr 18, 2019

Closes #58


Checklist:

  • Run make test and make lint
  • Test end to end manually (e.g. build/push all images, restart local operator, and run cx refresh in an example folder)
  • Update documentation
  • Update examples and cx init
  • Alert team if dev environment changed
  • Cherry-pick into release branches if it's a bugfix
  • Delete the branch once it's merged

@vishalbollu
Copy link
Contributor Author

vishalbollu commented Apr 22, 2019

  • showcases a collaborative filtering approach to recommendation system using deep learning
  • uses movielens 100k dataset dataset as input
  • dataset is a zip file containing ratings.csv, movies.csv and other information
  • our pipeline will use ratings.csv to group similar users to each other and similar movies to eachother and predict the rating that a user may give to a movie
  • dataset contains the reviews of 600 users for 9000 movies with the schema: userid,movieid,rating,timestamp
  • the userid range is [1, 610] and movieid range from [1, 193609]
  • movieid range is large compared to the number of unique movies in dataset so shrink the range by creating a new index of only the movies that are present in the dataset
  • index userid and movieid columns by treating them as string and using index_string
  • embed indexed userid and movieid columns
  • train a regression model to minimize RMSE for rating predictions
  • predict the rating of a user with a userid that is mentioned in the input dataset to an unrated movie with movieid that is in the input dataset

@vishalbollu vishalbollu changed the title [WIP] Basic recommendation engine Basic recommendation engine Apr 22, 2019
- user_id_index
- movie_id_index
hparams:
embedding_size: 10
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you split embedding_size for each of the features?

@vishalbollu vishalbollu merged commit 15a2f7a into master Apr 26, 2019
@vishalbollu vishalbollu deleted the recommendations-example branch April 26, 2019 23:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add recommendation example
3 participants