-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add the unsupervised bipartite GraphSAGE model on the Taobao dataset #6144
Conversation
for more information, see https://pre-commit.ci
Codecov Report
@@ Coverage Diff @@
## master #6144 +/- ##
=======================================
Coverage 85.09% 85.09%
=======================================
Files 402 402
Lines 21672 21672
=======================================
Hits 18442 18442
Misses 3230 3230
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you!
#6431 created a `pyproject.toml` which aimed to create a single configuration file for `yapf`, `isort`, `coverage`, `pyright` among others. All the configuration for `yapf` now lies within the `pyproject.toml` file. This PR aims to delete the redundant `.style.yapf` possibly added back after (#6144). Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>
This PR adds an implementation of unsupervised bipartite GraphSAGE on the Taobao User Behaviors dataset offered by Alibaba. The Taobao dataset contains a heterogeneous graph, where nodes represent users and items, and edges represent different types of behaviors between users and items.
We use the i2i co-occurrence matrix to construct the i2i-graph. When applying GraphSAGE, the model follows the i-i-i pattern to encode the item embedding and the i-i-u pattern to encode the user embedding. As the task is unsupervised, we use negative sampling and
binary_cross_entropy_with_logits
to compute the loss in the model.