Add `dropout_path` as a structured form of `dropout_edge` #5531

EdisonLeeeee · 2022-09-25T15:01:32Z

This PR implements DropPath from MaskGAE: Masked Graph Modeling Meets Graph Autoencoders. DropPath is a structured form of DropEdge, which drops a group of edges (paths) based on random walks.

DropPath follows three steps to sample edges to drop:

Sample a set of root nodes $R$ with probability r from a Bernoulli distribution. (0<=r<=1),
Perform random walks starting from root nodes $R$
Drop edges sampled by random walks

Link to #5452

codecov · 2022-09-25T15:05:40Z

Codecov Report

Merging #5531 (c87956f) into master (7fb1eb6) will increase coverage by 0.02%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #5531      +/-   ##
==========================================
+ Coverage   83.69%   83.72%   +0.02%     
==========================================
  Files         346      346              
  Lines       19049    19080      +31     
==========================================
+ Hits        15943    15974      +31     
  Misses       3106     3106

Impacted Files	Coverage Δ
torch_geometric/utils/dropout.py	`100.00% <100.00%> (ø)`

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

torch_geometric/utils/dropout.py

EdisonLeeeee · 2022-09-26T08:36:20Z

A quick question (maybe stupid): how to disable the GitHub bot that always removes labels when I push the code? That's a bit annoying :-(

wsad1

Thanks for adding this.

test/utils/test_dropout.py

torch_geometric/utils/dropout.py

rusty1s · 2022-09-28T12:46:15Z

Let's also add it to the CHANGELOG.md. Feel free to merge afterwards.

EdisonLeeeee · 2022-09-28T16:03:16Z

Since there are some big changes made, I'm afraid it is still necessary to check it again :(

wsad1 · 2022-09-29T07:44:19Z

torch_geometric/utils/dropout.py

+
+    row, col = edge_index
+    sample_mask = torch.rand(row.size(0), device=edge_index.device) <= p
+    start = row[sample_mask].repeat(walks_per_node)


Doesn't this mean we'll start many more than walks_per_node random walks for nodes with a high out-degree.
Sorry for the back and forth, but I felt the way you defined start perviously was better and actually meant we start exactly walks_per_node random walks`.

Agreed. Or maybe we can add an option such as walk_by='edge' or walk_by='node' to align them?

Maybe I am missing something. Why do we want to support walk_by='edge'?

Oh sorry I meant we can sample starting nodes node-wise (previous version) or edge-wise (current version). For edge-wise sampling, as you mentioned, we can start random walks for nodes with a high out-degree, this seems to make sense for graphs with particular imbalanced structures.

I think this is the last one that needs to be resolved.

No strong opinion. I think we can safely merge this and wait on user feedback. To me, it makes sense to sample per edge since we are dropping paths.

Maybe just add a comment for now and clarify in the doc?

Alternatively, we can let p accept Tensor to specify the nodes to start random walks, rather than randomly sampled from nodes and edges. Let's leave it as future work and merge it now.

Apologies, i forgot to respond. its good that you merged it :).

No worries :)

rusty1s

Looks perfect :)

torch_geometric/utils/dropout.py

Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de>

torch_geometric/utils/dropout.py

) * add dropout_path * test * doc * doc and annotation * add is_sorted argument * doc * drop force_undirected argument * permute edge ids if edges were sorted * update test * update test * update README * drop p and q; rename r to p * changelog * sample on row * Update torch_geometric/utils/dropout.py Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de> * Update torch_geometric/utils/dropout.py Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de>

EdisonLeeeee added 2 commits September 25, 2022 22:55

add dropout_path

d1b4246

test

58b39ff

EdisonLeeeee added feature 1 - Priority P1 transform labels Sep 25, 2022

doc

ff1957b

github-actions bot removed the transform label Sep 25, 2022

doc and annotation

329d54b

EdisonLeeeee self-assigned this Sep 25, 2022

rusty1s added the transform label Sep 25, 2022

rusty1s reviewed Sep 26, 2022

View reviewed changes

torch_geometric/utils/dropout.py Outdated Show resolved Hide resolved

torch_geometric/utils/dropout.py Outdated Show resolved Hide resolved

torch_geometric/utils/dropout.py Show resolved Hide resolved

add is_sorted argument

3d65553

github-actions bot removed the transform label Sep 26, 2022

EdisonLeeeee added 4 commits September 26, 2022 16:41

doc

c8d9001

drop force_undirected argument

0d70da4

permute edge ids if edges were sorted

c8cd4b7

update test

496752f

EdisonLeeeee requested a review from rusty1s September 26, 2022 09:20

wsad1 approved these changes Sep 27, 2022

View reviewed changes

wsad1 reviewed Sep 27, 2022

View reviewed changes

test/utils/test_dropout.py Show resolved Hide resolved

EdisonLeeeee added 3 commits September 27, 2022 22:55

update test

6eb9dc2

Merge branch 'master' into dropout_path

2684455

update README

c7b891e

rusty1s approved these changes Sep 28, 2022

View reviewed changes

torch_geometric/utils/dropout.py Outdated Show resolved Hide resolved

torch_geometric/utils/dropout.py Outdated Show resolved Hide resolved

EdisonLeeeee added 2 commits September 28, 2022 23:53

drop p and q; rename r to p

beca824

changelog

3733347

sample on row

08cde11

wsad1 reviewed Sep 29, 2022

View reviewed changes

rusty1s approved these changes Sep 29, 2022

View reviewed changes

torch_geometric/utils/dropout.py Show resolved Hide resolved

EdisonLeeeee and others added 2 commits September 30, 2022 00:12

Update torch_geometric/utils/dropout.py

3bd530d

Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de>

Merge branch 'master' into dropout_path

de2ef8f

rusty1s reviewed Sep 30, 2022

View reviewed changes

torch_geometric/utils/dropout.py Outdated Show resolved Hide resolved

Update torch_geometric/utils/dropout.py

c87956f

EdisonLeeeee merged commit d8800b6 into master Sep 30, 2022

EdisonLeeeee deleted the dropout_path branch September 30, 2022 07:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `dropout_path` as a structured form of `dropout_edge` #5531

Add `dropout_path` as a structured form of `dropout_edge` #5531

EdisonLeeeee commented Sep 25, 2022 •

edited

Loading

codecov bot commented Sep 25, 2022 •

edited

Loading

EdisonLeeeee commented Sep 26, 2022

wsad1 left a comment

rusty1s commented Sep 28, 2022

EdisonLeeeee commented Sep 28, 2022

wsad1 Sep 29, 2022

EdisonLeeeee Sep 29, 2022

wsad1 Sep 29, 2022

EdisonLeeeee Sep 29, 2022

EdisonLeeeee Sep 30, 2022

rusty1s Sep 30, 2022

rusty1s Sep 30, 2022 •

edited

Loading

EdisonLeeeee Sep 30, 2022

wsad1 Sep 30, 2022

EdisonLeeeee Sep 30, 2022

rusty1s left a comment

Add dropout_path as a structured form of dropout_edge #5531

Add dropout_path as a structured form of dropout_edge #5531

Conversation

EdisonLeeeee commented Sep 25, 2022 • edited Loading

codecov bot commented Sep 25, 2022 • edited Loading

Codecov Report

EdisonLeeeee commented Sep 26, 2022

wsad1 left a comment

Choose a reason for hiding this comment

rusty1s commented Sep 28, 2022

EdisonLeeeee commented Sep 28, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rusty1s Sep 30, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rusty1s left a comment

Choose a reason for hiding this comment

Add `dropout_path` as a structured form of `dropout_edge` #5531

Add `dropout_path` as a structured form of `dropout_edge` #5531

EdisonLeeeee commented Sep 25, 2022 •

edited

Loading

codecov bot commented Sep 25, 2022 •

edited

Loading

rusty1s Sep 30, 2022 •

edited

Loading