
GNN-VPA: A Variance-Preserving Aggregation Strategy for Graph Neural Networks

This repository contains the code base for the paper *GNN-VPA: A Variance-Preserving Aggregation Strategy for Graph Neural Networks* (arXiv:2403.04747).

Abstract

Graph neural networks (GNNs), and especially message-passing neural networks, excel in various domains such as physics, drug discovery, and molecular modeling. The expressivity of GNNs with respect to their ability to discriminate non-isomorphic graphs critically depends on the functions employed for message aggregation and graph-level readout. By applying signal propagation theory, we propose a variance-preserving aggregation function (VPA) that maintains expressivity, but yields improved forward and backward dynamics. Experiments demonstrate that VPA leads to increased predictive performance for popular GNN architectures as well as improved learning dynamics. Our results could pave the way towards normalizer-free or self-normalizing GNNs.

Figure 1: Comparison of commonly used message aggregation functions and variance-preserving aggregation.
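
To illustrate the idea, here is a minimal, self-contained sketch (not the repository's implementation) of how VPA relates to the sum and mean aggregators: summing n i.i.d. messages scales the variance by n, so dividing the sum by sqrt(n) keeps it constant, whereas the mean (dividing by n) shrinks it.

```python
import torch

def aggregate(messages, index, num_nodes, mode="vpa"):
    """Aggregate per-edge messages onto their target nodes.

    messages: [num_edges, dim] message carried by each edge
    index:    [num_edges] target node of each message
    """
    summed = torch.zeros(num_nodes, messages.size(-1)).index_add_(0, index, messages)
    counts = torch.zeros(num_nodes).index_add_(0, index, torch.ones(index.numel()))
    counts = counts.clamp(min=1)  # avoid division by zero for isolated nodes
    if mode == "sum":
        return summed
    if mode == "mean":
        return summed / counts.unsqueeze(-1)
    # VPA: scale the sum by 1/sqrt(n); for i.i.d. unit-variance messages,
    # the aggregated features keep unit variance as well.
    return summed / counts.sqrt().unsqueeze(-1)
```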

Setup

Install using conda:

conda env create -f environment.yaml
conda activate gnn-vpa

Install using pip:

pip install -r requirements.txt

Data

The method was evaluated on nine datasets from the TUDataset collection [1]: five social network datasets (IMDB-BINARY, IMDB-MULTI, COLLAB, REDDIT-BINARY, and REDDIT-MULTI-5K) and four bioinformatics datasets (MUTAG, PROTEINS, PTC, and NCI1). All datasets are downloaded automatically upon the first call of the dataloader defined in ./src/datasets.py.
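
The repository's dataloader builds on PyTorch Geometric; if you just want to inspect the raw data, a TUDataset can also be fetched directly (the dataset name and root directory below are examples):

```python
from torch_geometric.datasets import TUDataset

# Downloads the dataset into ./data on first use, then loads it from disk.
dataset = TUDataset(root="data", name="MUTAG")
print(len(dataset), dataset.num_classes, dataset.num_features)
```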

Usage

Train a single GNN:

python train.py model=gin agg=vpa dataset_name=IMDB-BINARY batch_run=false

To run on all nine datasets with 10-fold cross-validation, set the batch_run flag:

python train.py model=gin agg=vpa batch_run=true

To reproduce all results from the paper:

python train.py --multirun model=gin,gcn agg=vpa,sum,mean,max batch_run=true
python train.py --multirun model=sgc,gat agg=vpa,default batch_run=true

For a complete overview of available parameters see ./conf.
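
For intuition about what the agg=vpa option changes inside a layer, here is a hedged sketch of a GIN-style convolution with variance-preserving aggregation, written against the standard PyTorch Geometric MessagePassing API (the class name VPAGINConv and its structure are ours for illustration; see the repository source for the actual implementation):

```python
import torch
from torch_geometric.nn import MessagePassing
from torch_geometric.utils import degree

class VPAGINConv(MessagePassing):
    """GIN-style convolution with variance-preserving neighbor aggregation."""

    def __init__(self, mlp, eps=0.0):
        super().__init__(aggr="add")  # sum messages first, rescale below
        self.mlp = mlp
        self.eps = eps

    def forward(self, x, edge_index):
        # Sum-aggregate neighbor features (the default message is x_j).
        out = self.propagate(edge_index, x=x)
        # Rescale each node's sum by 1/sqrt(in-degree): the VPA step.
        deg = degree(edge_index[1], x.size(0), dtype=x.dtype).clamp(min=1)
        out = out / deg.sqrt().unsqueeze(-1)
        # GIN update: MLP((1 + eps) * x_v + aggregated neighbors).
        return self.mlp((1 + self.eps) * x + out)
```

Here mlp is any node-wise network, e.g. a two-layer torch.nn.Sequential MLP.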

Sources

Our code is based on the PyTorch Geometric framework [2], and the implementation of the GIN architecture is inspired by [3].

[1] Morris, C., Kriege, N. M., Bause, F., Kersting, K., Mutzel, P., and Neumann, M. TUDataset: A collection of benchmark datasets for learning with graphs. In ICML 2020 Workshop on Graph Representation Learning and Beyond (GRL+ 2020), 2020.

[2] PyTorch Geometric documentation: https://pytorch-geometric.readthedocs.io/en/latest/

[3] GIN reference implementation: https://github.com/weihua916/powerful-gnns/tree/master

Citation

If you find this work helpful, please cite:

@article{schneckenreiter_gnn-vpa_2024,
   author = {Schneckenreiter, Lisa and Freinschlag, Richard and Sestak, Florian and Brandstetter, Johannes and Klambauer, G{\"u}nter and Mayr, Andreas},
   title = {{GNN-VPA}: A Variance-Preserving Aggregation Strategy for Graph Neural Networks},
   journal = {arXiv preprint arXiv:2403.04747},
   year = {2024},
   institution = {ELLIS Unit and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University, Linz},
   doi = {10.48550/arXiv.2403.04747}
}
