This repository hosts the code for the paper *Fine-grained Attention in Hierarchical Transformers for Tabular Time-series* by R. Azorin, Z. Ben Houidi, M. Gallo, A. Finamore, and P. Michiardi, accepted at the 10th MiLeTS Workshop at KDD'24.
Fieldy is a fine-grained hierarchical Transformer that contextualizes fields at both the row and column levels. We compare our proposal against state-of-the-art models on regression and classification tasks using public tabular time-series datasets. Our results show that combining row-wise and column-wise attention improves performance without increasing model size.
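For intuition only, below is a minimal PyTorch sketch of the row-wise plus column-wise attention idea: field embeddings are arranged in a (rows × columns) grid, attention is applied along each row and along each column, and the two views are combined. This is not the repository's implementation; the class name `RowColumnAttention` and all parameters are hypothetical.

```python
# Illustrative sketch of row-wise + column-wise attention over a grid of field
# embeddings; names and shapes are hypothetical, not the repository's code.
import torch
import torch.nn as nn

class RowColumnAttention(nn.Module):
    def __init__(self, d_model=64, n_heads=4):
        super().__init__()
        self.row_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.col_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x):
        # x: (batch, n_rows, n_cols, d_model) grid of field embeddings
        b, r, c, d = x.shape
        # Row-wise attention: each field attends to the other fields in its row.
        rows = x.reshape(b * r, c, d)
        rows, _ = self.row_attn(rows, rows, rows)
        # Column-wise attention: each field attends to the same field across rows (time).
        cols = x.permute(0, 2, 1, 3).reshape(b * c, r, d)
        cols, _ = self.col_attn(cols, cols, cols)
        cols = cols.reshape(b, c, r, d).permute(0, 2, 1, 3)
        # Combine the two contextualizations of each field.
        return rows.reshape(b, r, c, d) + cols

x = torch.randn(2, 10, 12, 64)  # 2 samples, 10 rows (time steps), 12 columns (fields)
print(RowColumnAttention()(x).shape)  # torch.Size([2, 10, 12, 64])
```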
- Create an environment with `conda create --name fieldy python==3.8.16`
- Activate it with `conda activate fieldy`
- Install the requirements with `pip install -r requirements.txt`
Note: you need a CUDA device with at least 16 GB of VRAM to train all the models. If you want to use CPU or multi-GPU training, you may consider integrating HuggingFace Accelerate into the existing code.
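Before launching a run, you can optionally check that a CUDA device is visible and how much VRAM it has. This is a minimal sketch assuming PyTorch is installed via the requirements above:

```python
# Optional sanity check: is a CUDA device visible, and how much VRAM does it have?
# Assumes PyTorch is installed via requirements.txt.
import torch

if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"{props.name}: {props.total_memory / 1024**3:.1f} GB VRAM")
else:
    print("No CUDA device found; training all models requires >= 16 GB of VRAM.")
```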
Choose an option below to load the preprocessed datasets (a quick sanity check follows the list):
- With Dropbox:
  - Use this download link
  - Move the downloaded files to:
    - `./data/kdd/KDDDataset_pt.pkl`
    - `./data/kdd/KDDDataset_ft.pkl`
    - `./data/prsa/PRSADataset_labeled.pkl`
- With the raw CSVs:
  - To preprocess the KDD dataset, use `./dataset/kdd.ipynb`.
  - To preprocess the PRSA dataset, nothing else is needed: preprocessing is triggered automatically.
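Whichever option you pick, a quick way to confirm the files are in place and readable is to unpickle them. This sketch assumes the `.pkl` files are standard Python pickles; the exact object types depend on the preprocessing code:

```python
# Quick check that the preprocessed datasets are where the code expects them.
# Assumes the .pkl files are standard Python pickles.
import pickle

paths = [
    "./data/kdd/KDDDataset_pt.pkl",
    "./data/kdd/KDDDataset_ft.pkl",
    "./data/prsa/PRSADataset_labeled.pkl",
]
for path in paths:
    with open(path, "rb") as f:
        obj = pickle.load(f)
    print(path, "->", type(obj).__name__)
```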
- To train and evaluate models on the KDD Loan default prediction dataset:
  - Run `chmod +x kdd.sh`
  - Run `./kdd.sh`
- To train and evaluate models on the PRSA Beijing pollution dataset:
  - Run `chmod +x prsa.sh`
  - Run `./prsa.sh`
Results will be saved under `./results`.
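To see what a run produced, you can simply list the files under `./results`; their exact names and formats depend on the training scripts:

```python
# List the files written under ./results after a run.
from pathlib import Path

for p in sorted(Path("./results").rglob("*")):
    if p.is_file():
        print(p)
```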
Use `./plots/results2latex.ipynb` to convert the saved results into LaTeX tables.
Use `./plots/field_wise_attention.ipynb` to visualize field-wise attention.
If you use this paper or code as a reference, please cite it with:
@misc{azorin2024finegrained,
  title={Fine-grained Attention in Hierarchical Transformers for Tabular Time-series},
  author={Raphael Azorin and Zied Ben Houidi and Massimo Gallo and Alessandro Finamore and Pietro Michiardi},
  year={2024},
  eprint={2406.15327},
  archivePrefix={arXiv},
}
This repository is built on top of TabBERT. We would also like to thank the authors of UniTTab for discussions on metrics and pre-processing.