Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

This is the official code implementation of the proposed methods introduced in the EMNLP 2024 paper:

The relation-extraction-based reading order prediction method on the ROOR dataset;
The reading-order-relation-enhanced methods on several VrD tasks.

To run these codes, please refer to the README file in the corresponding directories (rop/ and rore/). For the proposed ROOR dataset, please refer to ROOR-Datasets.

Citation

If the work is helpful to you, please kindly cite the paper as:

@article{zhang2024modeling,
  title={Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding},
  author={Zhang, Chong and Tu, Yi and Zhao, Yixi and Yuan, Chenshu and Chen, Huan and Zhang, Yue and Chai, Mingxu and Guo, Ya and Zhu, Huijia and Zhang, Qi and others},
  journal={arXiv preprint arXiv:2409.19672},
  year={2024}
}

Requirements

The experiments require at least 24GB GPU memory and CUDA 11.3.
Please make sure that torchtext is not installed, which could possibly cause unknown problems with pytorch-lightning installed.

conda create -n LayoutIE python=3.8
conda activate LayoutIE
pip install pip==24.0
pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
pip install torchmetrics==0.11.1
pip install transformers==4.26.1
pip install pytorch-lightning==1.5.9
pip install nltk==3.8.1
pip install jieba==0.42.1
pip install seqeval==1.2.2
pip install ark_nlp==0.0.9
pip install opencv-python-headless==4.7.0.68
pip install timm==0.6.12
pip install sentencepiece==0.1.97
pip install six==1.16.0
pip install textdistance==4.6.2

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
make_weights		make_weights
rop		rop
rore		rore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

Citation

Requirements

About

Releases

Packages

Languages

License

chongzhangFDU/ROOR

Folders and files

Latest commit

History

Repository files navigation

Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

Citation

Requirements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages