This repository provides a curated list of resources in the research domain of Document Layout Analysis (DLA), updated in my free time.
Methods based on heuristic rules are not included.
"Document Layout Analysis" OR "Document Layout Detection" OR "Page Object Detection" OR "Document Structure Analysis" OR "Document Structure Extraction" OR "Document Reading Order" OR "Document Hierarchy"
- Survey / Competition
- Methodology
  - 2.1 Page Object Detection
  - 2.2 Document Structure Analysis
- Data
  - 3.1 Dataset
  - 3.2 Data Synthesis / Data Generation / Data Augmentation
- Datasets and annotations for layout analysis of scientific articles, (UNIFI), 2024-03, IJDAR-2024
- Advancements in Financial Document Structure Extraction: Insights from Five Years of FinTOC (2019-2023), (Outscale, NITK, Lancaster), 2023-12, BigData-2023
- Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis, (UVA, Elsevier), 2023-08, arXiv
- ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents, (IBM), 2023-05, ICDAR-2023
- A Methodological Study of Document Layout Analysis, (XJU), 2022-10, VRHCIAI-2022
- A Survey of Graphical Page Object Detection with Deep Neural Networks, (TU Kaiserslautern, DFKI), 2021-06, Applied-Sciences.2021
- ICDAR 2021 Competition on Scientific Literature Parsing, (IBM, UniMelb, Oracle), 2021-06, ICDAR-2021
- Document Layout Analysis: A Comprehensive Survey, (KFUPM), 2019-10, ACM-Computing-Surveys.2019
- ICDAR2019 Competition on Recognition of Documents with Complex Layouts - RDCL2019, (Salford), 2019-09, ICDAR-2019
- ICDAR2017 Competition on Page Object Detection, (PKU), 2017-11, ICDAR-2017
- ICDAR2017 Competition on Recognition of Documents with Complex Layouts - RDCL2017, (Salford), 2017-11, ICDAR-2017
DocSAM
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning, (CAS, UCAS), 2025-04, CVPR-2025
DLAdapter
SFDLA: Source-Free Document Layout Analysis, (KIT, ETH Zurich), 2025-03, arXiv
PP-DocLayout
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction, (Baidu), 2025-03, arXiv
UniHDSA
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis, (USTC, Microsoft), 2025-03, Pattern-Recognition.2025
EDocNet
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation, (SEU), 2025-02, arXiv
DoPTA
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment, (Adobe), 2024-12, arXiv
DocEDA
DocEDA: Automated Extraction and Design of Analog Circuits from Documents with Large Language Model, (SEU), 2024-12, arXiv
DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception, (Shanghai AI Lab), 2024-10, arXiv
UnSupDLA
UnSupDLA: Towards Unsupervised Document Layout Analysis, (TU Kaiserslautern, DFKI), 2024-06, ICDAR-2024-Workshop
DLAFormer
DLAFormer: An End-to-End Transformer For Document Layout Analysis, (USTC, Microsoft), 2024-05, ICDAR-2024
CREPE
CREPE: Coordinate-Aware End-to-End Document Parser, (Naver, LINE WORKS), 2024-05, ICDAR-2024
- A Hybrid Approach for Document Layout Analysis in Document images, (TU Kaiserslautern, DFKI), 2024-04, ICDAR-2024
M2Doc
M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis, (SCUT, Alibaba), 2024-03, AAAI-2024
RoDLA
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models, (KIT, Oxford), 2024-03, CVPR-2024
Detect-Order-Construct
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis, (USTC, Microsoft), 2024-01, Pattern-Recognition.2024
- The YOLO model that still excels in document layout analysis, (XJU), 2023-11, Signal-Image-and-Video-Processing.2024
DOLNet
Dataset agnostic document object detection, (IIIT), 2023-10, Pattern-Recognition.2023
DSG
DSG: An End-to-End Document Structure Generator, (ETH Zurich, LMU Munich), 2023-10, ICDM-2023
VGT
Vision Grid Transformer for Document Layout Analysis, (Alibaba), 2023-08, ICCV-2023
- A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images, (Microsoft, USTC, PKU), 2023-08, ICDAR-2023
GLAM
A Graphical Approach to Document Layout Analysis, (Kensho, Meta, Google), 2023-08, ICDAR-2023
HiM
HiM: hierarchical multimodal network for document layout analysis, (QUST, ASU), 2023-07, Applied-Intelligence.2023
DRFN
DRFN: A unified framework for complex document layout analysis, (ECNU, Fudan, Videt), 2023-05, Information-Processing-&-Management.2023
TransDLANet
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis, (SCUT, Huawei, IntSig), 2023-05, CVPR-2023
WeLayout
WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents, (Tencent), 2023-05, arXiv
SwinDocSegmenter
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation, (UAB, ISI), 2023-05, ICDAR-2023
SelfDocSeg
SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation, (ISI, UAB, IITK), 2023-05, ICDAR-2023
StrucTexTv2
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training, (Baidu), 2023-03, ICLR-2023
HSCA-Net
HSCA-Net: A Hybrid Spatial-Channel Attention Network in Multiscale Feature Pyramid for Document Layout Analysis, (QUST, PITT), 2022-12, Journal-of-Artificial-Intelligence-and-Technology.2023
- Rethinking Learnable Proposals for Graphical Object Detection in Scanned Document Images, (TU Kaiserslautern, DFKI, LUT), 2022-10, Applied-Sciences.2022
TRDLU
Transformer-Based Approach for Document Layout Understanding, (KSU), 2022-10, ICIP-2022
PP-StructureV2
PP-StructureV2: A Stronger Document Analysis System, (Baidu), 2022-10, arXiv
- Lateral Feature Enhancement Network for Page Object Detection, (QUST), 2022-08, TIM.2022
Doc-GCN
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis, (Sydney, UWA), 2022-08, COLING-2022
- Investigating Attention Mechanism for Page Object Detection in Document Images, (TU Kaiserslautern, DFKI, LTU), 2022-07, Applied-Sciences.2022
DSAP
Exploiting Spatial Attention and Contextual Information for Document Image Segmentation, (XMU, Northumbria, SZU), 2022-05, PAKDD-2022
LayoutLMv3
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking, (SYSU, Microsoft), 2022-04, ACM-MM-2022
SRRV
SRRV: A Novel Document Object Detector Based on Spatial-Related Relation and Vision, (QUST), 2022-04, IEEE-Transactions-on-Multimedia.2022
DiT
DiT: Self-supervised Pre-training for Document Image Transformer, (SJTU, Microsoft), 2022-03, ACM-MM-2022
DocBed
DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts, (Amazon), 2022-02, IAAI-2022
DocSegTr
DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer, (UAB, ISI), 2022-01, arXiv
UDoc
Unified Pretraining Framework for Document Understanding, (Adobe), 2021-12, NeurIPS-2021
L-E3Net
Document Layout Analysis with Aesthetic-Guided Image Augmentation, (Fudan, ECNU, Videt), 2021-11, arXiv
- Deep Learning in Time-Frequency Domain for Document Layout Analysis, (UDLA, DETRI, UNICAMP, Analog Devices, ESPE), 2021-11, IEEE-Access.2021
E3Net
Document image layout analysis via explicit edge embedding network, (ECNU, Videt), 2021-10, Information-Sciences.2021
- A Page Object Detection Method Based on Mask R-CNN, (QUST, ASU, HIT), 2021-10, IEEE-Access.2021
VTLayout
VTLayout: Fusion of Visual and Text Features for Document Layout Analysis, (UCAS, CAS), 2021-08, PRICAI-2021
FS-PARN
Few-shot prototype alignment regularization network for document image layout segementation, (UESTC, YZU, Kyudai, Afanti, QDU), 2021-07, Pattern-Recognition.2021
- End-to-end dilated convolution network for document image semantic segmentation, (QUST, ASU), 2021-07, Journal-of-Central-South-University.2021
- Beyond document object detection: instance-level segmentation of complex layouts, (UAB, ISI), 2021-07, IJDAR.2021
SelfDoc
SelfDoc: Self-Supervised Document Representation Learning, (Brandeis, Adobe), 2021-06, CVPR-2021
VSR
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations, (Hikvision, ZJU), 2021-05, ICDAR-2021
- Document Layout Analysis with an Enhanced Object Detector, (TU Kaiserslautern, DFKI), 2021-04, IPRIA-2021
LAMPRET
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding, (UCLA, Google), 2021-04, arXiv
DRFN
Document Layout Analysis via Dynamic Residual Feature Fusion, (ECNU, Videt), 2021-04, ICME-2021
- Probabilistic homogeneity for document image segmentation, (VUB), 2021-01, Pattern-Recognition.2021
BINYAS
BINYAS: a complex document layout analysis system, (GKCIET, Jadavpur), 2020-11, Multimedia-Tools-and-Applications.2021
- Vision-Based Layout Detection from Scientific Literature using Recurrent Convolutional Neural Networks, (KSU), 2020-10, ICPR-2020
- Page Segmentation Using Convolutional Neural Network and Graphical Model, (CAS, UCAS), 2020-08, DAS-2020
- Document Structure Extraction using Prior based High Resolution Hierarchical Semantic Segmentation, (Adobe), 2019-11, ECCV-2020
DocParser
DocParser: Hierarchical Structure Parsing of Document Renderings, (ETH Zurich), 2019-11, AAAI-2021
- Instance Aware Document Image Segmentation using Label Pyramid Networks and Deep Watershed Transformation, (CAS, UCAS, Tencent, univ-lr), 2019-09, ICDAR-2019
- Page Segmentation using a Convolutional Neural Network with Trainable Co-Occurrence Features, (Kyudai, SIT), 2019-09, ICDAR-2019
- Page Object Detection from PDF Document Images by Deep Structured Prediction and Supervised Clustering, (CAS, UCAS), 2018-08, ICPR-2018
DeepLayout
DeepLayout: A Semantic Segmentation Approach to Page Layout Analysis, (PKU), 2018-07, ICIC-2018
dhSegment
dhSegment: A generic deep-learning approach for document segmentation, (EPFL), 2018-04, ICFHR-2018
- Ensemble of Deep Object Detectors for Page Object Detection, (VNU-HCM), 2018-01, IMCOM-2018
- Multi-Scale Multi-Task FCN for Semantic Page Segmentation and Table Detection, (PSU, Adobe), 2017-11, ICDAR-2017
- CNN Based Page Object Detection in Document Images, (PKU), 2017-11, ICDAR-2017
- Fast CNN-Based Document Layout Analysis, (IBM), 2017-10, ICCV-Workshop-2017
MFCN
Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network, (PSU, Adobe), 2017-06, CVPR-2017
SCAN
SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation, (NEC), 2025-05, arXiv
XY-Cut++
XY-Cut++: Advanced Layout Ordering via Hierarchical Mask Mechanism on a Novel Benchmark, (TJU), 2025-04, arXiv
UniHDSA
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis, (USTC, Microsoft), 2025-03, Pattern-Recognition.2025
DRGG
Graph-based Document Structure Analysis, (KIT), 2025-02, ICLR-2025
DLAFormer
DLAFormer: An End-to-End Transformer For Document Layout Analysis, (USTC, Microsoft), 2024-05, ICDAR-2024
- Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction, (Allen AI, HUJI, BIU), 2024-02, ACL-2024
Detect-Order-Construct
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis, (USTC, Microsoft), 2024-01, Pattern-Recognition.2024
DSG
DSG: An End-to-End Document Structure Generator, (ETH Zurich, LMU Munich), 2023-10, ICDM-2023
- A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images, (Microsoft, USTC, PKU), 2023-08, ICDAR-2023
- Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation, (Google), 2023-05, ICDAR-2023
DSPS
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures, (USTC, iFLYTEK), 2023-02, AAAI-2023
LayerDoc
LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents, (UMD, Adobe), 2023-01, WACV-2023
MTD
Multimodal Tree Decoder for Table of Contents Extraction in Document Images, (USTC, iFLYTEK), 2022-08, ICPR-2022
XYLayoutLM
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding, (SJTU, Ant Group), 2022-03, CVPR-2022
- Reading order detection on handwritten documents, (UPV), 2022-02, Neural-Computing-and-Applications.2022
LayoutReader
LayoutReader: Pre-training of Text and Layout for Reading Order Detection, (UC San Diego, Microsoft), 2021-08, EMNLP-2021
HELD
Extracting Variable-Depth Logical Document Hierarchy from Long Documents: Method, Evaluation, and Application, (CAS, UCAS, Tencent, Peng Cheng Lab), 2021-05, Journal-of-Computer-Science-and-Technology.2022
LAMPRET
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding, (UCLA, Google), 2021-04, arXiv
- An End-to-End OCR Text Re-organization Sequence Learning for Rich-Text Detail Image Comprehension, (ZJU, Alibaba), 2020-11, ECCV-2020
DocStruct
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding, (Sensetime), 2020-10, EMNLP-2020
DocParser
DocParser: Hierarchical Structure Parsing of Document Renderings, (ETH Zurich), 2019-11, AAAI-2021
- Table-Of-Contents generation on contemporary documents, (Fortia), 2019-09, ICDAR-2019
DocBench-100
XY-Cut++: Advanced Layout Ordering via Hierarchical Mask Mechanism on a Novel Benchmark, (TJU), 2025-04, arXiv
AnnoPage
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization, (VUTBR, MZK, NKP, CAS), 2025-03, arXiv
SFDLA
SFDLA: Source-Free Document Layout Analysis, (KIT, ETH Zurich), 2025-03, arXiv
GraphDoc
Graph-based Document Structure Analysis, (KIT), 2025-02, ICLR-2025
OmniDocBench
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations, (Shanghai AI Lab, Abaka AI, 2077AI), 2024-12, CVPR-2025
LADaS 2.0
Diachronic Document Dataset for Semantic Layout Analysis, (Inria, PSL, IHEID, UNIGE), 2024-11, arXiv
READoc
READoc: A Unified Benchmark for Realistic Document Structured Extraction, (CAS, UCAS), 2024-09, arXiv
SciPostLayout
SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters, (OMRON SINIC X, Waseda), 2024-07, BMVC-2024
Comp-HRDoc
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis, (USTC, Microsoft), 2024-01, Pattern-Recognition.2024
ADOPD
ADOPD: A Large-Scale Document Page Decomposition Dataset, (Adobe, OSU, UC Merced, JHU), 2024-01, ICLR-2024
E-Periodica
DSG: An End-to-End Document Structure Generator, (ETH Zurich, LMU Munich), 2023-10, ICDM-2023
D4LA
Vision Grid Transformer for Document Layout Analysis, (Alibaba), 2023-08, ICCV-2023
CDSSE
DRFN: A unified framework for complex document layout analysis, (ECNU, Fudan, Videt), 2023-05, Information-Processing-&-Management.2023
M6Doc
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis, (SCUT, Huawei, IntSig), 2023-05, CVPR-2023
ETD-ODv2
A New Annotation Method and Dataset for Layout Analysis of Long Documents, (Virginia Tech), 2023-04, WWW-2023
BaDLAD
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset, (Bengali.AI, SUST, BRACU, Vanderbilt, Rice, RPI), 2023-03, ICDAR-2023
HRDoc
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures, (USTC, iFLYTEK), 2023-02, AAAI-2023
TexBiG
A Dataset for Analysing Complex Document Layouts in the Digital Humanities and Its Evaluation with Krippendorff’s Alpha, (uni-weimar), 2022-09, DAGM-GCPR-2022
HierDoc
Multimodal Tree Decoder for Table of Contents Extraction in Document Images, (USTC, iFLYTEK), 2022-08, ICPR-2022
DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis, (IBM), 2022-06, SIGKDD-2022
NewsNet7
DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts, (Amazon), 2022-02, IAAI-2022
DAD
Segmentation for document layout analysis: not dead yet, (USask, Living Sky), 2022-01, IJDAR.2022
FPD
Document Layout Analysis with Aesthetic-Guided Image Augmentation, (Fudan, ECNU, Videt), 2021-11, arXiv
SciBank
Deep Learning in Time-Frequency Domain for Document Layout Analysis, (UDLA, DETRI, UNICAMP, Analog Devices, ESPE), 2021-11, IEEE-Access.2021
ReadingBank
LayoutReader: Pre-training of Text and Layout for Reading Order Detection, (UC San Diego, Microsoft), 2021-08, EMNLP-2021
DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis, (BUAA, Microsoft), 2020-06, COLING-2020
arXivdocs
DocParser: Hierarchical Structure Parsing of Document Renderings, (ETH Zurich), 2019-11, AAAI-2021
article-regions
Visual Detection with Context for Document Layout Analysis, (BNL), 2019-11, EMNLP-IJCNLP-2019
PubLayNet
PubLayNet: largest dataset ever for document layout analysis, (IBM), 2019-08, ICDAR-2019
ICDAR2017_POD
ICDAR2017 Competition on Page Object Detection, (PKU), 2017-11, ICDAR-2017
DSSE-200
Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network, (PSU, Adobe), 2017-06, CVPR-2017
DocSynth-300K
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception, (Shanghai AI Lab), 2024-10, arXiv
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models, (KIT, Oxford), 2024-03, CVPR-2024
LACE
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints, (Buffalo, Adobe, MBZUAI), 2024-02, ICLR-2024
RanLayNet
RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization, (IIIT, VIT, JNU, NII), 2024-01, MMAsia-2023
- WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents, (Tencent), 2023-05, arXiv
- A New Annotation Method and Dataset for Layout Analysis of Long Documents, (Virginia Tech), 2023-04, WWW-2023
- Automatic generation of scientific papers for data augmentation in document layout analysis, (UNIFI), 2023-03, Pattern-Recognition-Letters.2023
LayoutDiffusion
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models, (SJTU, Microsoft), 2023-03, ICCV-2023
- Diffusion-based Document Layout Generation, (Purdue, Microsoft), 2023-03, ICDAR-2023
LayoutDM
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation, (CyberAgent, Waseda), 2023-03, CVPR-2023
LDGM
Unifying Layout Generation with a Decoupled Diffusion Model, (XJTU, Microsoft, Tsinghua), 2023-03, CVPR-2023
PLay
PLay: Parametrically Conditioned Layout Generation using Latent Diffusion, (Google), 2023-01, ICML-2023
LayoutFormer++
LayoutFormer++: Conditional Graphic Layout Generation via Constraint Serialization and Decoding Space Restriction, (XJTU, Microsoft, SJTU, BUAA), 2022-08, CVPR-2023
- Coarse-to-Fine Generative Modeling for Graphic Layouts, (XJTU, Microsoft), 2022-06, AAAI-2022
DL-DSG
Cross-Domain Document Layout Analysis Using Document Style Guide, (Fudan, ECNU, Videt), 2022-01, Expert-Systems-with-Applications.2024
BLT
BLT: Bidirectional Layout Transformer for Controllable Layout Generation, (CMU, Google), 2021-12, ECCV-2022
- Synthetic Document Generator for Annotation-free Layout Recognition, (JPMorgan), 2021-11, Pattern-Recognition.2022
- Document image layout analysis via explicit edge embedding network, (ECNU, Videt), 2021-10, Information-Sciences.2021
- A Page Object Detection Method Based on Mask R-CNN, (QUST, ASU, HIT), 2021-10, IEEE-Access.2021
CanvasEmb
CanvasEmb: Learning Layout Representation with Large-scale Pre-training for Graphic Design, (NUS, Microsoft, Meituan), 2021-10, ACM-MM-2021
LayoutMCL
Diverse Multimedia Layout Generation with Multi Choice Learning, (UNSW, CSIRO), 2021-10, ACM-MM-2021
- Constrained Graphic Layout Generation via Latent Optimization, (Waseda, CyberAgent), 2021-08, ACM-MM-2021
CanvasVAE
CanvasVAE: Learning to Generate Vector Graphic Documents, (CyberAgent), 2021-08, ICCV-2021
DocSynth
DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis, (UAB, ISI), 2021-07, ICDAR-2021
LayoutGAN
LayoutGAN: Synthesizing Graphic Layouts With Vector-Wireframe Adversarial Networks, (BIT, Adobe), 2021-07, IEEE-Transactions-on-Pattern-Analysis-and-Machine-Intelligence.2021
DDR
Document Domain Randomization for Deep Learning Document Layout Extraction, (OSU, Uni Vie, Inria, Uni Stuttgart, Nottingham, ODU, PSU), 2021-05, ICDAR-2021
RUITE
RUITE: Refining UI Layout Aesthetics Using Transformer Encoder, (RWTH), 2021-04, IUI-2021
VTN
Variational Transformer Networks for Layout Generation, (Google, ETH Zurich, TUM), 2021-04, CVPR-2021
LayoutTransformer
LayoutTransformer: Layout Generation and Completion with Self-attention, (UMD, UC San Diego, Amazon), 2020-06, ICCV-2021
- Cross-Domain Document Object Detection: Benchmark Suite and Method, (NEU, Adobe), 2020-03, CVPR-2020
NDN
Neural Design Network: Graphic Layout Generation with Constraints, (Google, UC Merced, Yonsei, Georgia Tech), 2019-12, ECCV-2020
READ
READ: Recursive Autoencoders for Document Layout Generation, (SFU, TAU, Amazon, Cornell), 2019-09, CVPR-2020-Workshop
- Content-aware generative modeling of graphic design layouts, (CityUHK), 2019-07, ACM-Transactions-on-Graphics.2019
LayoutGAN
LayoutGAN: Generating Graphic Layouts with Wireframe Discriminators, (BIT, Adobe), 2019-01, ICLR-2019
- Multi-Scale Multi-Task FCN for Semantic Page Segmentation and Table Detection, (PSU, Adobe), 2017-11, ICDAR-2017
- Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network, (PSU, Adobe), 2017-06, CVPR-2017