-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
-
Confidence(unfortunately, this conflates Coords and Text@conf) -
TextType(HANDWRITING→@production=handwritten-printscript|handwritten-cursive,PRINTED→@production=printed) - support tables:
- top-level
TableRegionforTABLEblock - recursive
TextRegionforCELLblock (i.e.ColumnIndex→Roles/TableCellRole/@columnIndex,RowIndex→Roles/TableCellRole/@rowIndex) - recursive
TextRegionforMERGED_CELLblock (i.e.ColumnSpan→Roles/TableCellRole/@colSpan,RowSpan→Roles/TableCellRole/@rowSpan) – diverging recursion between Textract and PAGE? - recursive
TextRegionforTABLE_TITLEandTABLE_FOOTERblock (i.e.Roles/TableCellRole/@header... or via ReadingOrder) -
EntityTypes–STRUCTURED_TABLE|SEMI_STRUCTURED_TABLE(unclear how to represent in PAGE),TABLE_TITLE|TABLE_SECTION_TITLE|TABLE_FOOTER|TABLE_SUMMARY|COLUMN_HEADER(unclear how this looks and compares with the actual recursiveBlockType)? - also via ordered groups in ReadingOrder?
- unclear:
LineItemGroupandLineItems
- top-level
-
PageClassification/PageType(unclear, but probablyPage/@type) - support forms
-
BlockType=KEY_VALUE_SETandEntityTypes=KEY|VALUE→ unclear how to represent: TableRegion or recursive TextRegion? Labels/Label? - register
KEY_VALUE_SET - represent in page
-
- support checkboxes within tables or forms
-
BlockType=SELECTION_ELEMENTandSelectionStatus=SELECTED|NOT_SELECTED→ unclear how to represent - register
SELECTION_ELEMENT - represent in page
-
- ignore query type
Metadata
Metadata
Assignees
Labels
No labels