Allow ref_to_token_annotations to handle more general cases #7

ivyleavedtoadflax · 2020-02-19T22:28:07Z

Previously ref_to_token_annotations would only work in the splitting scenario of converting reference spans (BI, BE, IE, II) spans to token spans (b-r, i-r, e-r, o). This PR allows it also to be used for parsing spans where a reference span (author) can be converted to a series of token spans (author).

Relevant tests are added, along with improved documentation to the command itself.

Previously this command would only work in the parsing scenario of converting reference spans (BI, BE, IE, II) spans to token spans (b-r, i-r, e-r, o). This commit allows it also to be used for parsing spans where a reference span (author) can be converted to a series of token spans (author).

* Improve documentation. * Output documents which have no annotations, instead of removing them.

deep_reference_parser/prodigy/reference_to_token_annotations.py

nsorros · 2020-02-24T10:36:35Z

deep_reference_parser/prodigy/reference_to_token_annotations.py

+        labels can be applied directly to the individual tokens contained within
+        these multi-token spans; for each token in the multi-token span, a span
+        is created with the same label. Symbolically:
+            * [author author author] becomes [author][author][author]


this is not unique to author right?

exactly 👍

nsorros

LGTM 👍

ivyleavedtoadflax force-pushed the feature/ivyleavedtoadflax/parsing_ref_spans_to_token_spans branch from 66958a0 to 115ada2 Compare February 20, 2020 13:13

chg: Update reference_to_token_annotations command

70f9e59

* Improve documentation. * Output documents which have no annotations, instead of removing them.

ivyleavedtoadflax changed the title ~~wip: Handle parsing case in Allow reference_to_token_annotations.py~~ Allow reference_to_token_annotations to handle more general cases Feb 20, 2020

ivyleavedtoadflax changed the title ~~Allow reference_to_token_annotations to handle more general cases~~ Allow ref_to_token_annotations to handle more general cases Feb 20, 2020

ivyleavedtoadflax requested review from aCampello, lizgzil and nsorros February 20, 2020 14:46

ivyleavedtoadflax marked this pull request as ready for review February 20, 2020 14:47

ivyleavedtoadflax added 3 commits February 20, 2020 13:36

chg: Convert UPPER case ref labels to lowercase token labels

9052fcc

chg: 💄 linting

1c7f7bc

fix: Missing else

ab6fd20

nsorros reviewed Feb 24, 2020

View reviewed changes

deep_reference_parser/prodigy/reference_to_token_annotations.py Show resolved Hide resolved

nsorros reviewed Feb 24, 2020

View reviewed changes

deep_reference_parser/prodigy/reference_to_token_annotations.py Show resolved Hide resolved

nsorros reviewed Feb 24, 2020

View reviewed changes

nsorros approved these changes Feb 24, 2020

View reviewed changes

ivyleavedtoadflax merged commit 5f9f078 into master Feb 25, 2020

ivyleavedtoadflax deleted the feature/ivyleavedtoadflax/parsing_ref_spans_to_token_spans branch February 25, 2020 02:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow ref_to_token_annotations to handle more general cases #7

Allow ref_to_token_annotations to handle more general cases #7

Uh oh!

ivyleavedtoadflax commented Feb 19, 2020 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

nsorros Feb 24, 2020

Uh oh!

ivyleavedtoadflax Feb 25, 2020

Uh oh!

nsorros left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Allow ref_to_token_annotations to handle more general cases #7

Allow ref_to_token_annotations to handle more general cases #7

Uh oh!

Conversation

ivyleavedtoadflax commented Feb 19, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nsorros Feb 24, 2020

Choose a reason for hiding this comment

Uh oh!

ivyleavedtoadflax Feb 25, 2020

Choose a reason for hiding this comment

Uh oh!

nsorros left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ivyleavedtoadflax commented Feb 19, 2020 •

edited

Loading