Skip to content

Commit

Permalink
chore: add new pattern for dash bullet (#109)
Browse files Browse the repository at this point in the history
* add new pattern for dash bullet
  • Loading branch information
mallorih authored Dec 21, 2022
1 parent 962c9dc commit 6f4d9ad
Show file tree
Hide file tree
Showing 4 changed files with 8 additions and 1 deletion.
5 changes: 5 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,8 @@
## 0.3.5-dev0

* Add new pattern to recognize plain text dash bullets
* Add test for bullet patterns

## 0.3.4

* Python-3.7 compat
Expand Down
1 change: 1 addition & 0 deletions test_unstructured/partition/test_text_type.py
Original file line number Diff line number Diff line change
Expand Up @@ -88,6 +88,7 @@ def test_is_possible_title(text, expected, monkeypatch):
("⦿ This is a fine point!", True),
(" This is a fine point!", True),
("* This is a fine point!", True),
("- This is a fine point!", True),
("This is NOT a fine point!", False), # No bullet point
("I love morse code! ● ● ● --- ● ● ●", False), # Not at the beginning
],
Expand Down
2 changes: 1 addition & 1 deletion unstructured/__version__.py
Original file line number Diff line number Diff line change
@@ -1 +1 @@
__version__ = "0.3.4" # pragma: no cover
__version__ = "0.3.5-dev0" # pragma: no cover
1 change: 1 addition & 0 deletions unstructured/nlp/patterns.py
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,7 @@
"\u2767",
"\u29BE",
"\u29BF",
"\u002D",
"",
"\*", # noqa: W605 NOTE(robinson) - skipping qa because we need the escape for the regex
"\x95",
Expand Down

0 comments on commit 6f4d9ad

Please sign in to comment.