Bilou on refactored iob #65

Open: wants to merge 585 commits into base: master

Commits
7a42a23
add description for punctuation removing (#42)
whalebot-helmsman Sep 8, 2017
210f81a
more annotations
whalebot-helmsman Sep 12, 2017
e4fac51
more annotations
whalebot-helmsman Sep 12, 2017
2df7c24
more annotations
whalebot-helmsman Sep 12, 2017
b3956cc
correct ids
whalebot-helmsman Sep 12, 2017
51ef652
correct ids
whalebot-helmsman Sep 12, 2017
f1e002c
does not copy wa-title attributes
whalebot-helmsman Sep 13, 2017
807f3a6
verify conversion
whalebot-helmsman Sep 13, 2017
57bc016
convert annotation
whalebot-helmsman Sep 13, 2017
02aad41
write as html
whalebot-helmsman Sep 13, 2017
b7e1e17
move gate annotations to webannotator
whalebot-helmsman Sep 13, 2017
eb97aa5
tests for html tools
whalebot-helmsman Sep 14, 2017
e541806
pep8 style
whalebot-helmsman Sep 15, 2017
d42dcea
add program description
whalebot-helmsman Sep 15, 2017
37b8728
pep8 style
whalebot-helmsman Sep 15, 2017
80a6fb8
pep8 style
whalebot-helmsman Sep 15, 2017
dca5dd3
add program description
whalebot-helmsman Sep 15, 2017
9e480d4
pep8 style
whalebot-helmsman Sep 15, 2017
c1a1175
ability to pass entities list to verify
whalebot-helmsman Sep 15, 2017
5feb78a
look for annotations in WebAnnotator folder
whalebot-helmsman Sep 15, 2017
6d59b83
pep8
whalebot-helmsman Sep 15, 2017
2a7b013
test attribute removal for wa-title
whalebot-helmsman Sep 15, 2017
3556a57
Merge pull request #47 from whalebot-helmsman/master
kmike Sep 15, 2017
bc26275
mess is gone
whalebot-helmsman Sep 20, 2017
4130d78
no need for gate loader
whalebot-helmsman Sep 20, 2017
c2af278
Merge pull request #48 from whalebot-helmsman/master
kmike Sep 20, 2017
36d56f2
text tokenizer return postions of token
whalebot-helmsman Sep 21, 2017
2d4d2ef
update tests
whalebot-helmsman Sep 21, 2017
80658ca
separate statement for every action
whalebot-helmsman Sep 21, 2017
c52e449
comma preserving test
whalebot-helmsman Sep 21, 2017
8178776
too much tokens around
whalebot-helmsman Sep 21, 2017
51c0932
encode in indices instead of entities
whalebot-helmsman Sep 21, 2017
1a667ec
handle empty lists
whalebot-helmsman Sep 21, 2017
24465b1
pass token length and position from TextToken to HtmlToken
whalebot-helmsman Sep 21, 2017
06befbb
letter perfect detokenization
whalebot-helmsman Sep 22, 2017
e5730b2
do not cleanup tokenized tree by default, separate method for tree cl…
Sep 25, 2017
e340444
update tests for separate tree cleaning
Sep 25, 2017
89673c1
update tests for correct punctuation positions
Sep 25, 2017
7c45984
correct length for replaced quotes
Sep 25, 2017
46fc4df
pep8
Sep 29, 2017
90bdefd
new html tree based to webannotator transformer
Sep 26, 2017
1fb67a0
ignore scripts and styles
Sep 26, 2017
3117640
ignore elements with non-text tokens
Sep 27, 2017
084fb33
as we search use our regexp for text and tail in same moment, our sta…
Sep 27, 2017
43449a1
pep8
Sep 29, 2017
388170e
comma at line end, not start
Sep 29, 2017
71caf61
one join instead of many additions, dont be Schleimel
Sep 29, 2017
37d7470
correct formatting
Sep 29, 2017
e93c6dc
add clarification
Sep 29, 2017
e02c275
fix typo
Sep 29, 2017
f26569f
pep8
Sep 29, 2017
d1aecbb
preserve tokenize method for compatibility
Sep 29, 2017
35a9d88
function to reduce code in tests
Sep 29, 2017
9033188
remove test for nltk tokenizer
Sep 29, 2017
c14f363
test our behaviour, which difers from original treebank tokenizer
Sep 29, 2017
a071cd4
remove useless conversion
Sep 29, 2017
a33f564
rename method to avoid confusion with nltk tokenize_span method
Sep 29, 2017
75a9698
remove brittle tests
Sep 29, 2017
4729323
small benchmark for html tokenizer
Sep 29, 2017
943a44e
Revert "remove brittle tests"
whalebot-helmsman Oct 2, 2017
ba7d6fe
move brittle tests to pytest xfail
whalebot-helmsman Oct 2, 2017
b72bcc1
expect behaviour of nltk tokenizer
whalebot-helmsman Oct 2, 2017
f9190c3
Merge pull request #49 from whalebot-helmsman/master
kmike Oct 2, 2017
09f1699
Merge branch 'master' into webannotator-html
whalebot-helmsman Oct 3, 2017
281d4a5
rename variable
whalebot-helmsman Oct 3, 2017
a0d2519
make TagPosition private
whalebot-helmsman Oct 4, 2017
caa76cc
make translate_to_dfs private
whalebot-helmsman Oct 4, 2017
500ccf4
make fabricate_start/end private
whalebot-helmsman Oct 4, 2017
a743aed
make enclosure private
whalebot-helmsman Oct 4, 2017
f7e7a86
move enclosure deciding to separate function
whalebot-helmsman Oct 4, 2017
91c3962
rename generic tasks to concrete enclosures
whalebot-helmsman Oct 4, 2017
9e3b49a
move dfs order numbering to separate function
whalebot-helmsman Oct 4, 2017
3266427
move start/end tag locating in separate function
whalebot-helmsman Oct 4, 2017
7d56973
pep8
whalebot-helmsman Oct 4, 2017
1dc3f28
high level explanation of whats heppening here
whalebot-helmsman Oct 4, 2017
a92a339
no unicode tags, so string_types is enough
whalebot-helmsman Oct 4, 2017
833603b
reduce code
whalebot-helmsman Oct 4, 2017
4f22537
Merge pull request #50 from whalebot-helmsman/master
kmike Oct 4, 2017
ced2fd8
tutorial rewritten with usage of crfsuite
sibiryakov Oct 17, 2017
67763e6
wapiti link restored
sibiryakov Oct 17, 2017
770d777
Merge pull request #52 from scrapinghub/crfsuite-tutorial
kmike Oct 17, 2017
0bb8fd7
wapiti return bytes, not str
whalebot-helmsman Oct 19, 2017
2d92efb
collect all top N results but return only first of them
whalebot-helmsman Oct 19, 2017
b801d7a
merge top N chains for better recall
whalebot-helmsman Oct 19, 2017
739e269
benchmark script for model prediction
whalebot-helmsman Dec 21, 2017
d8afda6
we need newer wapiti version for python3 support
whalebot-helmsman Dec 21, 2017
0d92091
add various overlapping schemes for chains
whalebot-helmsman Dec 21, 2017
3842740
add description of merging method
whalebot-helmsman Dec 21, 2017
83b5327
Merge pull request #55 from whalebot-helmsman/master
kmike Dec 21, 2017
1713694
there are various types of unusual tags, not only comments
whalebot-helmsman Dec 22, 2017
7a68569
Merge pull request #56 from whalebot-helmsman/master
kmike Dec 22, 2017
0176cdb
non-recursive implementation of algorithm
whalebot-helmsman Dec 22, 2017
f4a1896
add description of WordTokenizer improvements
whalebot-helmsman Dec 22, 2017
3e09c9f
changd comment as code structure changed
whalebot-helmsman Dec 22, 2017
bff4c3e
Merge pull request #57 from whalebot-helmsman/master
kmike Dec 22, 2017
d8b1984
don't declare Python 3.3 support
kmike Dec 29, 2017
d5a7fcf
v0.6
kmike Dec 29, 2017
6b3bc61
fix boolean bug
Kebniss May 3, 2018
0aaef9f
add test case
Kebniss May 10, 2018
7816e9d
add bool test to test_pattern_features
Kebniss May 16, 2018
9fe8988
Merge pull request #59 from scrapinghub/fix-boolean-bug
kmike May 16, 2018
15308c0
update travis to run different python versions
Kebniss May 18, 2018
71bed4f
add branches
Kebniss May 18, 2018
7ddd317
Merge pull request #62 from scrapinghub/fix-travis
kmike May 18, 2018
526ef60
almost working version of refactored tokenize
Kebniss May 31, 2018
bcced73
tmp debugging with possible solution for fixing HtmlToken index issue
Kebniss Jun 1, 2018
687a2d5
create htmlTokens after encoding
Kebniss Jun 5, 2018
4255da2
add bilou_encoder and bilou_group
Kebniss Jun 6, 2018
95716c6
add bilou option remove comments
Kebniss Jun 6, 2018
4d2ae87
add bilou as translator from iob tokens
Kebniss Jun 6, 2018
6b61f5d
add bilou tests
Kebniss Jun 6, 2018
2c7271f
Add bilou option
Kebniss Jun 7, 2018
35e7033
halfway through wapiti
Kebniss Jun 7, 2018
89dfedb
add bilou to wapiti
Kebniss Jun 7, 2018
8eb01bd
fix bug
Kebniss Jun 7, 2018
981d5f1
fix bug
Kebniss Jun 7, 2018
9ea21ea
fix bug
Kebniss Jun 7, 2018
86d2692
remove comments
Kebniss Jun 7, 2018
b6ba5f9
remove comments
Kebniss Jun 7, 2018
157ec78
modify arguments passed to bilou detokenize and add test
Kebniss Jun 7, 2018
ada0c9b
remove newline
Kebniss Jun 7, 2018
44b8857
fix bilou_group and tests
Kebniss Jun 7, 2018
d221f46
fix call to bilou_group and add bilou grouping test
Kebniss Jun 8, 2018
64962a3
add wapiti tests similar to crfsuite
Kebniss Jun 11, 2018
c404e45
fixed merge_tp_n for bilou and added bilou tests for wapiti
Kebniss Jun 12, 2018
12695ce
rebase and add BilouEncoder
Kebniss Jul 6, 2018
d303ccd
update files to new BilouEncoder format
Kebniss Jul 7, 2018
2985fba
update wapiti to work with BilouEncoder
Kebniss Jul 11, 2018
c140626
fix comments tests
Kebniss Jul 12, 2018
3ca8822
update travis to install python-wapiti from repo
Kebniss Jul 13, 2018
fb57468
try to install wapiti from travis
Kebniss Jul 13, 2018
fb2e8ad
fix inline tests
Kebniss Jul 31, 2018
16a9030
fix inline tests
Kebniss Jul 31, 2018
9a0f07a
fix inline tests
Kebniss Jul 31, 2018
d30df9d
fix inline tests
Kebniss Jul 31, 2018
d57e8ba
fix inline tests
Kebniss Jul 31, 2018
e1d3f81
fix inline tests
Kebniss Jul 31, 2018
db39528
install python-wapiti
Kebniss Jul 31, 2018
d245c30
install wapiti
Kebniss Jul 31, 2018
1f3d078
install python wapiti
Kebniss Jul 31, 2018
ec8843a
try to install wapiti
Kebniss Aug 1, 2018
27d6a3a
move install_wapiti in webstruct
Kebniss Aug 1, 2018
aa6b4ca
correct typo in file name
Kebniss Aug 1, 2018
4a75843
move install wapiti
Kebniss Aug 1, 2018
129b367
install GitPython as early as possible
Kebniss Aug 1, 2018
d5bdb93
remove GitPython from addons
Kebniss Aug 1, 2018
a3c66ab
comment strict=True test in bilou
Kebniss Aug 1, 2018
684ac2a
add install wapiti to tox
Kebniss Aug 1, 2018
cb93778
remove duplicate commands
Kebniss Aug 1, 2018
cbd066f
expand commands
Kebniss Aug 2, 2018
4a0862c
remove cd
Kebniss Aug 2, 2018
a59466a
lowercase c
Kebniss Aug 2, 2018
b5cf69f
add cmake to use make
Kebniss Aug 2, 2018
4b05c0a
install deps earlier
Kebniss Aug 2, 2018
8561953
remove earlier install
Kebniss Aug 2, 2018
8a8b803
remove wapiti install from travis
Kebniss Aug 2, 2018
bc3d333
add sudo true
Kebniss Aug 2, 2018
8894439
sudo true
Kebniss Aug 2, 2018
fd49de8
add GitPython to setup
Kebniss Aug 2, 2018
885a737
change install dir
Kebniss Aug 2, 2018
44a6f3a
modify intsall dir
Kebniss Aug 2, 2018
f4bb7f2
modify distdir
Kebniss Aug 2, 2018
d3619c6
modify dist dir
Kebniss Aug 2, 2018
448fee3
modify directory
Kebniss Aug 2, 2018
18471d2
modify dir
Kebniss Aug 2, 2018
ef21059
modify dir
Kebniss Aug 2, 2018
8ea646e
change dir
Kebniss Aug 2, 2018
c376736
change dir
Kebniss Aug 3, 2018
5aa85b6
modify pytest command
Kebniss Aug 3, 2018
9ef33ca
change test path
Kebniss Aug 3, 2018
b72264b
update test command
Kebniss Aug 3, 2018
bcf1e55
check for wapiti command
whalebot-helmsman Aug 27, 2018
7486b87
install wapiti binary
whalebot-helmsman Aug 27, 2018
6e958d7
use sudo
whalebot-helmsman Aug 27, 2018
581f6e7
return to directory
whalebot-helmsman Aug 27, 2018
ad17fbf
move test to separate file
whalebot-helmsman Aug 28, 2018
9e46156
Merge pull request #67 from whalebot-helmsman/travis-wapiti
kmike Aug 28, 2018
c10d9e9
almost working version of refactored tokenize
Kebniss May 31, 2018
faa0d82
tmp debugging with possible solution for fixing HtmlToken index issue
Kebniss Jun 1, 2018
d2ecc08
create htmlTokens after encoding
Kebniss Jun 5, 2018
4de042f
add bilou_encoder and bilou_group
Kebniss Jun 6, 2018
ed7a0f0
add bilou option remove comments
Kebniss Jun 6, 2018
a861544
add bilou as translator from iob tokens
Kebniss Jun 6, 2018
d23b263
add bilou tests
Kebniss Jun 6, 2018
225ab9c
Add bilou option
Kebniss Jun 7, 2018
6001c4e
halfway through wapiti
Kebniss Jun 7, 2018
c688f36
add bilou to wapiti
Kebniss Jun 7, 2018
aeb4ed3
fix bug
Kebniss Jun 7, 2018
9fdabe0
fix bug
Kebniss Jun 7, 2018
697a239
fix bug
Kebniss Jun 7, 2018
f26a6dd
remove comments
Kebniss Jun 7, 2018
10548c5
remove comments
Kebniss Jun 7, 2018
8c0fb08
modify arguments passed to bilou detokenize and add test
Kebniss Jun 7, 2018
b4f2eee
remove newline
Kebniss Jun 7, 2018
b0733fb
fix bilou_group and tests
Kebniss Jun 7, 2018
a5bbc33
fix call to bilou_group and add bilou grouping test
Kebniss Jun 8, 2018
f566091
add wapiti tests similar to crfsuite
Kebniss Jun 11, 2018
3be0deb
fixed merge_tp_n for bilou and added bilou tests for wapiti
Kebniss Jun 12, 2018
e01546f
rebase and add BilouEncoder
Kebniss Jul 6, 2018
d3a64fd
update files to new BilouEncoder format
Kebniss Jul 7, 2018
678185d
update wapiti to work with BilouEncoder
Kebniss Jul 11, 2018
161ab4b
fix comments tests
Kebniss Jul 12, 2018
c91dbce
update travis to install python-wapiti from repo
Kebniss Jul 13, 2018
34764fe
try to install wapiti from travis
Kebniss Jul 13, 2018
0a11760
fix inline tests
Kebniss Jul 31, 2018
a6a010d
fix inline tests
Kebniss Jul 31, 2018
9f92235
fix inline tests
Kebniss Jul 31, 2018
c125a0a
fix inline tests
Kebniss Jul 31, 2018
a263991
fix inline tests
Kebniss Jul 31, 2018
a0c0ab8
fix inline tests
Kebniss Jul 31, 2018
92d9113
install python-wapiti
Kebniss Jul 31, 2018
07f9046
install wapiti
Kebniss Jul 31, 2018
0146222
install python wapiti
Kebniss Jul 31, 2018
8aaed89
try to install wapiti
Kebniss Aug 1, 2018
792962f
move install_wapiti in webstruct
Kebniss Aug 1, 2018
2c3d027
correct typo in file name
Kebniss Aug 1, 2018
f5b5164
move install wapiti
Kebniss Aug 1, 2018
c36b242
install GitPython as early as possible
Kebniss Aug 1, 2018
daa5c77
remove GitPython from addons
Kebniss Aug 1, 2018
41eff7e
comment strict=True test in bilou
Kebniss Aug 1, 2018
d5e489e
add install wapiti to tox
Kebniss Aug 1, 2018
413d766
remove duplicate commands
Kebniss Aug 1, 2018
9a0f692
expand commands
Kebniss Aug 2, 2018
78ce844
remove cd
Kebniss Aug 2, 2018
bef9c35
lowercase c
Kebniss Aug 2, 2018
8352725
add cmake to use make
Kebniss Aug 2, 2018
c09015e
install deps earlier
Kebniss Aug 2, 2018
78a930f
remove earlier install
Kebniss Aug 2, 2018
ceea90e
remove wapiti install from travis
Kebniss Aug 2, 2018
4561ad4
add sudo true
Kebniss Aug 2, 2018
9ada2b8
sudo true
Kebniss Aug 2, 2018
0899a2e
add GitPython to setup
Kebniss Aug 2, 2018
21ee599
change install dir
Kebniss Aug 2, 2018
e0f8753
modify intsall dir
Kebniss Aug 2, 2018
3a6beb6
modify distdir
Kebniss Aug 2, 2018
e3467bb
modify dist dir
Kebniss Aug 2, 2018
4664692
modify directory
Kebniss Aug 2, 2018
fc8b848
modify dir
Kebniss Aug 2, 2018
56778cd
modify dir
Kebniss Aug 2, 2018
f8c7035
change dir
Kebniss Aug 2, 2018
fa0404e
change dir
Kebniss Aug 3, 2018
8c34e82
modify pytest command
Kebniss Aug 3, 2018
48b3d5d
change test path
Kebniss Aug 3, 2018
423f01e
update test command
Kebniss Aug 3, 2018
293f841
rebased
Kebniss Aug 30, 2018
ccece68
remove wrong wapiti related files
Kebniss Aug 30, 2018
3e55f53
remove commented code and broken inline test
Kebniss Aug 30, 2018
844777d
remove gitpython
Kebniss Aug 30, 2018
2789ec7
remove broken inline test
Kebniss Aug 30, 2018
6edfae9
add wapiti bilou tests
Kebniss Aug 30, 2018
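
Several of the commits above describe adding BILOU as a translation from IOB tags. As an illustration of that idea only, here is a minimal sketch of such a translation. The function name `iob_to_bilou` and the plain list-of-strings tag format are assumptions made for this example; they are not taken from the webstruct code in this pull request.

```python
def iob_to_bilou(tags):
    """Translate IOB tags (B-X, I-X, O) into BILOU tags.

    Hypothetical helper for illustration; not the webstruct API.
    B = begin, I = inside, L = last, O = outside, U = unit (single token).
    """
    bilou = []
    for i, tag in enumerate(tags):
        if tag == 'O':
            bilou.append('O')
            continue

        prefix, _, label = tag.partition('-')
        next_tag = tags[i + 1] if i + 1 < len(tags) else 'O'
        # The entity continues only if the next tag is I- with the same label.
        continues = next_tag == 'I-' + label

        if prefix == 'B':
            # A begin tag with no continuation is a single-token entity.
            bilou.append(('B-' if continues else 'U-') + label)
        elif prefix == 'I':
            # An inside tag with no continuation closes the entity.
            bilou.append(('I-' if continues else 'L-') + label)
        else:
            raise ValueError('unexpected IOB tag: %r' % (tag,))
    return bilou


if __name__ == '__main__':
    iob = ['O', 'B-ORG', 'I-ORG', 'I-ORG', 'O', 'B-CITY', 'B-CITY', 'I-CITY']
    print(iob_to_bilou(iob))
```

The sketch looks one tag ahead, so it needs the whole sequence up front. An `I-` tag that does not follow a `B-` tag is passed through as `I-` or `L-` rather than rejected; a stricter encoder might raise an error there instead.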
19 changes: 18 additions & 1 deletion .gitignore
@@ -24,7 +24,10 @@ pip-log.txt
# Unit test / coverage reports
.coverage
.tox
cover
nosetests.xml
.cache
htmlcov/

# Translations
*.mo
@@ -35,5 +38,19 @@ nosetests.xml
.pydevproject

# Other
.idea
webstruct_data/datastore

.ipynb_checkpoints
docs/_build
webstruct_data/todo
notebooks/old
notebooks/*.zip
notebooks/*.html
notebooks/*.ipynb
notebooks/*.marisa
notebooks/*.wapiti
notebooks/*.crfsuite
webstruct_data/corpus/us_contact_pages/cleaned
example/_data/*
example/*.joblib
example/*.html
42 changes: 42 additions & 0 deletions .travis.yml
@@ -0,0 +1,42 @@
language: python
python: 3.5
sudo: false

branches:
  only:
    - master
    - /^\d\.\d+$/

matrix:
  include:
    - python: 2.7
      env: TOXENV=py27
    - python: 3.4
      env: TOXENV=py34
    - python: 3.5
      env: TOXENV=py35
    - python: 3.6
      env: TOXENV=py36

addons:
  apt:
    packages:
      - python-numpy
      - python-scipy
      - libatlas-base-dev
      - liblapack-dev
      - gfortran

install:
  - pip install -U pip tox codecov
  - git clone https://github.com/Jekub/Wapiti /tmp/wapiti
  - pushd /tmp/wapiti && make && sudo make install && popd

script: tox

after_success:
  - codecov

cache:
  directories:
    - $HOME/.cache/pip
59 changes: 59 additions & 0 deletions CHANGES.rst
@@ -0,0 +1,59 @@
Changes
=======

0.6 (2017-12-29)
----------------

* A complete example (contact extractor) is added to the repo;
* fixed a lot of issues in the annotated data;
* fixed loading of ``<title>`` annotations;
* all annotated data is converted from GATE to WebAnnotator format;
* text tokenizers can optionally return original token positions;
* converting text from tokenized to raw is now lossless;
* ``webstruct.webannotator.to_webannotator`` is rewritten;
* ``<script>``, ``<style>`` elements, HTML comments and processing
  instructions are ignored when they are inside entities;
* tutorial is rewritten for CRFSuite;
* Wapiti support is fixed in Python 3;
* top-N parsing support when using Wapiti; an option to merge top N chains,
  to increase recall;
* benchmarking script;
* don't declare Python 3.3 support (it is EOL).

0.5 (2017-05-10)
----------------

* webstruct.model.NER now uses ``requests`` library to make HTTP requests;
* changed default headers used by webstruct.model.NER;
* new ``webstruct.infer_domain`` module useful for proper cross-validation;
* webstruct.webannotator.to_webannotator got an option to add ``<base>``
  tag with the original URL to the page;
* fixed a warning in webstruct.gazetteers.geonames.read_geonames;
* add a few more country names to countries.txt list.

0.4.1 (2016-11-28)
------------------

* fixed a bug in NER.extract().

0.4 (2016-11-26)
----------------

* sklearn-crfsuite_ is used as a CRFsuite wrapper, CRFsuiteCRF class
  is removed;
* comments are preserved in HTML trees because recent Firefox puts
  ``<base>`` tags to a comment when saving pages, and this affects
  WebAnnotator;
* fixed 'dont_penalize' argument of webstruct.NER.extract_groups_from_url;
* new webstruct.model.extract_entity_groups utility function;
* HtmlTokenizer and HtmlToken are moved to their own module
  (webstruct.html_tokenizer);
* test improvements.

.. _sklearn-crfsuite: https://github.com/TeamHG-Memex/sklearn-crfsuite

0.3 (2016-09-19)
----------------

There are many changes from the previous version: the API is changed,
Python 3 is supported, gazetteer support is improved, CRFsuite is supported, etc.
45 changes: 45 additions & 0 deletions README.rst
@@ -0,0 +1,45 @@
Webstruct
=========

.. image:: https://img.shields.io/pypi/v/webstruct.svg
    :target: https://pypi.python.org/pypi/webstruct
    :alt: PyPI Version

.. image:: https://travis-ci.org/scrapinghub/webstruct.svg?branch=master
    :target: https://travis-ci.org/scrapinghub/webstruct
    :alt: Build Status

.. image:: https://codecov.io/gh/scrapinghub/webstruct/branch/master/graph/badge.svg
    :target: https://codecov.io/gh/scrapinghub/webstruct
    :alt: Code Coverage

.. image:: https://readthedocs.org/projects/webstruct/badge/?version=latest
    :target: http://webstruct.readthedocs.io/en/latest/
    :alt: Documentation


Webstruct is a library for creating statistical NER_ systems that work
on HTML data, i.e. a library for building tools that extract named
entities (addresses, organization names, open hours, etc.) from webpages.

Unlike most NER systems, webstruct works on HTML data, not only
on text data. This makes it possible to define features that use HTML
structure, and also to embed annotation results back into HTML.

Read the docs_ for more info.

License is MIT.

.. _docs: http://webstruct.readthedocs.io/en/latest/
.. _NER: http://en.wikipedia.org/wiki/Named-entity_recognition

Contributing
------------

* Source code: https://github.com/scrapinghub/webstruct
* Bug tracker: https://github.com/scrapinghub/webstruct/issues

To run tests, make sure tox_ is installed, then run
``tox`` from the source root.

.. _tox: https://tox.readthedocs.io/en/latest/
13 changes: 0 additions & 13 deletions block_model/README.md

This file was deleted.

11 changes: 0 additions & 11 deletions block_model/convert_html.py

This file was deleted.

16 changes: 0 additions & 16 deletions block_model/convert_labeled_data.py

This file was deleted.

132 changes: 0 additions & 132 deletions block_model/data/1.html

This file was deleted.

32 changes: 0 additions & 32 deletions block_model/data/1.txt

This file was deleted.
