You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Adds parse command that can be called with `python -m deep_reference_parser parse`
6
+
* Rename predict command to 'split' which can be called with `python -m deep_reference_parser parse`
7
+
* Squashes most `tensorflow`, `keras_contrib`, and `numpy` warnings in `__init__.py` resulting from old versions and soon-to-be deprecated functions.
8
+
* Reduces verbosity of logging, improving CLI clarity.
9
+
10
+
## 2020.2.0 - Pre-release
11
+
12
+
First release. Features train and predict functions tested mainly for the task of labelling reference (e.g. academic references) spans in policy documents (e.g. documents produced by government, NGOs, etc).
Copy file name to clipboardExpand all lines: README.md
+47-19Lines changed: 47 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,15 +4,15 @@
4
4
5
5
Deep Reference Parser is a Bi-direction Long Short Term Memory (BiLSTM) Deep Neural Network with a stacked Conditional Random Field (CRF) for identifying references from text. It is designed to be used in the [Reach](https://github.com/wellcometrust/reach) tool to replace a number of existing machine learning models which find references, and extract the constituent parts (e.g. author, year, publication, volume, etc).
6
6
7
-
The intention for this project, like Rodrigues et al. (2018)is to implement a MultiTask model which will complete three tasks simultaneously: reference span detection, reference component detection, and reference type classification.
7
+
The BiLSTM model is based on Rodrigues et al. (2018), and like this project, the intention is to implement a MultiTask model which will complete three tasks simultaneously: reference span detection (splitting), reference component detection (parsing), and reference type classification (classification) in a single neural network and stacked CRF.
8
8
9
9
### Current status:
10
10
11
11
|Component|Individual|MultiTask|
12
12
|---|---|---|
13
-
|Spans|✔️ Implemented|❌ Not Implemented|
14
-
|Components|❌ Not Implemented|❌ Not Implemented|
15
-
|Type|❌ Not Implemented|❌ Not Implemented|
13
+
|Spans (splitting)|✔️ Implemented|❌ Not Implemented|
14
+
|Components (parsing)|✔️ Implemented|❌ Not Implemented|
15
+
|Type (classification)|❌ Not Implemented|❌ Not Implemented|
16
16
17
17
### The model
18
18
@@ -29,7 +29,9 @@ The model itself is based on the work of [Rodrigues et al. (2018)](https://githu
29
29
30
30
### Performance
31
31
32
-
#### Span detection
32
+
On the validation set.
33
+
34
+
#### Span detection (splitting)
33
35
34
36
|token|f1|support|
35
37
|---|---|---|
@@ -39,13 +41,24 @@ The model itself is based on the work of [Rodrigues et al. (2018)](https://githu
39
41
|o|0.9561|32666|
40
42
|weighted avg|0.9746|129959|
41
43
44
+
#### Components (parsing)
45
+
46
+
|token|f1|support|
47
+
|---|---|---|
48
+
|author|0.9467|2818|
49
+
|title|0.8994|4931|
50
+
|year|0.8774|418|
51
+
|o|0.9592|13685|
52
+
|weighted avg|0.9425|21852|
53
+
42
54
#### Computing requirements
43
55
44
56
Models are trained on AWS instances using CPU only.
If you wish to use a custom model that you have trained, you must specify the config file which defines the hyperparameters for that model using the `-c` flag:
0 commit comments