This repository has been archived by the owner on Jul 26, 2019. It is now read-only.
This repository has been archived by the owner on Jul 26, 2019. It is now read-only.
Poor performance and poor results #15
Open
Description
I'm trying to fine tune BERT on STS-B dataset.
I used the following notebook to fine tune it using BERT-keras.
(As described in the paper, I just added a classification layer using the CLS token of the output of BERT).
However, there is great differences in performance and results between this notebook and the script used in the official version for fine tuning :
BERT-keras | Official BERT | |
---|---|---|
Pearson | 0.0254 | 0.8956 |
Spearman | 0.0289 | 0.7942 |
MSE | 2.2691 | 0.5456 |
Training time | 9h | 10min |
Note : Pearson / Spearman and correlation metrics used to evaluate the accuracy on the STS-B dataset
Why there is such a difference between the 2 approach ?
Metadata
Assignees
Labels
No labels
Activity