Dense dimension sparse input #6563

tabergma · 2020-09-03T12:05:00Z

Proposed changes:
Do not set the output dimension of the sparse-to-dense layers to the same dimension as the dense features.

closes #6555

Status (please check what you already did):

added some tests for the functionality
updated the documentation
updated the changelog (please check changelog for instructions)
reformat files using black (please check Readme for instructions)

github-actions · 2020-09-03T15:44:55Z

Commit: 96a529d, The full report is available as an artifact.

Dataset: Carbon Bot

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `42s`, train: `3m45s`, total: `4m27s`	0.8388	0.6260	0.5894
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `19s`, train: `2m43s`, total: `3m1s`	0.7417	0.6260	0.4651

Dataset: Hermit

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `1m23s`, train: `23m19s`, total: `24m41s`	0.8866	0.7487	`no data`
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `35s`, train: `19m16s`, total: `19m50s`	0.8281	0.7487	`no data`

Dataset: Sara

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `1m3s`, train: `7m44s`, total: `8m47s`	0.8981	0.8683	0.9283
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `27s`, train: `5m18s`, total: `5m44s`	0.8139	0.8683	0.8522

github-actions · 2020-09-04T10:48:46Z

Commit: 83c9305, The full report is available as an artifact.

Dataset: Carbon Bot

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `41s`, train: `3m40s`, total: `4m21s`	0.8272	0.6260	0.6026
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `21s`, train: `2m43s`, total: `3m4s`	0.7437	0.6260	0.5033

Dataset: Hermit

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `1m20s`, train: `23m25s`, total: `24m45s`	0.8931	0.7487	`no data`
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `35s`, train: `19m25s`, total: `20m0s`	0.8290	0.7487	`no data`

Dataset: Sara

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `1m3s`, train: `7m51s`, total: `8m54s`	0.8981	0.8683	0.9196
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `28s`, train: `5m17s`, total: `5m44s`	0.8384	0.8683	0.8413

github-actions · 2020-09-04T13:28:11Z

Commit: 73f5d12, The full report is available as an artifact.

Dataset: Carbon Bot

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `45s`, train: `4m5s`, total: `4m49s`	0.8485	0.6260	0.5430
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `20s`, train: `2m53s`, total: `3m13s`	0.7223	0.6260	0.4967

Dataset: Hermit

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `37s`, train: `20m0s`, total: `20m36s`	0.8253	0.7487	`no data`

Dataset: Sara

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `1m7s`, train: `8m13s`, total: `9m19s`	0.8962	0.8683	0.9217
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `28s`, train: `5m44s`, total: `6m11s`	0.8335	0.8683	0.8261

github-actions · 2020-09-04T16:48:32Z

/modeltest

include:
 - dataset: 
      - "Carbon Bot"
      - "Sara"
   config:
      - "Sparse + DIET(bow) + ResponseSelector(bow)"
      - "Sparse + DIET(seq) + ResponseSelector(bow)"
      - "Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)"
      - "Sparse + ConveRT + DIET(seq) + ResponseSelector(bow)"
      - "Sparse + Spacy + DIET(bow) + ResponseSelector(bow)"
      - "Sparse + Spacy + DIET(seq) + ResponseSelector(bow)"

github-actions · 2020-09-04T19:24:25Z

Commit: 9e65197, The full report is available as an artifact.

Dataset: Carbon Bot

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `43s`, train: `3m50s`, total: `4m32s`	0.8369	0.6260	0.5960
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `20s`, train: `2m45s`, total: `3m4s`	0.7340	0.6260	0.5298

Dataset: Sara

Configuration	Intent Classification Micro F1	Entity Recognition Micro F1	Response Selection Micro F1
`Sparse + ConveRT + DIET(bow) + ResponseSelector(bow)` test: `1m2s`, train: `7m56s`, total: `8m58s`	0.8874	0.8683	0.9283
`Sparse + DIET(bow) + ResponseSelector(bow)` test: `28s`, train: `5m34s`, total: `6m1s`	0.8325	0.8683	0.8522

tabergma · 2020-09-07T07:14:34Z

Results match those from the master branch (see https://metabase.rasa.com/dashboard/166)

Ghostvv · 2020-09-07T08:00:56Z

rasa/nlu/selectors/response_selector.py

@@ -72,6 +71,7 @@
    TENSORBOARD_LOG_LEVEL,
    CONCAT_DIMENSION,
    FEATURIZERS,
+    DENSE_DIMENSION,


in response selector, 128 is not enough?

since we changed it to predict labels, it is basically another DIET

When I set it to something lower, the performance dropped (see the tables above).

Ghostvv

did it reduce train run time for DIET?

tabergma · 2020-09-07T09:00:07Z

The train time for Sparse + ConveRT + DIET(bow) + ResponseSelector(bow) increased for Sara and Carbon Bot by a couple of seconds. Without ConveRT it stayed the same.

tabergma added 3 commits September 3, 2020 13:54

remove dense dimension

f15ea6c

add changelog

4a98acd

update response selector and docs

945e7cd

tabergma added status:model-regression-tests and removed status:model-regression-tests labels Sep 3, 2020

github-actions bot deleted a comment from tabergma Sep 3, 2020

tabergma added the runner:gpu label Sep 3, 2020

github-actions bot removed status:model-regression-tests runner:gpu labels Sep 3, 2020

tabergma added 2 commits September 4, 2020 10:26

do not calculate min of dense dim

405b418

Merge branch 'master' into dense-dimension-sparse-input

5091ba2

tabergma added runner:gpu status:model-regression-tests labels Sep 4, 2020

github-actions bot removed status:model-regression-tests runner:gpu labels Sep 4, 2020

set default dimension to 256

8ca8344

tabergma added runner:gpu status:model-regression-tests labels Sep 4, 2020

github-actions bot removed status:model-regression-tests runner:gpu labels Sep 4, 2020

tabergma added 2 commits September 4, 2020 15:44

update default values

278490a

update docs

83c3ec8

tabergma added runner:gpu status:model-regression-tests labels Sep 4, 2020

Merge branch 'master' into dense-dimension-sparse-input

f1c2560

tabergma removed the status:model-regression-tests label Sep 4, 2020

tabergma added runner:gpu status:model-regression-tests and removed runner:gpu status:model-regression-tests labels Sep 4, 2020

github-actions bot deleted a comment from tabergma Sep 4, 2020

tabergma added 2 commits September 7, 2020 09:12

update changelog

6cc2154

Merge branch 'master' into dense-dimension-sparse-input

89078e1

tabergma requested a review from Ghostvv September 7, 2020 07:13

RasaHQ deleted a comment from github-actions bot Sep 7, 2020

Ghostvv reviewed Sep 7, 2020

View reviewed changes

Ghostvv approved these changes Sep 7, 2020

View reviewed changes

Merge branch 'master' into dense-dimension-sparse-input

113ad1d

tabergma added the status:ready-to-merge label Sep 7, 2020

rasabot added 4 commits September 7, 2020 11:08

Merge branch 'master' into dense-dimension-sparse-input

093b427

Merge branch 'master' into dense-dimension-sparse-input

be9364a

Merge branch 'master' into dense-dimension-sparse-input

dd48d4c

Merge branch 'master' into dense-dimension-sparse-input

984703e

rasabot merged commit 1478b39 into master Sep 8, 2020

rasabot deleted the dense-dimension-sparse-input branch September 8, 2020 09:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dense dimension sparse input #6563

Dense dimension sparse input #6563

tabergma commented Sep 3, 2020 •

edited

Loading

github-actions bot commented Sep 3, 2020

github-actions bot commented Sep 4, 2020

github-actions bot commented Sep 4, 2020

github-actions bot commented Sep 4, 2020

github-actions bot commented Sep 4, 2020

tabergma commented Sep 7, 2020

Ghostvv Sep 7, 2020

Ghostvv Sep 7, 2020

tabergma Sep 7, 2020

Ghostvv left a comment

tabergma commented Sep 7, 2020

Dense dimension sparse input #6563

Dense dimension sparse input #6563

Conversation

tabergma commented Sep 3, 2020 • edited Loading

github-actions bot commented Sep 3, 2020

github-actions bot commented Sep 4, 2020

github-actions bot commented Sep 4, 2020

github-actions bot commented Sep 4, 2020

github-actions bot commented Sep 4, 2020

tabergma commented Sep 7, 2020

Ghostvv Sep 7, 2020

Choose a reason for hiding this comment

Ghostvv Sep 7, 2020

Choose a reason for hiding this comment

tabergma Sep 7, 2020

Choose a reason for hiding this comment

Ghostvv left a comment

Choose a reason for hiding this comment

tabergma commented Sep 7, 2020

tabergma commented Sep 3, 2020 •

edited

Loading