Open
Description
Hi. Kudos for this nice work. I am trying to reproduce the results on DailyDialog dataset. It will be very helpful if you can clarify the following details.
In Issue #13, you mentioned using "nltk.word_tokenize() to tokenize the sentence and then concatenate the tokens" to make the format of the generated dialogue same as the reference response. I have two questions here,
- Did you use any post-processing on the reference files?
- Did you try only nltk.word_tokenize() or some other tokenizer as well?
It will be very useful if you can briefly mention your post-processing steps.
Metadata
Metadata
Assignees
Labels
No labels
Activity