-
Notifications
You must be signed in to change notification settings - Fork 2k
Cannot replicate the results mentioned in the repo (English-Vietnamese) . #61
Comments
You have --src=vi --tgt=en, so it's Vietnamese-English (for English-Vietnamese, try --src=en --tgt=vi) & that's about the number we got :) I'll update the tutorial with Vietnamese - English results as well. |
Thanks for the clarification, I tried with --src=en --tgt=vi and I got a Bleu of 25.38 on the test set. Could you clarify if the difference between my results and what was reported (0.72 in the Bleu score) is normal? |
@mjlaali have you tried running it again? it could be just variance (I got 26.4). |
@lmthang regarding replicating IWSLT results: I manage using python 2.7.12, but in python 3.5.2 I end up with BLEU 0.7, so something seems really broken there. I'm not sure yet what it is. |
@bastings Yes, there is an encoding problem in python3. I think this pull request fixed the problem. |
Hi @oahziur in that PR you mention this is fixed in the latest version, but I tested it with the latest code yesterday, so it seems it is not fixed. Are you sure |
The printing to stdout part was fixed, but |
@oahziur yes you are right; I just validated it, and with those changes I get 25.9 test on IWSLT15 en->vi. |
Are you loading the same model (the one you got 26.4 in python 2), but get 25.9 in python 3? |
@bastings You are right, in the second run using two GPUs I got a BLEU of 26.3. For the sake of clarity, in the run with a BLEU of 25.38, I trained my model using a CPU. |
It was a separate run @oahziur so nothing to worry about. However, setting the same random seed, I do get different results when training with python2 and python3. Is this expected? |
@bastings Thanks! I was worry about there are still some encoding issue in python3, but I think that is fixed since you can get consistent results. I think some randomness during training is expected. You should be able to reset |
Yes, it seemed that |
I cannot replicate the result that mentioned in the repo. Here are my settings:
Python 2.7
Tensorflow 1.2.1
Using a docker based on nvidia/cuda:8.0-cudnn5-devel-ubuntu14.04
The command I ran was:
python2 -m nmt.nmt
--src=vi --tgt=en
--vocab_prefix=/data/nmt/iwslt15/vocab
--train_prefix=/data/nmt/iwslt15/train
--dev_prefix=/data/nmt/iwslt15/tst2012
--test_prefix=/data/nmt/iwslt15/tst2013
--out_dir=/data/nmt/models/nmt_attention_model
--hparams_path=nmt/standard_hparams/iwslt15.json
--num_gpus=2
I got the blue score of 24.83, however, on the website, 26.1 has been reported.
The text was updated successfully, but these errors were encountered: