Training details #2

Open
@wonjininfo

Description

Hi,
Congratulations on your work and thanks for sharing your code!

I am doing research on summarization and am trying to use your code.
I would like to train the model on another dataset.
Could you please share some details about the training procedure you used for the CNN-DM dataset,
such as the hyperparameters, the total training time, and when (i.e., at which epoch) you stopped training?

P.S. Training for 50 epochs, the setting in the example command-line option, took only a short time (less than an hour) on my GPU. Is this normal?

Thanks,
Wonjin
