Closed
Description
In the distributed training VGG16 benchmark:
- Does `Single Node Single Thread` mean single node? If so, the `PServer Count: 10, Trainer Count: 20` below should be removed, since it's confusing.
  - Answer: yes, it means single node. The confusing part has been removed from the doc.
- In `Different Batch Size`, there is `Per trainer CPU Core: 1`. Why is only 1 core used? We should probably be utilizing all the computing resources of one node.
  - Answer: actually it means the test is using `MKL_NUM_THREADS=1` (the doc is now updated); the entire CPU is available to the trainer process. For details, please see here. @typhoonzero will update the GPU results.
- Does `78.64%` in `Accelerate Rate` mean the performance compared to the ideal scenario?
  - Answer: yes.
- In `Accelerate Rate`, `PaddlePaddle v2` is much better than `Fluid` in the `trainer count = 20` case. Do you know why? Same question for the metric in `Different Pserver Count`: V2 is much better than Fluid.
  - Answer: No, this still needs to be figured out.
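
For readers unfamiliar with the `MKL_NUM_THREADS=1` setting mentioned above, here is a minimal sketch of what it does from inside a Python process. It assumes the variable is set before any MKL-backed library is loaded, since MKL typically reads it at initialization; it is not taken from the benchmark scripts.

```python
import os

# Assumption: set this before importing any MKL-backed library,
# because MKL typically reads the variable when it initializes.
os.environ["MKL_NUM_THREADS"] = "1"

# The variable only caps MKL's own thread pool. The OS still reports
# (and can schedule the process on) every core of the machine.
visible_cores = os.cpu_count()

print("MKL_NUM_THREADS =", os.environ["MKL_NUM_THREADS"])
print("cores reported by the OS:", visible_cores)
```

So `Per trainer CPU Core: 1` describes the MKL thread limit, not a restriction on which cores the trainer process may use.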
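
To make the `Accelerate Rate` answer concrete, here is a small sketch of one way such a ratio can be computed, assuming the "ideal scenario" means perfectly linear scaling from a single trainer. The function name and the numbers are hypothetical and are not taken from the benchmark doc.

```python
def accelerate_rate(measured_throughput, single_trainer_throughput, trainer_count):
    """Return measured throughput as a fraction of ideal linear scaling.

    Assumption: the ideal throughput is single_trainer_throughput * trainer_count.
    """
    if trainer_count <= 0 or single_trainer_throughput <= 0:
        raise ValueError("trainer_count and single_trainer_throughput must be positive")
    ideal_throughput = single_trainer_throughput * trainer_count
    return measured_throughput / ideal_throughput


# Illustrative numbers only: 20 trainers reaching 150 samples/sec in total,
# where one trainer alone reaches 10 samples/sec.
rate = accelerate_rate(150.0, 10.0, 20)
print(f"{rate:.2%}")  # 75.00%
```

Under this reading, a value of `78.64%` would mean the cluster delivers roughly four fifths of the throughput that perfect linear scaling would give.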