@@ -16,48 +16,166 @@ Setting environment variable: `MKL_NUM_THREADS=1`.
1616
1717- Metrics: samples / sec
1818
19- | Batch Size | 32 | 64 | 128 | 256 |
20- | -- | -- | -- | -- | -- |
21- | PaddlePaddle Fluid | 15.44 | 16.32 | 16.74 | 16.79 |
22- | PaddlePaddle v2 | 15.97 | 17.04 | 17.60 | 17.83 |
23- | TensorFlow | 9.09 | 9.10 | 9.24 | 8.66 |
19+ <table >
20+ <thead >
21+ <tr >
22+ <th >Batch Size </th >
23+ <th > 32</th >
24+ <th >64</th >
25+ <th >128 </th >
26+ <th >256</th >
27+ </tr >
28+ </thead >
29+ <tbody >
30+ <tr >
31+ <td > PaddlePaddle Fluid</td >
32+ <td > 15.44 </td >
33+ <td > 16.32 </td >
34+ <td > 16.74 </td >
35+ <td > 16.79 </td >
36+ </tr >
37+ <tr >
38+ <td >PaddlePaddle v2 </td >
39+ <td > 15.97 </td >
40+ <td > 17.04 </td >
41+ <td > 17.60 </td >
42+ <td > 17.83 </td >
43+ </tr >
44+ <tr >
45+ <td >TensorFlow </td >
46+ <td > 9.09 </td >
47+ <td > 9.10 </td >
48+ <td > 9.24 </td >
49+ <td > 8.66 </td >
50+ </tr >
51+ </tbody >
52+ </table >
53+
2454
2555### Different Batch Size
2656
2757- PServer Count: 10
2858- Trainer Count: 20
2959- Metrics: samples / sec
3060
31- | Batch Size | 32 | 64 | 128 | 256 |
32- | -- | -- | -- | -- | -- |
33- | PaddlePaddle Fluid | 190.20 | 222.15 | 247.40 | 258.18 |
34- | PaddlePaddle v2 | 170.96 | 233.71 | 256.14 | 329.23 |
35- | TensorFlow | - | - | - | - |
36-
61+ <table >
62+ <thead >
63+ <tr >
64+ <th >Batch Size </th >
65+ <th > 32</th >
66+ <th >64</th >
67+ <th >128 </th >
68+ <th >256</th >
69+ </tr >
70+ </thead >
71+ <tbody >
72+ <tr >
73+ <td > PaddlePaddle Fluid</td >
74+ <td > 190.20 </td >
75+ <td > 222.15 </td >
76+ <td > 247.40 </td >
77+ <td > 258.18 </td >
78+ </tr >
79+ <tr >
80+ <td >PaddlePaddle v2 </td >
81+ <td > 170.96 </td >
82+ <td > 233.71 </td >
83+ <td > 256.14 </td >
84+ <td > 329.23 </td >
85+ </tr >
86+ <tr >
87+ <td >TensorFlow </td >
88+ <td > - </td >
89+ <td > - </td >
90+ <td > - </td >
91+ <td > - </td >
92+ </tr >
93+ </tbody >
94+ </table >
3795
3896### Accelerate Rate
3997
4098- Pserver Count: 20
4199- Batch Size: 128
42100- Metrics: samples / sec
43101
44- | Trainer Count | 20 | 40 | 80 | 100 |
45- | -- | -- | -- | -- | -- |
46- | PaddlePaddle Fluid | 263.29 (78.64%) | 518.80 (77.47%) | 836.26 (62.44%) | 1019.29 (60.89%) |
47- | PaddlePaddle v2 (need more tests) | 326.85 (92.85%) | 534.58 (75.93%) | 853.30 (60.60%) | 1041.99 (59.20%) |
48- | TensorFlow | - | - | - | - |
102+ <table >
103+ <thead >
104+ <tr >
105+ <th >Trainer Count </th >
106+ <th >20</th >
107+ <th >40</th >
108+ <th >80</th >
109+ <th >100</th >
110+ </tr >
111+ </thead >
112+ <tbody >
113+ <tr >
114+ <td > PaddlePaddle Fluid</td >
115+ <td > 263.29 (78.64%) </td >
116+ <td > 518.80 (77.47%) </td >
117+ <td > 836.26 (62.44%) </td >
118+ <td > 1019.29 (60.89%) </td >
119+ </tr >
120+ <tr >
121+ <td >PaddlePaddle v2 (need more tests) </td >
122+ <td > 326.85 (92.85%) </td >
123+ <td > 534.58 (75.93%) </td >
124+ <td > 853.30 (60.60%) </td >
125+ <td > 1041.99 (59.20%) </td >
126+ </tr >
127+ <tr >
128+ <td >TensorFlow </td >
129+ <td > - </td >
130+ <td > - </td >
131+ <td > - </td >
132+ <td > - </td >
133+ </tr >
134+ </tbody >
135+ </table >
136+
49137
50138### Different Pserver Count
51139
52140- Trainer Count: 60
53141- Batch Size: 128
54142- Metrics: samples/ sec
55143
56- | PServer Count | 3 | 6 | 10 | 20 |
57- | -- | -- | -- | -- | -- |
58- | PaddlePaddle Fluid(should fix in next PR) | 589.1 | 592.6 | 656.4 | 655.8 |
59- | PaddlePaddle v2 | 593.4 | 791.3 | 729.7 | 821.7 |
60- | TensorFlow | - | - | - | - |
144+ <table >
145+ <thead >
146+ <tr >
147+ <th >PServer Count </th >
148+ <th >3</th >
149+ <th >6</th >
150+ <th >10</th >
151+ <th >20</th >
152+ </tr >
153+ </thead >
154+ <tbody >
155+ <tr >
156+ <td > PaddlePaddle Fluid(should fix in next PR) </td >
157+ <td > 589.1 </td >
158+ <td > 592.6 </td >
159+ <td > 656.4 </td >
160+ <td > 655.8 </td >
161+ </tr >
162+ <tr >
163+ <td >PaddlePaddle v2 (need more tests) </td >
164+ <td > 593.4 </td >
165+ <td > 791.3 </td >
166+ <td > 729.7 </td >
167+ <td > 821.7 </td >
168+ </tr >
169+ <tr >
170+ <td >TensorFlow </td >
171+ <td > - </td >
172+ <td > - </td >
173+ <td > - </td >
174+ <td > - </td >
175+ </tr >
176+ </tbody >
177+ </table >
178+
61179
62180* The performance gap between Fuild and v2 comes from the network interference.*
63181
0 commit comments