How to choose the final policy in the search phase?

Hi!
I have set learning rate=0.001. But when searching, the valid metric curve is not unstable. 
Now I choose the top-25 sub_policy(epoch=36, 1 epoch=500 iter)，because the valid metric curve looks good. Then the second train stage is converged. I don't know why? 
Additionally, I choose the sub_policy in the case of curve oscillation, the second train stage  is not stable and model is not converged.
During the entire search phase, the valid metric are oscillating. Is this normal?

I don't know how to choose the final policy, looks like I select sub_policy randomly.  Thanks!

The search stage figure: 
X axis-epoch, Y axis-valid metric.
![image](https://user-images.githubusercontent.com/57698625/112098971-9fcd5780-8bdd-11eb-9b23-bd8be65729b7.png)



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to choose the final policy in the search phase? #17

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to choose the final policy in the search phase? #17

Description

Activity

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions