Please read the first article, then proceed to the second and this repository to understand the reasoning behind the current structure.
Hi, thanks for your great work! I have a question about the training configuration: Why is there no learning rate scheduling policy in the current implementation? I found that the code defines a ...