I'd like to save multiple agents during training, in order to pick the best one.
but the save function overwrites the previous one.
I think this can be solved by adding a suffix in the file name, such as use a time stamp.
ref:
keras's ModelCheckpoint
Flux's Checkpointing