-
Notifications
You must be signed in to change notification settings - Fork 61
Closed
Labels
Description
Is your feature request related to a problem? Please describe.
I am concerned about how to evaluate the performance of the models as the documentation does not clearly explain how the test data is split. I also would like to estimate better how long to run iterations if I have already experienced with a related data set.
Describe the solution you'd like
I suggest adding to documentation:
- how is test data split. E.g. is it from the end, randomly or based on auto-selection of columns?
- how are the iterations generated. It would seem they might be in fixed order, or my data is fairly similar and leads to it. For example, if iteration 201 performed best, then when I re-train I could stop after iteration 201 next time, if I know this. If they are not in fixed order, then I know to let my next dataset run a bit longer.