[ML] Fixing memory_status output on fatal error #1432
tveasey
left a comment
I had a couple of minor comments, but my major comment is that I don't see how making the results writer a member of CDataFrameAnalyzer helps the situation because this also has automatic storage in the context of data_frame_analyzer command.
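The storage-duration concern can be illustrated with a minimal, standalone sketch. It is not the ml-cpp code: `CBufferedWriter` and `outputAtFatalError` are hypothetical names, and the sketch assumes the fatal error path ends the process without unwinding the stack (as `std::exit` and `std::abort` do), so destructors of automatic objects never run. Under that assumption, a writer that only flushes in its destructor loses its final message whether it is a local variable or a member of another automatic object; an explicit flush before the fatal path is what makes the message visible.

```cpp
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical writer that buffers output and flushes it in its destructor.
class CBufferedWriter {
public:
    explicit CBufferedWriter(std::ostream& out) : m_Out(out) {}
    ~CBufferedWriter() { this->flush(); }

    void write(const std::string& message) { m_Buffer += message; }

    void flush() {
        m_Out << m_Buffer;
        m_Buffer.clear();
    }

private:
    std::ostream& m_Out;
    std::string m_Buffer;
};

// Returns what the consumer of the stream would have seen at the moment a
// fatal error handler ended the process (assumption: no stack unwinding).
std::string outputAtFatalError(bool flushBeforeFatal) {
    std::ostringstream sink;
    std::string visible;
    {
        CBufferedWriter writer{sink}; // automatic storage
        writer.write("{\"memory_status\":\"hard_limit\"}");
        if (flushBeforeFatal) {
            writer.flush();
        }
        // A process-ending handler would stop here, so only what has
        // already reached the sink is ever observed.
        visible = sink.str();
    } // The destructor runs here only because this sketch returns normally.
    return visible;
}
```

Calling `outputAtFatalError(false)` yields an empty string, while `outputAtFatalError(true)` yields the buffered message, which is why where the writer lives matters less than whether it is flushed before the fatal path is taken.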
@tveasey thank you for your review. In the end, it turned out to be enough to have one additional …
tveasey
left a comment
As discussed, we probably need to revisit cleanup on HANDLE_FATAL going forward, but this is the best fix for 7.9.
When a job fails because it exceeds its memory limit, it does not emit the final memory_status message. This PR fixes that. I also fixed a typo in the "hard_limit" tag and tweaked the parameters of the unit test CDataFrameAnalyzerTrainingTest.testMemoryLimitHandling to reduce its runtime. Additionally, I took the new messaging from elastic#1428 and added it here to avoid merge conflicts.