Skip to content

Conversation

@valeriy42
Copy link
Contributor

@valeriy42 valeriy42 commented Jul 21, 2020

When the job is failing due to the memory limit exceeding, the job fails to emit the final memory_status message. This PR fixes this.

There was a type in the "hard_limit" tag which I fixed. Also I tweaked the parameters of the unit test CDataFrameAnalyzerTrainingTest.testMemoryLimitHandling to reduce runtime.

Additionally I took the new messaging from #1428 and added it here to avoid merge conflicts.

@valeriy42 valeriy42 changed the title [WIP][ML] Fixing memory_status output on fatal error [ML] Fixing memory_status output on fatal error Jul 22, 2020
Copy link
Contributor

@tveasey tveasey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I had a couple of minor comments, but my major comment is that I don't see how making the results writer a member of CDataFrameAnalyzer helps the situation because this also has automatic storage in the context of data_frame_analyzer command.

@valeriy42
Copy link
Contributor Author

@tveasey thank you for your review. At the end it turned out enough to have one additional flush call to fix the code. It would be great if you could take another look.

Copy link
Contributor

@tveasey tveasey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As discussed, we probably need to revisit cleanup on HANDLE_FATAL going forward, but this is the best fix for 7.9.

@valeriy42 valeriy42 merged commit 877a0d8 into elastic:master Jul 24, 2020
@valeriy42 valeriy42 deleted the fix-memory-status-race branch July 24, 2020 11:14
valeriy42 added a commit to valeriy42/ml-cpp that referenced this pull request Jul 24, 2020
When the job is failing due to the memory limit exceeding, the job fails to emit the final memory_status message. This PR fixes this.

There was a type in the "hard_limit" tag which I fixed. Also I tweaked the parameters of the unit test CDataFrameAnalyzerTrainingTest.testMemoryLimitHandling to reduce runtime.

Additionally I took the new messaging from elastic#1428 and added it here to avoid merge conflicts.
valeriy42 added a commit to valeriy42/ml-cpp that referenced this pull request Jul 24, 2020
When the job is failing due to the memory limit exceeding, the job fails to emit the final memory_status message. This PR fixes this.

There was a type in the "hard_limit" tag which I fixed. Also I tweaked the parameters of the unit test CDataFrameAnalyzerTrainingTest.testMemoryLimitHandling to reduce runtime.

Additionally I took the new messaging from elastic#1428 and added it here to avoid merge conflicts.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants