Increase InternalHistogramTests coverage #36004

javanna · 2018-11-28T14:29:18Z

In `InternalHistogramTests` we were randomizing different values, but `minDocCount` was hardcoded to `1`. It's important to test other values, especially `0`, as it's the default. To make this possible, the test needed some adapting in the way buckets are randomly generated: all aggs need to share the same `interval`, `minDocCount` and `emptyBucketInfo`. Assertions also need to take into account that more (or fewer) buckets are expected depending on `minDocCount`.

This originated from #35921, which needs to test adding empty buckets as part of the reduce phase.
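
To make the effect of `minDocCount` on the expected buckets concrete, here is a minimal standalone sketch. It is not the test code from this PR: `ExpectedBucketsSketch` and `expectedBuckets` are hypothetical names, keys are assumed to be exact multiples of the interval, and the extended bounds carried by `emptyBucketInfo` are ignored for simplicity.

```java
import java.util.Map;
import java.util.TreeMap;

public class ExpectedBucketsSketch {

    /**
     * Hypothetical helper: given the summed doc counts per key after a reduce,
     * returns the buckets we would expect to see in the result.
     */
    static TreeMap<Double, Long> expectedBuckets(Map<Double, Long> reducedCounts, double interval, long minDocCount) {
        TreeMap<Double, Long> sorted = new TreeMap<>(reducedCounts);
        TreeMap<Double, Long> expected = new TreeMap<>();
        if (sorted.isEmpty()) {
            return expected;
        }
        if (minDocCount == 0) {
            // Gaps between the smallest and largest key are filled with empty buckets.
            double first = sorted.firstKey();
            long steps = Math.round((sorted.lastKey() - first) / interval);
            for (long i = 0; i <= steps; i++) {
                double key = first + i * interval;
                expected.put(key, sorted.getOrDefault(key, 0L));
            }
        } else {
            // Buckets below the threshold are dropped, and no gaps are filled.
            for (Map.Entry<Double, Long> entry : sorted.entrySet()) {
                if (entry.getValue() >= minDocCount) {
                    expected.put(entry.getKey(), entry.getValue());
                }
            }
        }
        return expected;
    }

    public static void main(String[] args) {
        Map<Double, Long> counts = new TreeMap<>();
        counts.put(0.0, 3L);
        counts.put(10.0, 1L);
        counts.put(20.0, 2L);
        System.out.println(expectedBuckets(counts, 5.0, 0).size()); // 5: gaps at 5.0 and 15.0 are filled
        System.out.println(expectedBuckets(counts, 5.0, 1).size()); // 3: no gap filling
        System.out.println(expectedBuckets(counts, 5.0, 2).size()); // 2: the bucket at 10.0 is dropped
    }
}
```

The same input therefore yields more or fewer buckets depending on `minDocCount`, which is why the assertions cannot assume a fixed bucket count.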

Also relates to #26856: one more key comparison needed to use `Double.compare` to properly handle `NaN` values, which surfaced through the increased test coverage.
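
For readers unfamiliar with the `NaN` issue, a minimal standalone sketch follows. The class and method names are made up for illustration and are not taken from the Elasticsearch code base. The point is only that `==` never considers two `NaN` doubles equal, while `Double.compare` does.

```java
public class NaNKeyComparison {

    // Hypothetical helper: compares two bucket keys with the == operator.
    static boolean sameKeyWithOperator(double a, double b) {
        return a == b;
    }

    // Hypothetical helper: compares two bucket keys with Double.compare.
    static boolean sameKeyWithCompare(double a, double b) {
        return Double.compare(a, b) == 0;
    }

    public static void main(String[] args) {
        double nan = Double.NaN;
        System.out.println(sameKeyWithOperator(nan, nan)); // false: NaN == NaN is never true
        System.out.println(sameKeyWithCompare(nan, nan));  // true: Double.compare treats NaN as equal to itself
        System.out.println(sameKeyWithOperator(1.5, 1.5)); // true
        System.out.println(sameKeyWithCompare(1.5, 1.5));  // true
    }
}
```

With `==`, two buckets that both have a `NaN` key would never be recognized as the same bucket.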

elasticmachine · 2018-11-28T14:29:20Z

Pinging @elastic/es-analytics-geo

jimczi

I left one question, LGTM otherwise

jimczi · 2018-11-28T14:58:35Z

...test/java/org/elasticsearch/search/aggregations/bucket/histogram/InternalHistogramTests.java

+    public void setUp() throws Exception {
        super.setUp();
        keyed = randomBoolean();
        format = randomNumericDocValueFormat();


Can you add a small comment explaining why we need to use the same interval, offset, ... in all tests?
The other solution would be to add an abstract method in `InternalAggregationTestCase` that creates a list of random instances for the reduce test.

I will add a comment. Given the issues that this triggered in other test methods, and the fact that we were already doing the same for a couple of fields, I think it makes sense to make this change overall rather than changing only the reduce test. I think it makes the test more realistic too.

I thought a bit more about this and I think I understand your comment better now. Why limit the randomization of different test instances if the same interval etc. is needed only for proper reduction tests? On the other hand, it seems like reduction tests are the only case where we call `createTestInstance` multiple times as part of the same test method. The consequence of the current change is that all of the test methods in the same run will reuse the same `interval`, `offset`, `minDocCount` and `emptyBucketInfo` values, which may limit test coverage a bit (it's not true that it makes the test more realistic, like I previously said). I wonder if it's worth changing this, though, and making the base class more complicated.

We could have a `createTestReduceInstances` that by default calls `createTestInstance` and that can be overridden by subclasses like this one to ensure that the instances share the same parameters. I agree that it would complicate the base class a little bit, but it would also make the test more realistic, so I have mixed feelings. Let's continue the discussion since we also need to fix the date_histogram tests.
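
A rough sketch of what such a hook could look like, under heavy assumptions: `createTestReduceInstances` is only the name proposed above and does not exist as shown, and `FakeHistogram`, `BaseTestCase` and `HistogramTestCase` are simplified stand-ins invented for this example, not the real `InternalHistogram` or `InternalAggregationTestCase`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

public class ReduceInstancesSketch {

    /** Simplified stand-in for a histogram result; not the real InternalHistogram. */
    static final class FakeHistogram {
        final double interval;
        final long minDocCount;

        FakeHistogram(double interval, long minDocCount) {
            this.interval = interval;
            this.minDocCount = minDocCount;
        }
    }

    /** Simplified stand-in for the base test case. */
    abstract static class BaseTestCase<T> {
        protected final Random random = new Random();

        protected abstract T createTestInstance();

        /** Hypothetical hook: by default every instance is independently random. */
        protected List<T> createTestReduceInstances(int size) {
            List<T> instances = new ArrayList<>(size);
            for (int i = 0; i < size; i++) {
                instances.add(createTestInstance());
            }
            return instances;
        }
    }

    /** Subclass that needs all instances in a reduce to share the same parameters. */
    static final class HistogramTestCase extends BaseTestCase<FakeHistogram> {
        @Override
        protected FakeHistogram createTestInstance() {
            // Fully random: fine for tests that only look at one instance.
            return new FakeHistogram(1 + random.nextInt(10), random.nextInt(3));
        }

        @Override
        protected List<FakeHistogram> createTestReduceInstances(int size) {
            // Pick the shared parameters once, then reuse them for every instance.
            double interval = 1 + random.nextInt(10);
            long minDocCount = random.nextInt(3);
            List<FakeHistogram> instances = new ArrayList<>(size);
            for (int i = 0; i < size; i++) {
                instances.add(new FakeHistogram(interval, minDocCount));
            }
            return instances;
        }
    }

    public static void main(String[] args) {
        List<FakeHistogram> instances = new HistogramTestCase().createTestReduceInstances(5);
        for (FakeHistogram histogram : instances) {
            System.out.println(histogram.interval + " / " + histogram.minDocCount);
        }
    }
}
```

With this shape, only the reduce test is constrained to shared parameters, while every other test method keeps fully independent randomization.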

javanna added 2 commits November 28, 2018 15:15

add else branch

fe1ebb6

javanna added the >test, :Analytics/Aggregations, v7.0.0 and v6.6.0 labels Nov 28, 2018

javanna requested review from colings86 and jimczi November 28, 2018 14:29

jimczi approved these changes Nov 28, 2018

add comment

089eb62

javanna merged commit 4b85769 into elastic:master Nov 28, 2018

javanna added the backport pending label Nov 28, 2018

javanna mentioned this pull request Nov 29, 2018

Histogram aggs: add empty buckets only in the final reduce step #35921

Merged

javanna removed the backport pending label Dec 3, 2018

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019
