@viirya
Member

viirya commented Oct 20, 2016

What changes were proposed in this pull request?

This PR addresses the problem that #14912 previously tried to solve. In short, some predicates currently cannot be pushed down through operators because they take the form of a disjunction (a chain of ORs).

A simple example is (a > 10) || (b > 2 && c == 3). If a data source has only the attributes a and b, this filter predicate cannot be pushed down, because it also references c. If we convert it to CNF, it becomes (a > 10 || b > 2) && (a > 10 || c == 3), and we can then push down the conjunct (a > 10 || b > 2).

Converting the predicate to CNF solves this in a principled way, instead of the hacky approach taken in #14912.
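
As a rough illustration of the idea (not the Catalyst code in this patch), the sketch below models predicates as a tiny expression tree, distributes OR over AND to reach CNF, and then keeps only the conjuncts whose referenced attributes are all available. The names `Atom`, `And`, `Or`, `to_cnf`, and `pushable` are hypothetical and exist only for this example.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Union


@dataclass(frozen=True)
class Atom:
    """A leaf comparison such as `a > 10`; `attr` is the attribute it reads."""
    attr: str
    text: str


@dataclass(frozen=True)
class And:
    left: "Expr"
    right: "Expr"


@dataclass(frozen=True)
class Or:
    left: "Expr"
    right: "Expr"


Expr = Union[Atom, And, Or]


def to_cnf(e: Expr) -> Expr:
    """Distribute OR over AND until the tree is a conjunction of disjunctions."""
    if isinstance(e, And):
        return And(to_cnf(e.left), to_cnf(e.right))
    if isinstance(e, Or):
        l, r = to_cnf(e.left), to_cnf(e.right)
        if isinstance(l, And):  # (x && y) || r  ->  (x || r) && (y || r)
            return And(to_cnf(Or(l.left, r)), to_cnf(Or(l.right, r)))
        if isinstance(r, And):  # l || (x && y)  ->  (l || x) && (l || y)
            return And(to_cnf(Or(l, r.left)), to_cnf(Or(l, r.right)))
        return Or(l, r)
    return e


def conjuncts(e: Expr) -> List[Expr]:
    """Split a predicate on its top-level ANDs."""
    if isinstance(e, And):
        return conjuncts(e.left) + conjuncts(e.right)
    return [e]


def references(e: Expr) -> FrozenSet[str]:
    """Collect every attribute the predicate reads."""
    if isinstance(e, Atom):
        return frozenset({e.attr})
    return references(e.left) | references(e.right)


def pushable(e: Expr, available: Iterable[str]) -> List[Expr]:
    """Conjuncts of the CNF form that only use the available attributes."""
    avail = frozenset(available)
    return [c for c in conjuncts(to_cnf(e)) if references(c) <= avail]


def show(e: Expr) -> str:
    if isinstance(e, Atom):
        return e.text
    op = " && " if isinstance(e, And) else " || "
    return "(" + show(e.left) + op + show(e.right) + ")"


a, b, c = Atom("a", "a > 10"), Atom("b", "b > 2"), Atom("c", "c == 3")
pred = Or(a, And(b, c))

print(show(to_cnf(pred)))                            # ((a > 10 || b > 2) && (a > 10 || c == 3))
print([show(p) for p in pushable(pred, {"a", "b"})])  # ['(a > 10 || b > 2)']
```

Without the CNF step the predicate has a single conjunct that references a, b, and c, so nothing can be pushed to a source that exposes only a and b. Note that distributing OR over AND can grow the expression exponentially in the worst case, so a real implementation would likely need a size limit.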

There have been previous PRs for CNF conversion, such as #8200. Most of the tests added in CNFNormalizationSuite are copied from #8200.

How was this patch tested?

Jenkins tests.


@SparkQA

SparkQA commented Oct 20, 2016

Test build #67234 has finished for PR 15558 at commit 5343947.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@viirya
Member Author

viirya commented Oct 24, 2016

cc @hvanhovell @cloud-fan Can you review this when you have time? Thanks!

@viirya
Member Author

viirya commented Nov 3, 2016

ping @hvanhovell @cloud-fan again.

@cloud-fan
Contributor

cc @liancheng

@SparkQA

SparkQA commented Dec 13, 2016

Test build #70038 has finished for PR 15558 at commit 5343947.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

3 participants