HOTFIX: Ignore python metastore files in RAT checks. #393

pwendell · 2014-04-11T20:13:20Z

This was causing some errors with pull request tests.

AmplabJenkins · 2014-04-11T20:18:13Z

Merged build triggered.

AmplabJenkins · 2014-04-11T20:18:21Z

Merged build started.

pwendell · 2014-04-11T20:23:10Z

Okay this passed the RAT checks so I'm going to merge it.

AmplabJenkins · 2014-04-11T21:49:15Z

Merged build finished.

AmplabJenkins · 2014-04-11T21:49:16Z

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14060/

This was causing some errors with pull request tests. Author: Patrick Wendell <[email protected]> Closes #393 from pwendell/hotfix and squashes the following commits: 6201dd3 [Patrick Wendell] HOTFIX: Ignore python metastore files in RAT checks.

Better error handling in Spark Streaming and more API cleanup Earlier errors in jobs generated by Spark Streaming (or in the generation of jobs) could not be caught from the main driver thread (i.e. the thread that called StreamingContext.start()) as it would be thrown in different threads. With this change, after `ssc.start`, one can call `ssc.awaitTermination()` which will be block until the ssc is closed, or there is an exception. This makes it easier to debug. This change also adds ssc.stop(<stop-spark-context>) where you can stop StreamingContext without stopping the SparkContext. Also fixes the bug that came up with PRs apache#393 and apache#381. MetadataCleaner default value has been changed from 3500 to -1 for normal SparkContext and 3600 when creating a StreamingContext. Also, updated StreamingListenerBus with changes similar to SparkListenerBus in apache#392. And changed a lot of protected[streaming] to private[streaming].

This was causing some errors with pull request tests. Author: Patrick Wendell <[email protected]> Closes apache#393 from pwendell/hotfix and squashes the following commits: 6201dd3 [Patrick Wendell] HOTFIX: Ignore python metastore files in RAT checks.

Run npm install under osb-checker/common

…ateExec` (apache#393) ### What changes were proposed in this pull request? Before evaluating the generator function in `GenerateExec`, initialize non-deterministic expressions. ### Why are the changes needed? The following query fails: ``` select * from explode( transform(sequence(0, cast(rand()*1000 as int) + 1), x -> x * 22) ); 23/09/14 09:27:25 ERROR Executor: Exception in task 0.0 in stage 3.0 (TID 3) java.lang.IllegalArgumentException: requirement failed: Nondeterministic expression org.apache.spark.sql.catalyst.expressions.Rand should be initialized before eval. at scala.Predef$.require(Predef.scala:281) at org.apache.spark.sql.catalyst.expressions.Nondeterministic.eval(Expression.scala:497) at org.apache.spark.sql.catalyst.expressions.Nondeterministic.eval$(Expression.scala:495) at org.apache.spark.sql.catalyst.expressions.RDG.eval(randomExpressions.scala:35) at org.apache.spark.sql.catalyst.expressions.BinaryArithmetic.eval(arithmetic.scala:384) at org.apache.spark.sql.catalyst.expressions.UnaryExpression.eval(Expression.scala:543) at org.apache.spark.sql.catalyst.expressions.BinaryArithmetic.eval(arithmetic.scala:384) at org.apache.spark.sql.catalyst.expressions.Sequence.eval(collectionOperations.scala:3062) at org.apache.spark.sql.catalyst.expressions.SimpleHigherOrderFunction.eval(higherOrderFunctions.scala:275) at org.apache.spark.sql.catalyst.expressions.SimpleHigherOrderFunction.eval$(higherOrderFunctions.scala:274) at org.apache.spark.sql.catalyst.expressions.ArrayTransform.eval(higherOrderFunctions.scala:308) at org.apache.spark.sql.catalyst.expressions.ExplodeBase.eval(generators.scala:375) at org.apache.spark.sql.execution.GenerateExec.$anonfun$doExecute$8(GenerateExec.scala:108) ... ``` However, this query succeeds: ``` select * from explode( sequence(0, cast(rand()*1000 as int) + 1) ); 0 1 2 3 ... 801 802 803 ``` The difference is that `transform` turns off whole-stage codegen, which exposes a bug in `GenerateExec` in which the non-deterministic expression passed to the generator function is not initialized before being used. This PR fixes the bug in `GenerateExec`. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? New unit test. ### Was this patch authored or co-authored using generative AI tooling? No. Closes apache#42933 from bersprockets/nondeterm_issue. Lead-authored-by: Bruce Robbins <[email protected]> (cherry picked from commit e097f91) Signed-off-by: Hyukjin Kwon <[email protected]> Co-authored-by: Bruce Robbins <[email protected]> Co-authored-by: Hyukjin Kwon <[email protected]>

HOTFIX: Ignore python metastore files in RAT checks.

6201dd3

This was causing some errors with pull request tests.

pwendell mentioned this pull request Apr 11, 2014

[SPARK-1436] In-memory columnar storage bug fixes #374

Closed

asfgit closed this in 6a0f8e3 Apr 12, 2014

mccheah pushed a commit to mccheah/spark that referenced this pull request Nov 28, 2018

Merge pull request apache#393 from palantir/dv/upstream

dcd5aae

bzhaoopenstack pushed a commit to bzhaoopenstack/spark that referenced this pull request Sep 11, 2019

Merge pull request apache#393 from liu-sheng/adjust-osb-checker

5419514

Run npm install under osb-checker/common

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HOTFIX: Ignore python metastore files in RAT checks. #393

HOTFIX: Ignore python metastore files in RAT checks. #393

Uh oh!

pwendell commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

pwendell commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

HOTFIX: Ignore python metastore files in RAT checks. #393

HOTFIX: Ignore python metastore files in RAT checks. #393

Uh oh!

Conversation

pwendell commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

pwendell commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

AmplabJenkins commented Apr 11, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants