fix the head notation of package object dsl #632
Conversation
Can one of the admins verify this patch?
Don't remove this line.
OK, line added.
I guess this got pretty stale. Thanks for updating it! Jenkins, test this please.
Use GOPROXY for faster downloading of Go modules and fix the conformance job.
### What changes were proposed in this pull request?
Previously, `Dataset.isEmpty` judged whether a Dataset is empty with:
```
def isEmpty: Boolean = withAction("isEmpty", limit(1).groupBy().count().queryExecution) { plan =>
  plan.executeCollect().head.getLong(0) == 0
}
```
This adds two shuffles, one from `limit()` and one from `groupBy().count()`, and then collects the result to the driver. It avoids an OOM on the driver side because only the count is collected, but it forces all partitions to be computed and adds extra shuffle stages.
This PR changes it to:
```
def isEmpty: Boolean = withAction("isEmpty", select().queryExecution) { plan =>
  plan.executeTake(1).isEmpty
}
```
After this PR, column pruning is applied to the original `LogicalPlan` (via the empty `select()`) and the `executeTake()` API is used, so no extra shuffles are added and at most one partition is computed in the last stage. This reduces the cost of calling `Dataset.isEmpty()` without introducing memory pressure on the driver side.
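For illustration only (not part of the PR), the user-facing behavior of `isEmpty` is unchanged; only the execution cost differs. A minimal sketch, assuming an existing `SparkSession` named `spark`:
```
// Sketch: user-facing behavior is unchanged; `spark` is an assumed SparkSession.
val nonEmpty = spark.range(0, 1000000)
assert(!nonEmpty.isEmpty) // after this PR: fetches at most one row via executeTake(1)

val empty = spark.emptyDataFrame
assert(empty.isEmpty) // after this PR: no shuffle and no full count
```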
### Why are the changes needed?
Optimize `Dataset.isEmpty()` to avoid extra shuffles and computing all partitions.
### Does this PR introduce any user-facing change?
No
### How was this patch tested?
Existing unit tests.
Closes apache#26500 from AngersZhuuuu/SPARK-29874.
Authored-by: angerszhu <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>
KE-42110 Upgrade snappy-java to 1.1.10.1 (apache#632)
[HADP-53083][SPARK-47383][CORE] Support `spark.shutdown.timeout` config (apache#311)
### What changes were proposed in this pull request?
Make the shutdown hook timeout configurable. If it is not defined, the behavior falls back to the existing default of 30 seconds, or whatever is defined in core-site.xml for the hadoop.service.shutdown.timeout property.
### Why are the changes needed?
Spark sometimes times out during the shutdown process. This can result in data left in the queues being dropped, which causes metadata loss (e.g. event logs, anything written by custom listeners). Before this change it is not easily configurable: the underlying `org.apache.hadoop.util.ShutdownHookManager` has a default timeout of 30 seconds, and while it can be configured by setting hadoop.service.shutdown.timeout, this must be done in core-site.xml/core-default.xml because a new Hadoop conf object is created and there is no opportunity to modify it.
### Does this PR introduce any user-facing change?
Yes, a new config `spark.shutdown.timeout` is added.
### How was this patch tested?
Manual testing in spark-shell. This behavior is not practical to write a unit test for.
### Was this patch authored or co-authored using generative AI tooling?
No
Closes apache#45504 from robreeves/sc_shutdown_timeout.
Authored-by: Rob Reeves <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
Signed-off-by: Yujie Li <[email protected]>
Co-authored-by: Rob Reeves <[email protected]>
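A minimal sketch of setting the new config, assuming the standard `SparkSession` builder API (the key `spark.shutdown.timeout` comes from the commit above; the `60s` value and app name are illustrative):
```
import org.apache.spark.sql.SparkSession

// Raise the shutdown hook timeout from the 30-second default to 60 seconds.
val spark = SparkSession.builder()
  .appName("shutdown-timeout-demo") // illustrative name
  .config("spark.shutdown.timeout", "60s")
  .getOrCreate()
```
The same key can also be passed on the command line, e.g. `--conf spark.shutdown.timeout=60s` with spark-submit or spark-shell.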
There are some obvious bugs in the head notation of the package object `dsl`; this PR fixes them.