
Conversation

@aarondav (Contributor) commented Apr 8, 2014

This was added to the check for the assembly jar, but was forgotten for the datanucleus jars.

SPARK-1445: compute-classpath should not print error if lib_managed not found

Redirecting stderr to /dev/null can't be that much slower than an if [ -z ]...
@AmplabJenkins: Merged build triggered.

@AmplabJenkins: Merged build started.

@AmplabJenkins: Merged build finished. All automated tests passed.

@AmplabJenkins: All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13898/

asfgit closed this in e25b593 Apr 8, 2014
pdeyhim pushed a commit to pdeyhim/spark-1 that referenced this pull request Jun 25, 2014
SPARK-1445: compute-classpath should not print error if lib_managed not found

This was added to the check for the assembly jar, but was forgotten for the datanucleus jars.

Author: Aaron Davidson <[email protected]>

Closes apache#361 from aarondav/cc and squashes the following commits:

8facc16 [Aaron Davidson] SPARK-1445: compute-classpath should not print error if lib_managed not found
tangzhankun pushed a commit to tangzhankun/spark that referenced this pull request Jul 21, 2017
This commit tries to solve issue apache#359 by allowing the `spark.executor.cores` configuration key to take fractional values, e.g., 0.5 or 1.5. The value is used to specify the cpu request when creating the executor pods, which is allowed to be fractional by Kubernetes. When the value is passed to the executor process through the environment variable `SPARK_EXECUTOR_CORES`, the value is rounded up to the closest integer as required by the `CoarseGrainedExecutorBackend`.

Signed-off-by: Yinan Li <[email protected]>
erikerlandson pushed a commit to erikerlandson/spark that referenced this pull request Jul 28, 2017
This commit tries to solve issue apache#359 by allowing the `spark.executor.cores` configuration key to take fractional values, e.g., 0.5 or 1.5. The value is used to specify the cpu request when creating the executor pods, which is allowed to be fractional by Kubernetes. When the value is passed to the executor process through the environment variable `SPARK_EXECUTOR_CORES`, the value is rounded up to the closest integer as required by the `CoarseGrainedExecutorBackend`.

Signed-off-by: Yinan Li <[email protected]>
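
A minimal Scala sketch of the rounding described in the commit message above (the object and method names are invented for illustration, not Spark's actual code): the fractional `spark.executor.cores` value is passed through unchanged as the pod's CPU request, while the value handed to the executor backend is rounded up to a whole number of cores.

```scala
// Illustrative sketch only, not Spark's implementation: keep the fractional
// value for the pod's CPU request, round up for SPARK_EXECUTOR_CORES.
object FractionalCoresSketch {
  // Kubernetes accepts fractional CPU requests, so pass the value through unchanged.
  def cpuRequest(executorCores: Double): Double = executorCores

  // CoarseGrainedExecutorBackend expects an integer core count, so round up.
  def backendCores(executorCores: Double): Int = math.ceil(executorCores).toInt

  def main(args: Array[String]): Unit = {
    Seq(0.5, 1.0, 1.5).foreach { cores =>
      println(s"spark.executor.cores=$cores -> cpu request=${cpuRequest(cores)}, " +
        s"SPARK_EXECUTOR_CORES=${backendCores(cores)}")
    }
  }
}
```
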
mccheah added a commit to mccheah/spark that referenced this pull request Oct 3, 2018
bzhaoopenstack pushed a commit to bzhaoopenstack/spark that referenced this pull request Sep 11, 2019
This change refactors the job that runs the osb-checker tests against
huaweicloud, because osb-checker has undergone some refactoring.

Closes: theopenlab/openlab#90
arjunshroff pushed a commit to arjunshroff/spark that referenced this pull request Nov 24, 2020
RolatZhang pushed a commit to RolatZhang/spark that referenced this pull request Mar 18, 2022
turboFei pushed a commit to turboFei/spark that referenced this pull request Nov 6, 2025
…t overflow (apache#361)

This is a cherry-pick from apache#44006 to Spark 3.5.

### What changes were proposed in this pull request?
This change adds a check for overflows when creating Parquet row group filters on an INT32 (byte/short/int) parquet type to avoid incorrectly skipping row groups if the predicate value doesn't fit in an INT. This can happen if the read schema is specified as LONG, e.g. via `.schema("col LONG")`.
While the Parquet readers don't support reading INT32 into a LONG, the overflow can lead to row groups being incorrectly skipped, bypassing the reader altogether and producing incorrect results instead of failing.
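
A minimal sketch of the kind of check described above, written independently of Spark's internal filter-pushdown APIs (the object and helper names are invented for illustration): a row group filter on an INT32 column is only worth building when the LONG predicate value actually fits in an Int.

```scala
// Illustrative sketch, not Spark's actual ParquetFilters code: only create an
// INT32 row group filter when the Long predicate value fits in an Int.
object Int32FilterGuardSketch {
  def fitsInInt32(value: Long): Boolean =
    value >= Int.MinValue && value <= Int.MaxValue

  def main(args: Array[String]): Unit = {
    println(fitsInInt32(42L))            // true  -> safe to push down as an INT32 filter
    println(fitsInInt32(Long.MaxValue))  // false -> skip the filter, keep all row groups
  }
}
```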

### Why are the changes needed?
Reading a parquet file containing INT32 values with a read schema specified as LONG can produce incorrect results today:
```
Seq(0).toDF("a").write.parquet(path)
spark.read.schema("a LONG").parquet(path).where(s"a < ${Long.MaxValue}").collect()
```
will return an empty result; the sketch after this list illustrates the overflow behind it. The correct result is either:
- Failing the query if the parquet reader doesn't support upcasting integers to longs (all parquet readers in Spark today)
- Returning the result `[0]` if the parquet reader supports that upcast (no readers in Spark as of now, but I'm looking into adding this capability).
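
As an illustration of the arithmetic behind that empty result (plain Scala, independent of Spark, with values chosen only for demonstration): narrowing the LONG bound to INT32 overflows to a negative number, so the pushed-down bound excludes every row group.

```scala
// Illustrative arithmetic only: why "a < Long.MaxValue" can skip a row group
// containing 0 once the bound is narrowed to the file's INT32 type.
object OverflowDemoSketch {
  def main(args: Array[String]): Unit = {
    val bound = Long.MaxValue
    val narrowed = bound.toInt          // overflows: 9223372036854775807 -> -1
    println(s"narrowed bound: $narrowed")

    // Row group statistics for a file containing the value 0 have min = 0.
    // "min < narrowed bound" is false, so the row group would be skipped
    // and the query would (incorrectly) return no rows.
    val rowGroupMin = 0
    println(s"row group kept: ${rowGroupMin < narrowed}")
  }
}
```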

### Does this PR introduce _any_ user-facing change?
The following:
```
Seq(0).toDF("a").write.parquet(path)
spark.read.schema("a LONG").parquet(path).where(s"a < ${Long.MaxValue}").collect()
```
produces an (incorrect) empty result before this change. After this change, the read will fail, raising an error about the unsupported conversion from INT to LONG in the parquet reader.

### How was this patch tested?
- Added tests to `ParquetFilterSuite` to ensure that no row group filter is created when the predicate value overflows or when the value type isn't compatible with the parquet type
- Added test to `ParquetQuerySuite` covering the correctness issue described above.

### Was this patch authored or co-authored using generative AI tooling?
No

Closes apache#44154 from johanl-db/SPARK-46092-row-group-skipping-overflow-3.5.

Authored-by: Johan Lasperas <[email protected]>

Signed-off-by: Dongjoon Hyun <[email protected]>
Co-authored-by: Johan Lasperas <[email protected]>