Skip to content

Conversation

@HyukjinKwon
Copy link
Member

What changes were proposed in this pull request?

This PR proposes to partially revert 5fef6e3 so that it does make a readsupport and read schema when it writes in branch 2-4 since it's too breaking change.

5fef6e3 happened to create a readsupport in write path, which ended up with reading schema from readsupport at write path.

For instance, this breaks spark.range(1).format("source").write.save("non-existent-path") case since there's no way to read the schema from "non-existent-path".

See also #22009 (comment)
See also #22688
See also http://apache-spark-developers-list.1001551.n3.nabble.com/Possible-bug-in-DatasourceV2-td25343.html

How was this patch tested?

Unit test and manual tests.

@SparkQA

This comment has been minimized.

@SparkQA
Copy link

SparkQA commented Oct 11, 2018

Test build #97262 has finished for PR 22697 at commit b836625.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Oct 11, 2018

Test build #97260 has finished for PR 22697 at commit 9aa9375.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Oct 11, 2018

Test build #97261 has finished for PR 22697 at commit 540d4b9.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member Author

retest this please

@dongjoon-hyun
Copy link
Member

The following failure looks persistent and relevant.

org.apache.spark.sql.sources.v2.DataSourceV2Suite.SPARK-25700: do not read schema when writing

@HyukjinKwon
Copy link
Member Author

HyukjinKwon commented Oct 11, 2018

That was another commit before rebasing.

@SparkQA
Copy link

SparkQA commented Oct 11, 2018

Test build #97271 has finished for PR 22697 at commit b836625.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Copy link
Member

Retest this please.

@SparkQA
Copy link

SparkQA commented Oct 12, 2018

Test build #97281 has finished for PR 22697 at commit b836625.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member Author

cc @cloud-fan

@cloud-fan
Copy link
Contributor

thanks, merging to 2.4!

asfgit pushed a commit that referenced this pull request Oct 15, 2018
…n Data Source V2

## What changes were proposed in this pull request?

This PR proposes to partially revert 5fef6e3 so that it does make a readsupport and read schema when it writes in branch 2-4 since it's too breaking change.

5fef6e3 happened to create a readsupport in write path, which ended up with reading schema from readsupport at write path.

For instance, this breaks `spark.range(1).format("source").write.save("non-existent-path")` case since there's no way to read the schema from "non-existent-path".

See also #22009 (comment)
See also #22688
See also http://apache-spark-developers-list.1001551.n3.nabble.com/Possible-bug-in-DatasourceV2-td25343.html

## How was this patch tested?

Unit test and manual tests.

Closes #22697 from HyukjinKwon/append-revert-2.4.

Authored-by: hyukjinkwon <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>
@HyukjinKwon HyukjinKwon deleted the append-revert-2.4 branch October 16, 2018 12:41
bavardage pushed a commit to palantir/spark that referenced this pull request Oct 25, 2018
…urce V2 write path

## What changes were proposed in this pull request?

This PR proposes to avoid to make a readsupport and read schema when it writes in other save modes.

apache@5fef6e3 happened to create a readsupport in write path, which ended up with reading schema from readsupport at write path.

This breaks `spark.range(1).format("source").write.save("non-existent-path")` case since there's no way to read the schema from "non-existent-path".

See also apache#22009 (comment)
See also apache#22697
See also http://apache-spark-developers-list.1001551.n3.nabble.com/Possible-bug-in-DatasourceV2-td25343.html

## How was this patch tested?

Unit test and manual tests.

Closes apache#22688 from HyukjinKwon/append-revert-2.

Authored-by: hyukjinkwon <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
…urce V2 write path

## What changes were proposed in this pull request?

This PR proposes to avoid to make a readsupport and read schema when it writes in other save modes.

apache@5fef6e3 happened to create a readsupport in write path, which ended up with reading schema from readsupport at write path.

This breaks `spark.range(1).format("source").write.save("non-existent-path")` case since there's no way to read the schema from "non-existent-path".

See also apache#22009 (comment)
See also apache#22697
See also http://apache-spark-developers-list.1001551.n3.nabble.com/Possible-bug-in-DatasourceV2-td25343.html

## How was this patch tested?

Unit test and manual tests.

Closes apache#22688 from HyukjinKwon/append-revert-2.

Authored-by: hyukjinkwon <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants