@gengliangwang
Member

@gengliangwang gengliangwang commented Mar 12, 2019

What changes were proposed in this pull request?

During the migration of CSV V2 (#24005), I found that we can improve the file source V2 framework by:

  1. Checking for duplicated column names in both read and write.
  2. Removing SupportsPushDownFilters from FileScanBuilder, since not all file sources support filter push down.
  3. Adding a new member options to FileScan, since the method isSplitable might require data source options.
  4. Making FileTable.schema a lazy value instead of a method.
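
To illustrate item 1, here is a minimal, self-contained sketch of a duplicate column name check. It is not the code from this PR and does not depend on Spark: the object `DuplicateColumnCheck`, its method names, and the `context` message parameter are all hypothetical, and the case-insensitive comparison is an assumption about how such a check might treat names.

```scala
import java.util.Locale

// Hypothetical helper, not Spark's implementation.
object DuplicateColumnCheck {

  // Returns the column names that occur more than once, sorted.
  // When caseSensitive is false, names are compared in lower case.
  def findDuplicates(columnNames: Seq[String], caseSensitive: Boolean): Seq[String] = {
    val normalized =
      if (caseSensitive) columnNames
      else columnNames.map(_.toLowerCase(Locale.ROOT))
    normalized
      .groupBy(identity)
      .collect { case (name, occurrences) if occurrences.size > 1 => name }
      .toSeq
      .sorted
  }

  // Throws if any duplicate is found; `context` describes where the check runs,
  // for example "in the data schema" on read or "in the output schema" on write.
  def checkNoDuplicates(
      columnNames: Seq[String],
      context: String,
      caseSensitive: Boolean): Unit = {
    val duplicates = findDuplicates(columnNames, caseSensitive)
    if (duplicates.nonEmpty) {
      throw new IllegalArgumentException(
        s"Found duplicate column(s) $context: ${duplicates.mkString(", ")}")
    }
  }
}
```

Under these assumptions, the same helper could be called once when a table's schema is resolved for reading and once before a write, so both paths reject a schema such as `Seq("a", "B", "b")` in case-insensitive mode.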

How was this patch tested?

Unit test

@gengliangwang
Member Author

@cloud-fan @dongjoon-hyun @HyukjinKwon

This PR is the "non-csv-datasource" part of #24005. It contains several improvements.

@SparkQA

SparkQA commented Mar 12, 2019

Test build #103368 has finished for PR 24066 at commit 34c1ce6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Member

dongjoon-hyun commented Mar 13, 2019

Could you rebase this onto master, @gengliangwang?

@SparkQA

SparkQA commented Mar 14, 2019

Test build #103488 has finished for PR 24066 at commit 144b076.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

s"Unable to infer schema for $name. It must be specified manually.")
}.asNullable

override def schema(): StructType = {
Contributor

Shall we make this a lazy val? It becomes expensive now.
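
For context on the cost difference, a small sketch, assuming schema inference is the expensive step: a `def` re-runs its body on every call, while a `lazy val` runs it once on first access and caches the result. `SketchTable` and `countInferences` are hypothetical names for illustration and are not Spark's `FileTable`.

```scala
object LazySchemaSketch {

  // Hypothetical stand-in for a file table; inferSchema models an expensive step.
  final class SketchTable(inferSchema: () => Seq[String]) {
    // As a method: inference runs again on every call.
    def schemaAsMethod: Seq[String] = inferSchema()

    // As a lazy val: inference runs once, on first access, and is cached.
    lazy val schema: Seq[String] = inferSchema()
  }

  // Reads the schema `accesses` times in each style and returns
  // (inference runs via method, inference runs via lazy val).
  def countInferences(accesses: Int): (Int, Int) = {
    var methodRuns = 0
    var lazyRuns = 0
    val viaMethod = new SketchTable(() => { methodRuns += 1; Seq("id", "name") })
    val viaLazy = new SketchTable(() => { lazyRuns += 1; Seq("id", "name") })
    (1 to accesses).foreach { _ =>
      viaMethod.schemaAsMethod
      viaLazy.schema
    }
    (methodRuns, lazyRuns)
  }
}
```

In this sketch, reading the schema three times triggers three inference runs through the method but only one through the lazy val.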

@SparkQA

SparkQA commented Mar 14, 2019

Test build #103498 has finished for PR 24066 at commit 6475d4f.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@cloud-fan
Contributor

thanks, merging to master!

@cloud-fan cloud-fan closed this in 2a37d6e Mar 15, 2019