Conversation

@amaliujia
Contributor

@amaliujia amaliujia commented Jun 17, 2022

What changes were proposed in this pull request?

  1. Add 3L namespace tests for CacheTable and isCached.
  2. Make UncacheTable 3L namespace compatible.
  3. Add new APIs setCurrentCatalog, currentCatalog, and listCatalogs, which are useful for the 3L namespace.

Why are the changes needed?

This is part of the 3L namespace effort.

Does this PR introduce any user-facing change?

Yes. Three new APIs are added in this PR (setCurrentCatalog, currentCatalog, listCatalogs), and the uncacheTable API will support the 3L namespace.

How was this patch tested?

UT
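
Based on the API names above, a minimal usage sketch of the three new methods might look like the following. This is an illustration only: the catalog name `testcat` and the plugin class `com.example.MyCatalogPlugin` are assumptions, and a real catalog plugin must be configured before switching to it.

```scala
import org.apache.spark.sql.SparkSession

// Assumes a catalog plugin is registered under the name "testcat";
// the plugin class below is a placeholder for illustration.
val spark = SparkSession.builder()
  .master("local[*]")
  .config("spark.sql.catalog.testcat", "com.example.MyCatalogPlugin")
  .getOrCreate()

// Query the session's current catalog name.
println(spark.catalog.currentCatalog())

// Switch the current catalog.
spark.catalog.setCurrentCatalog("testcat")

// List registered catalogs as a Dataset.
spark.catalog.listCatalogs().show()
```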

@amaliujia
Contributor Author

R: @cloud-fan

@github-actions github-actions bot added the SQL label Jun 17, 2022
@amaliujia amaliujia changed the title [SPARK-39506] Make CacheTable, isCached, UncacheTable, setCurrentCatalog, currentCatalog, listCatalogs 3l namespace compatible [SPARK-39506][SQL] Make CacheTable, isCached, UncacheTable, setCurrentCatalog, currentCatalog, listCatalogs 3l namespace compatible Jun 17, 2022
@AmplabJenkins

Can one of the admins verify this patch?

Contributor

If it's simply a string, shall we just let listCatalogs return Dataset[String]?

Contributor Author

We might design the class like this:

case class CatalogMetadata(
    name: String,
    description: String,
    owner: Option[String] = None,
    catalogType: Option[CatalogType] = None,
    createdAt: Option[Long] = None,
    createdBy: Option[String] = None,
    updatedAt: Option[Long] = None,
    updatedBy: Option[String] = None
)

However, I am not sure whether there is extra information we can fetch for a catalog. Do you know if we have such rich metadata for a catalog in Spark?

Contributor Author

If Spark only knows the catalog name, then probably Dataset[String] would be fine.

Contributor

ok let's keep it then, and add a description field so that it doesn't look so weird.

About the naming, how about just Catalog, to match Database and Table in this file?
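
Following this suggestion, the pared-down class (whatever name it ends up with) could be as simple as the sketch below: keep the name field and add a description so the API doesn't return a bare string. This is not necessarily the final shape in Spark.

```scala
// Sketch of a minimal catalog-metadata class per the suggestion above:
// a name plus a description field, dropping the optional rich metadata.
case class CatalogMetadata(name: String, description: String)

val builtIn = CatalogMetadata("spark_catalog", "the built-in session catalog")
println(builtIn.name)
println(builtIn.description)
```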

Contributor Author

Sounds good for keeping the class and additionally adding a description field.

Regarding renaming to Catalog, it will have a naming conflict with

Contributor

ah I see!

@amaliujia
Contributor Author

Also I am seeing this

[info] spark-streaming-kinesis-asl: mimaPreviousArtifacts not set, not analyzing binary compatibility
[error] spark-sql: Failed binary compatibility check against org.apache.spark:spark-sql_2.12:3.2.0! Found 3 potential problems (filtered 602)
[error]  * abstract method currentCatalog()java.lang.String in class org.apache.spark.sql.catalog.Catalog is present only in current version
[error]    filter with: ProblemFilters.exclude[ReversedMissingMethodProblem]("org.apache.spark.sql.catalog.Catalog.currentCatalog")
[error]  * abstract method setCurrentCatalog(java.lang.String)Unit in class org.apache.spark.sql.catalog.Catalog is present only in current version
[error]    filter with: ProblemFilters.exclude[ReversedMissingMethodProblem]("org.apache.spark.sql.catalog.Catalog.setCurrentCatalog")
[error]  * abstract method listCatalogs()org.apache.spark.sql.Dataset in class org.apache.spark.sql.catalog.Catalog is present only in current version
[error]    filter with: ProblemFilters.exclude[ReversedMissingMethodProblem]("org.apache.spark.sql.catalog.Catalog.listCatalogs")

Any idea how to make the new APIs pass this binary compatibility check?
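
The MiMa log above already prints the exact filter expressions to use. The conventional fix in the Spark build is to register those filters in the exclusion list in project/MimaExcludes.scala; a sketch of the entries (the surrounding `Seq` name and structure here are illustrative, matching that file's style rather than copied from it):

```scala
// Sketch: MiMa exclusions registered in project/MimaExcludes.scala,
// using the filter strings MiMa itself suggests in the log above.
import com.typesafe.tools.mima.core._

lazy val newCatalogApiExcludes = Seq(
  ProblemFilters.exclude[ReversedMissingMethodProblem](
    "org.apache.spark.sql.catalog.Catalog.currentCatalog"),
  ProblemFilters.exclude[ReversedMissingMethodProblem](
    "org.apache.spark.sql.catalog.Catalog.setCurrentCatalog"),
  ProblemFilters.exclude[ReversedMissingMethodProblem](
    "org.apache.spark.sql.catalog.Catalog.listCatalogs")
)
```

These exclusions tell MiMa that the new abstract methods are intentional additions to the Catalog API rather than accidental breaks.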

@HyukjinKwon
Member

Adding @zhengruifeng FYI

@github-actions github-actions bot added the BUILD label Jun 23, 2022
assert(spark.catalog.currentCatalog().equals("testcat"))
spark.catalog.setCurrentCatalog("spark_catalog")
assert(spark.catalog.currentCatalog().equals("spark_catalog"))
assert(spark.catalog.listCatalogs().collect().map(c => c.name).toSet == Set("testcat"))
Contributor

Not related to this PR, but we should figure out why spark_catalog is missing here.

@cloud-fan
Contributor

The last commit only updates a comment; I'm merging it to master, thanks!

@cloud-fan cloud-fan closed this in 299cdfa Jun 24, 2022
@amaliujia
Contributor Author

@cloud-fan thank you!
