[SPARK-16303][DOCS][EXAMPLES][WIP] Updated SQL programming guide and examples #14119

aokolnychyi · 2016-07-09T21:18:02Z

What changes were proposed in this pull request?

Hard-coded Spark SQL sample snippets were moved into source files under examples sub-project.
Removed the inconsistency between Scala and Java Spark SQL examples
Scala and Java Spark SQL examples were updated

How was this patch tested?

The work is still in progress. All involved examples were tested manually. An additional round of testing will be done after the code review.

aokolnychyi · 2016-07-09T21:23:48Z

@liancheng could you, please, review this PR?

aokolnychyi · 2016-07-09T22:37:00Z

docs/sql-programming-guide.md

-// spark is an existing HiveContext
-spark.refreshTable("my_table")
+// spark is an existing SparkSession
+spark.catalog.refreshTable("my_table")


Is it the correct way to refresh?

liancheng · 2016-07-11T13:26:49Z

test this please

liancheng · 2016-07-11T13:26:55Z

add to whitelist

SparkQA · 2016-07-11T13:28:41Z

Test build #62092 has finished for PR 14119 at commit 95f0f41.

This patch fails RAT tests.
This patch merges cleanly.
This patch adds no public classes.

liancheng · 2016-07-11T13:29:19Z

Since you've added JavaSparkSqlExample.scala, we can remove JavaSparkSQL.scala now. (I guess that file was from my original WIP branch?)

liancheng · 2016-07-11T13:30:14Z

examples/src/main/java/org/apache/spark/examples/sql/JavaSqlDataSourceExample.java

@@ -0,0 +1,192 @@
+package org.apache.spark.examples.sql;


Please add Apache license header.

liancheng · 2016-07-11T14:08:40Z

This looks pretty good! Only found a few minor issues. Thanks for working on it!

liancheng · 2016-07-11T14:09:46Z

Can we add actual stdout output after each .show() call?

…les. Changes after the initial review

aokolnychyi · 2016-07-12T22:11:40Z

Thanks for the review, I really appreciate. I tried to take into account all comments and update the PR accordingly.

Summary of the updates

JavaSparkSQL.java file was removed. I kept it initially since the file itself was quite old (2+ years) and it was present in your original WIP branch alongside the new file. But I can confirm that the new file covers the same functionality and more. No need to keep the old one, agree with you.
Apache header in JavaSqlDataSourceExample.java was added.
$-notation instead of df("...") in Scala examples.
col("...") instead of df.col("...") in Java examples.
Blank lines before {% include_example programmatic_schema ... } were added. However, everything was rendered fine locally even without them.
2 space indentation for chained method calls. My fault, sorry.
Actual outputs for all show() calls were added.
Tested manually and via ./dev/run-tests.

Open questions

Shall I add blank lines before each {% include_example ... } or only before those two examples?
I pointed to a wrong location that exceeded the length limit. It is exactly the same functionality but in Java. So, 113 and 117 lines of the JavaSqlDataSourceExample.java file. In my view, it would make sense to keep them as they are now for the better looking documentation.

SparkQA · 2016-07-12T22:25:45Z

Test build #62192 has finished for PR 14119 at commit 7451fc7.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

- Hard-coded Spark SQL sample snippets were moved into source files under examples sub-project. - Removed the inconsistency between Scala and Java Spark SQL examples - Scala and Java Spark SQL examples were updated The work is still in progress. All involved examples were tested manually. An additional round of testing will be done after the code review. ![image](https://cloud.githubusercontent.com/assets/6235869/16710314/51851606-462a-11e6-9fbe-0818daef65e4.png) Author: aokolnychyi <[email protected]> Closes #14119 from aokolnychyi/spark_16303. (cherry picked from commit 772c213) Signed-off-by: Cheng Lian <[email protected]>

liancheng · 2016-07-13T08:13:54Z

LGTM, I've merged this to master and branch-2.0. Thanks for working on this!

I only observed one weird rendering case caused by the missing blank lines before {% include_example %}, but maybe my local Jekyll version is too low. I think it's fine to leave other lines as is. The exceeded lines in the Java example file should be OK.

Could you please remove the WIP tag from the PR title? (I've removed it manually while merging this PR.)

[SPARK-16303][DOCS][EXAMPLES] Updated SQL programming guide and examples

95f0f41

aokolnychyi reviewed Jul 9, 2016
View reviewed changes

liancheng reviewed Jul 11, 2016
View reviewed changes

[SPARK-16303][DOCS][EXAMPLES] Updated SQL programming guide and examp…

7451fc7

…les. Changes after the initial review

asfgit closed this in 772c213 Jul 13, 2016

liancheng mentioned this pull request Jul 13, 2016

[SPARK-16380][SQL][Example]:Update SQL examples and programming guide for Python language binding #14098

Closed

[SPARK-16303][DOCS][EXAMPLES][WIP] Updated SQL programming guide and examples #14119

[SPARK-16303][DOCS][EXAMPLES][WIP] Updated SQL programming guide and examples #14119

Uh oh!

Conversation

aokolnychyi commented Jul 9, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

aokolnychyi commented Jul 9, 2016

Uh oh!

aokolnychyi Jul 9, 2016

Choose a reason for hiding this comment

Uh oh!

liancheng Jul 11, 2016

Choose a reason for hiding this comment

Uh oh!

liancheng commented Jul 11, 2016

Uh oh!

liancheng commented Jul 11, 2016

Uh oh!

SparkQA commented Jul 11, 2016

Uh oh!

liancheng commented Jul 11, 2016

Uh oh!

liancheng Jul 11, 2016

Choose a reason for hiding this comment

Uh oh!

liancheng commented Jul 11, 2016

Uh oh!

liancheng commented Jul 11, 2016

Uh oh!

aokolnychyi commented Jul 12, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Jul 12, 2016

Uh oh!

liancheng commented Jul 13, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aokolnychyi commented Jul 12, 2016 •

edited

Loading

liancheng commented Jul 13, 2016 •

edited

Loading