[SPARK-11188][SQL] Elide stacktraces in bin/spark-sql for AnalysisExceptions #9194
Conversation
ok to test
I would just say "Error in query: ${e.message}" or something. It will always be org.apache.spark.sql.AnalysisException, and we want to make it clear that it is likely the user's fault in this case (i.e. not a bug).
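A minimal sketch of the suggested behavior, assuming a hypothetical helper around query execution (the wrapper name and wiring are illustrative, not the PR's actual code):

```scala
import org.apache.spark.sql.AnalysisException

// Hypothetical wrapper: an AnalysisException almost always means the query
// itself is wrong, so print only its message; let anything unexpected
// propagate with its full stack trace.
def reportQueryError(runQuery: => Unit): Unit = {
  try {
    runQuery
  } catch {
    case e: AnalysisException =>
      System.err.println(s"Error in query: ${e.getMessage}")
  }
}
```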
Sure Michael. I will make the change.
Test build #44071 has finished for PR 9194 at commit
You should also be able to add a test in … Regarding the Hive version concern: I believe this is only going to touch the execution version of Hive (as opposed to the metastore connection version that we load in an isolated classloader). As such it should always be Hive 1.2.1.
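A hedged sketch of what such a test could look like, assuming a CLI harness along the lines of runCliWithin in the thrift-server CliSuite (the helper's name and signature are assumptions, as is the exact expected message):

```scala
import scala.concurrent.duration._

// Assumed harness: runCliWithin(timeout)(query -> expectedOutput, ...) runs
// bin/spark-sql and checks that each query's output contains the expectation.
test("SPARK-11188 Analysis error reporting") {
  runCliWithin(2.minutes)(
    "SELECT * FROM nonexistent_table;"
      -> "Error in query: no such table nonexistent_table;"
  )
}
```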
Test build #44131 has finished for PR 9194 at commit
@marmbrus What can we do here? Please advise.
retest this please.
@marmbrus
test this please
you need to be on the whitelist for Jenkins to respond to you
Test build #44169 has finished for PR 9194 at commit
Test build #44347 has finished for PR 9194 at commit
This is only testing that your message is there, not that the stack traces are elided. There is still a giant stack trace upon failure:
spark-sql> SELECT * FROM test;
15/10/26 10:34:27 INFO ParseDriver: Parsing command: SELECT * FROM test
15/10/26 10:34:27 INFO ParseDriver: Parse Completed
15/10/26 10:34:27 INFO HiveMetaStore: 0: get_table : db=default tbl=test
15/10/26 10:34:27 INFO audit: ugi=marmbrus ip=unknown-ip-addr cmd=get_table : db=default tbl=test
15/10/26 10:34:27 ERROR SparkSQLDriver: Failed in [SELECT * FROM test]
org.apache.spark.sql.AnalysisException: no such table test; line 1 pos 14
at org.apache.spark.sql.catalyst.analysis.package$AnalysisErrorAt.failAnalysis(package.scala:42)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.getTable(Analyzer.scala:260)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$$anonfun$apply$7.applyOrElse(Analyzer.scala:268)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$$anonfun$apply$7.applyOrElse(Analyzer.scala:264)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan$$anonfun$resolveOperators$1.apply(LogicalPlan.scala:57)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan$$anonfun$resolveOperators$1.apply(LogicalPlan.scala:57)
at org.apache.spark.sql.catalyst.trees.CurrentOrigin$.withOrigin(TreeNode.scala:51)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.resolveOperators(LogicalPlan.scala:56)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan$$anonfun$1.apply(LogicalPlan.scala:54)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan$$anonfun$1.apply(LogicalPlan.scala:54)
at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$4.apply(TreeNode.scala:249)
at scala.collection.Iterator$$anon$11.next(Iterator.scala:328)
at scala.collection.Iterator$class.foreach(Iterator.scala:727)
at scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
at scala.collection.generic.Growable$class.$plus$plus$eq(Growable.scala:48)
at scala.collection.mutable.ArrayBuffer.$plus$plus$eq(ArrayBuffer.scala:103)
at scala.collection.mutable.ArrayBuffer.$plus$plus$eq(ArrayBuffer.scala:47)
at scala.collection.TraversableOnce$class.to(TraversableOnce.scala:273)
at scala.collection.AbstractIterator.to(Iterator.scala:1157)
at scala.collection.TraversableOnce$class.toBuffer(TraversableOnce.scala:265)
at scala.collection.AbstractIterator.toBuffer(Iterator.scala:1157)
at scala.collection.TraversableOnce$class.toArray(TraversableOnce.scala:252)
at scala.collection.AbstractIterator.toArray(Iterator.scala:1157)
at org.apache.spark.sql.catalyst.trees.TreeNode.transformChildren(TreeNode.scala:279)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.resolveOperators(LogicalPlan.scala:54)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.apply(Analyzer.scala:264)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.apply(Analyzer.scala:254)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1$$anonfun$apply$1.apply(RuleExecutor.scala:83)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1$$anonfun$apply$1.apply(RuleExecutor.scala:80)
at scala.collection.LinearSeqOptimized$class.foldLeft(LinearSeqOptimized.scala:111)
at scala.collection.immutable.List.foldLeft(List.scala:84)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1.apply(RuleExecutor.scala:80)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1.apply(RuleExecutor.scala:72)
at scala.collection.immutable.List.foreach(List.scala:318)
at org.apache.spark.sql.catalyst.rules.RuleExecutor.execute(RuleExecutor.scala:72)
at org.apache.spark.sql.execution.QueryExecution.analyzed$lzycompute(QueryExecution.scala:41)
at org.apache.spark.sql.execution.QueryExecution.analyzed(QueryExecution.scala:41)
at org.apache.spark.sql.execution.QueryExecution.assertAnalyzed(QueryExecution.scala:39)
at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:132)
at org.apache.spark.sql.DataFrame$.apply(DataFrame.scala:51)
at org.apache.spark.sql.SQLContext.sql(SQLContext.scala:783)
at org.apache.spark.sql.hive.thriftserver.SparkSQLDriver.run(SparkSQLDriver.scala:61)
at org.apache.spark.sql.hive.thriftserver.SparkSQLCLIDriver.processCmd(SparkSQLCLIDriver.scala:308)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:376)
at org.apache.spark.sql.hive.thriftserver.SparkSQLCLIDriver$.main(SparkSQLCLIDriver.scala:226)
at org.apache.spark.sql.hive.thriftserver.SparkSQLCLIDriver.main(SparkSQLCLIDriver.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:680)
at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:180)
at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:205)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:120)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Error in query: no such table test; line 1 pos 14
@marmbrus
Thanks for your quick feedback. Actually, this is one of the questions I was asking earlier.
- The actual exception is also logged in the run method of SparkSQLDriver. If logging is directed to the console, the exception is printed on the screen. I hope this is OK?
I think logging goes to the console or stdout by default. If I change log4j.properties and direct the logging to a file, then we should not see a stack trace.
Michael, do you want this logging completely suppressed for AnalysisException?
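For illustration, a log4j.properties fragment along these lines (the appender name and file path are illustrative) would route logging to a file so the trace never reaches the terminal:

```properties
# Send all log output to a file instead of the console.
log4j.rootCategory=INFO, file
log4j.appender.file=org.apache.log4j.FileAppender
log4j.appender.file.File=spark-sql.log
log4j.appender.file.layout=org.apache.log4j.PatternLayout
log4j.appender.file.layout.ConversionPattern=%d{yy/MM/dd HH:mm:ss} %p %c{1}: %m%n
```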
@marmbrus
Hi Michael, given that logging to the console is the default, I have made changes to suppress console logging for analysis exceptions. One option is to stop logging these kinds of exceptions entirely, but instead I have tried to log them to non-console appenders. Please let me know what you think.
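A speculative sketch of the "non-console appenders" idea using the log4j 1.x API (all names here are illustrative, not the PR's actual code):

```scala
import org.apache.log4j.{Appender, ConsoleAppender, Level, Logger}
import org.apache.log4j.spi.LoggingEvent

// Build the logging event once, then hand it only to appenders that are
// not ConsoleAppenders, keeping the terminal clean for analysis errors.
def logOffConsole(message: String, e: Throwable): Unit = {
  val root = Logger.getRootLogger
  val event =
    new LoggingEvent(classOf[Logger].getName, root, Level.ERROR, message, e)
  val appenders = root.getAllAppenders
  while (appenders.hasMoreElements) {
    appenders.nextElement() match {
      case _: ConsoleAppender => // skip: keep the console clean
      case a: Appender        => a.doAppend(event)
    }
  }
}
```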
Test build #44454 has finished for PR 9194 at commit
As a side note, I think the target user of …
@marmbrus However, I like your suggestion of logging it at DEBUG level. It's much simpler, and if users have set logging to that level then it is OK for them to see this exception on the console (if they have selected a console appender). I will make the change per your suggestion.
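A sketch of that approach (object and method names assumed): analysis errors are demoted to DEBUG, so under the default INFO level they stay off the console, while unexpected failures are still logged at ERROR:

```scala
import org.apache.spark.Logging // Spark 1.x location of the Logging trait
import org.apache.spark.sql.AnalysisException

// Illustrative only: an AnalysisException is a likely user error, so log it
// at DEBUG; everything else keeps its full ERROR-level stack trace.
object QueryErrorLogger extends Logging {
  def logFailure(command: String, e: Throwable): Unit = e match {
    case ae: AnalysisException => logDebug(s"Failed in [$command]", ae)
    case other                 => logError(s"Failed in [$command]", other)
  }
}
```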
@marmbrus
Test build #44543 has finished for PR 9194 at commit
This LGTM. Can you update the title (remove WIP) and update the description (it will become the commit message when merging)?
@marmbrus Took out "WIP" and also changed the description.
Thanks for doing this! It would be nice to have this in branch-1.4 and branch-1.5. Would you be able to open up pull requests against those branches as well? This patch does not apply cleanly.
@marmbrus Thanks a LOT for your comments and guidance. I am still a little new to the process, so do I have to open two JIRAs, one for 1.4 and the other for 1.5, and associate the new pull requests with them?
I'm fine with using the existing JIRA. I've reopened it.
Only print the error message to the console for AnalysisExceptions in the spark-sql shell.