[SPARK-17229][SQL] PostgresDialect shouldn't widen float and short types during reads #14796
## What changes were proposed in this pull request?
When reading `float4` and `smallint` columns from PostgreSQL, Spark's `PostgresDialect` widens these types to Decimal and Integer rather than using the narrower Float and Short types. According to https://www.postgresql.org/docs/7.1/static/datatype.html#DATATYPE-TABLE, Postgres maps the `smallint` type to a signed two-byte integer and the `real`/`float4` types to single-precision floating-point numbers.

This patch fixes this by adding more special cases to `getCatalystType`, similar to what was done for the Derby JDBC dialect. I also fixed a similar problem in the write path, which caused Spark to create integer columns in Postgres for what should have been `ShortType` columns.
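As a rough sketch of the kind of special-casing involved, the snippet below expresses both the read-path and write-path mappings against Spark's public `JdbcDialect` extension point. The object name is made up for illustration; the actual patch edits the built-in `PostgresDialect` rather than registering a new dialect.

```scala
import java.sql.Types

import org.apache.spark.sql.jdbc.{JdbcDialect, JdbcDialects, JdbcType}
import org.apache.spark.sql.types._

// Illustrative stand-in for the PostgresDialect changes; the real patch
// modifies PostgresDialect directly instead of adding a separate dialect.
object NarrowPostgresDialect extends JdbcDialect {

  override def canHandle(url: String): Boolean =
    url.startsWith("jdbc:postgresql")

  // Read path: map Postgres real/float4 and smallint to the narrower
  // Catalyst types instead of letting the generic mapping widen them.
  override def getCatalystType(
      sqlType: Int,
      typeName: String,
      size: Int,
      md: MetadataBuilder): Option[DataType] = sqlType match {
    case Types.REAL => Some(FloatType)
    case Types.SMALLINT => Some(ShortType)
    case _ => None // fall back to the default mapping
  }

  // Write path: have ShortType columns come out as SMALLINT (and FloatType
  // as FLOAT4) instead of being widened to INTEGER.
  override def getJDBCType(dt: DataType): Option[JdbcType] = dt match {
    case FloatType => Some(JdbcType("FLOAT4", Types.FLOAT))
    case ShortType => Some(JdbcType("SMALLINT", Types.SMALLINT))
    case _ => None
  }
}

// A standalone dialect like this one would be enabled with:
// JdbcDialects.registerDialect(NarrowPostgresDialect)
```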
## How was this patch tested?

New test cases in `PostgresIntegrationSuite` (which I ran manually because Jenkins can't run it right now).
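For a sense of what such a test asserts, here is a hypothetical check; the JDBC URL, table name, and column names are placeholders, not the suite's actual fixtures (the integration suite provisions its own Postgres instance and test tables).

```scala
import java.util.Properties

import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types.{FloatType, ShortType}

val spark = SparkSession.builder().master("local[*]").getOrCreate()

// Placeholder connection details for illustration only.
val jdbcUrl = "jdbc:postgresql://localhost:5432/test?user=test&password=test"
val df = spark.read.jdbc(jdbcUrl, "narrow_types_table", new Properties)

val types = df.schema.fields.map(f => f.name -> f.dataType).toMap
// With the fix, float4 reads back as FloatType and smallint as ShortType,
// rather than the widened types.
assert(types("float4_col") == FloatType)
assert(types("smallint_col") == ShortType)
```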