Conversation

@jorisvandenbossche
Member

Superseded by #8208 (converting to object dtype also converts the datetime64 values to datetime.datetime, so it is no longer needed that sqlalchemy can work with pandas' Timestamp type).


Closes #7103, #7936

This adds a pandas.io.sql.Timestamp class to handle Timestamps in to_sql. It basically converts them to datetime.datetime before writing to the database.

The problem is that this makes writing slower (I tested with sqlite: for a dataframe with only a datetime64 column it is about 20% slower), while it already worked before for some drivers (like psycopg2 and MySQLdb).
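For concreteness, here is a minimal sketch of what such a coercing type could look like, built on SQLAlchemy's TypeDecorator hook. The class name mirrors the pandas.io.sql.Timestamp mentioned above, but the body is illustrative, not necessarily the PR's actual implementation:

```python
import sqlalchemy.types as sqltypes


class Timestamp(sqltypes.TypeDecorator):
    """Coerce pandas Timestamp values to datetime.datetime before binding."""

    impl = sqltypes.DateTime

    def process_bind_param(self, value, dialect):
        # pandas.Timestamp.to_pydatetime() returns a plain datetime.datetime,
        # a type every DBAPI driver knows how to bind.
        if value is not None and hasattr(value, "to_pydatetime"):
            return value.to_pydatetime()
        return value
```

The per-value process_bind_param call on every bound parameter is a plausible source of the ~20% slowdown mentioned above.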

@jorisvandenbossche jorisvandenbossche added the IO SQL to_sql, read_sql, read_sql_query label Sep 7, 2014
@jorisvandenbossche jorisvandenbossche added this to the 0.15.0 milestone Sep 7, 2014
@jreback
Contributor

jreback commented Sep 7, 2014

So just do this for certain database types then (it will probably be similar for timedelta).

@jorisvandenbossche
Member Author

It depends on the driver, not the database type. We could indeed do that, but it feels a bit clumsy. Also, I tested with psycopg2/pymysql/MySQLdb/mysql.connector, but there are a lot more drivers whose behaviour I don't know (and the behaviour may also differ between versions of the same driver).

While testing the fix for NaN values, I noticed that doing df.astype(object) (needed to get None values instead of NaN) also converts Timestamp/datetime64 values to datetime.datetime. Is this the expected behaviour? I would have expected individual Timestamp objects.
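The behaviour being asked about can be reproduced with a tiny snippet (hedged: the second result reflects the pandas version under discussion here; later versions may keep Timestamp objects instead):

```python
import pandas as pd

df = pd.DataFrame({"t": pd.to_datetime(["2014-09-07"])})

print(type(df["t"].iloc[0]))                 # pandas Timestamp
print(type(df.astype(object)["t"].iloc[0]))  # datetime.datetime, per this thread
```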

@jreback
Contributor

jreback commented Sep 7, 2014

@jorisvandenbossche that is expected. The object dtype is an ndarray of datetime objects (I guess for compat reasons).

You might want to pre-convert any datetimes/timedeltas at the start (iow, separate the frame into various 'blocks'), which you then iterate over all together. Don't try to concat them (or they will be re-coerced).
And to be honest, you can simply do this for types that need NaN -> None as well, e.g. drop down to numpy object arrays (or rec-arrays). It might be a bit of work at first, but then you can easily do what you need quickly. I actually do this for PyTables, see here (and the next method, where I create a structured/rec array), and fill with already coerced values (e.g. datetime64[ns] values have already been tz-converted and are now int64, strings are already an appropriate dtype, etc.). A sketch of this pre-conversion approach follows below.
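A hedged sketch of the pre-conversion idea described above: split the frame into per-column arrays, coerce each to an object ndarray up front (datetime64 to datetime.datetime, NaN/NaT to None), and only then zip into rows for insertion. The helper name frame_to_rows is made up for illustration, and it handles only naive datetime64 columns:

```python
import numpy as np
import pandas as pd


def frame_to_rows(df):
    """Coerce each column to an object ndarray, then zip columns into rows."""
    columns = []
    for name in df.columns:
        col = df[name]
        if np.issubdtype(col.dtype, np.datetime64):
            # Convert the whole datetime block to datetime.datetime at once.
            values = np.array(col.dt.to_pydatetime(), dtype=object)
        else:
            values = col.to_numpy(dtype=object)
        # Replace NaN/NaT with None so drivers receive SQL NULL.
        values[pd.isna(col).to_numpy()] = None
        columns.append(values)
    return list(zip(*columns))
```

Each resulting row tuple then contains only types DBAPI drivers accept natively, so the insert loop needs no per-value checks.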

@jorisvandenbossche
Member Author

Superseded by #8208
