Update with compiling optimization and DT_UINT32 support for HDF5 #379
Conversation
For compilation optimization flags, the default (`-march=native`) optimizes the generated code for your machine's CPU type ([see here](https://www.tensorflow.org/install/source#configuration_options)).
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). 📝 Please visit https://cla.developers.google.com/ to sign. Once you've signed (or fixed any issues), please reply here.
CLAs look good, thanks!
I signed it!
LGTM. Thanks for the fix!
We already had some discussion about batch size and the overall column-based data (e.g., Parquet, Feather, HDF5) pipeline in #366 (comment)
Previously, batching was a limitation of the overall tf.data.Dataset pipeline, which generates records one by one. That is not an issue for large records such as image files, but it really slows everything down when each record is, say, one integer or one float32.
We added a batch concept in tensorflow-io to speed things up, but we were using the same batch as tf.keras, which is actually a different concept (number of samples).
My way of thinking is that we may want to:
1. read and process as much as possible in one chunk of big memory (if not the whole file) for each "batch process" in the tf.data pipeline;
2. then rebatch() to align with tf.keras' batch if needed (see the sketch below).

That likely needs some changes in the overall tf.data pipeline (or moving much of the logic out of it). With TF 2.0 I think the effort will be smaller.
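As a minimal sketch of that idea (the chunked generator and all sizes below are hypothetical stand-ins for a real columnar reader, not tensorflow-io's implementation, and `unbatch()` plus `batch()` stands in for a dedicated rebatch op):

```python
import numpy as np
import tensorflow as tf

# Hypothetical chunked reader: each call yields one large block of records
# (random uint32 data here stands in for a column read from an HDF5 file).
def read_chunks(num_chunks=4, chunk_size=65536):
    for _ in range(num_chunks):
        yield np.random.randint(0, 2**32, size=(chunk_size,), dtype=np.uint32)

# One dataset element per *chunk*, so I/O happens in a few large reads.
chunks = tf.data.Dataset.from_generator(
    read_chunks, output_types=tf.uint32, output_shapes=(65536,))

# Rebatch to the per-sample batch size tf.keras expects.
samples = chunks.unbatch()    # one element per record
batches = samples.batch(32)   # 32 samples per training step
```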
/cc @BryanCutler
Thanks for your review and detailed reply!
…nsorflow#379)
* Update README.md
* Update with compiling optimization: for compilation optimization flags, the default (`-march=native`) optimizes the generated code for your machine's CPU type ([see here](https://www.tensorflow.org/install/source#configuration_options))
* Add DT_UINT32 support
More than 50% acceleration can be achieved when reading compressed HDF5 files with the compiling optimization enabled, and even more with a large batch_size.
In addition, DT_UINT32 is now supported.
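For reference, a rough sketch of how this might look from the user side (the module path, `HDF5Dataset` arguments, and the file and column names are assumptions, not necessarily the exact tensorflow-io API; consult the project docs for the real signature):

```python
import tensorflow_io.hdf5 as hdf5_io  # assumed import path

# Read a uint32 column from a compressed HDF5 file. A large `batch`
# pulls many records per read, which is where the compile-time
# optimization pays off most.
dataset = hdf5_io.HDF5Dataset("data.h5", ["/my_uint32_column"], batch=8192)

for block in dataset:
    print(block.dtype, block.shape)  # expected: uint32 blocks of up to 8192 values
```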