Convert input/output in stats to be MB per iteration #178

joshuadeng · 2022-03-25T20:23:26Z

Summary:
Instead of naively showing input as just the pooling factor and output as just the embedding dimension
this diff changes planner stats to use the actual size of input & output in terms of megabytes per iteration

input: global_batch_size * pooling factor * sizeof(dtype of input)

output: global_batch_size * (output size (1 in pooled)) * sizeof(dytpe of emb) * emb_dim

This provides a sense of scale for data coming in and out, and additionally makes plans with multiple sharding types directly comparable.

Also fixes a bug with TWCW, we incorrectly specified the ranks as entire world size when it should be limited to the local world of the host that the parameter is sharded on.

Differential Revision: D35153224

Differential Revision: D34758134 fbshipit-source-id: dadbc62dc2fb92ff4c68eb6177c46df65d04982d

Summary: Instead of naively showing input as just the pooling factor and output as just the embedding dimension this diff changes planner stats to use the actual size of input & output in terms of megabytes per iteration **input**: global_batch_size * pooling factor * sizeof(dtype of input) **output**: global_batch_size * (output size (1 in pooled)) * sizeof(dytpe of emb) * emb_dim This provides a sense of scale for data coming in and out, and additionally makes plans with multiple sharding types directly comparable. Also fixes a bug with TWCW, we incorrectly specified the ranks as entire world size when it should be limited to the local world of the host that the parameter is sharded on. Differential Revision: D35153224 fbshipit-source-id: 531626ef9d1a6ddf8e9feb756f778d9aa2e406a5

facebook-github-bot · 2022-03-25T20:23:41Z

This pull request was exported from Phabricator. Differential Revision: D35153224

joshuadeng added 2 commits March 25, 2022 13:22

Convert perf units to milliseconds

b61c5c6

Differential Revision: D34758134 fbshipit-source-id: dadbc62dc2fb92ff4c68eb6177c46df65d04982d

facebook-github-bot added CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported labels Mar 25, 2022

facebook-github-bot closed this in b121fab Mar 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Convert input/output in stats to be MB per iteration #178

Convert input/output in stats to be MB per iteration #178

Uh oh!

joshuadeng commented Mar 25, 2022

Uh oh!

facebook-github-bot commented Mar 25, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Convert input/output in stats to be MB per iteration #178

Convert input/output in stats to be MB per iteration #178

Uh oh!

Conversation

joshuadeng commented Mar 25, 2022

Uh oh!

facebook-github-bot commented Mar 25, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants