Skip to content

Conversation

@joshuadeng
Copy link
Contributor

Summary:
Instead of naively showing input as just the pooling factor and output as just the embedding dimension
this diff changes planner stats to use the actual size of input & output in terms of megabytes per iteration

input: global_batch_size * pooling factor * sizeof(dtype of input)

output: global_batch_size * (output size (1 in pooled)) * sizeof(dytpe of emb) * emb_dim

This provides a sense of scale for data coming in and out, and additionally makes plans with multiple sharding types directly comparable.

Also fixes a bug with TWCW, we incorrectly specified the ranks as entire world size when it should be limited to the local world of the host that the parameter is sharded on.

Differential Revision: D35153224

Differential Revision: D34758134

fbshipit-source-id: dadbc62dc2fb92ff4c68eb6177c46df65d04982d
Summary:
Instead of naively showing input as just the pooling factor and output as just the embedding dimension
this diff changes planner stats to use the actual size of input & output in terms of megabytes per iteration

**input**: global_batch_size * pooling factor * sizeof(dtype of input)

**output**: global_batch_size * (output size (1 in pooled)) * sizeof(dytpe of emb) * emb_dim

This provides a sense of scale for data coming in and out, and additionally makes plans with multiple sharding types directly comparable.

Also fixes a bug with TWCW, we incorrectly specified the ranks as entire world size when it should be limited to the local world of the  host that the parameter is sharded on.

Differential Revision: D35153224

fbshipit-source-id: 531626ef9d1a6ddf8e9feb756f778d9aa2e406a5
@facebook-github-bot facebook-github-bot added CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported labels Mar 25, 2022
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D35153224

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants