Conversation

@shaltielshmid
Contributor

Following the discussion on #1126.

@shaltielshmid
Contributor Author

@NiklasGustafsson one thing that has been bothering me about the previous PR and this one, with regard to the toWillCopy() function.

While stepping through the to() call when writing this fix, I saw that most of the attributes are being moved twice: once during the default to() function, which copies all the parameters and buffers, and then again when the attributes themselves are handled. The reason is that after they move, they get a CUDA device index of 0 (the default index), whereas the default torch.CUDA has an index of -1.

The same thing would happen when calling a regular module.cuda().cuda(), if not for the memo fields (_deviceType, _deviceIndex).

Generally, deviceIndex = -1 means the default device. Is there any way to confirm what the default device actually is, to avoid all these extra copies?
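
For concreteness, here is a minimal sketch of the mismatch (assuming a CUDA-enabled build; the comparison at the end illustrates the behavior being described, it is not the actual toWillCopy() implementation):

```csharp
using System;
using TorchSharp;
using static TorchSharp.torch;

// torch.CUDA carries the sentinel index -1; after the move, the tensor
// reports the concrete index it actually landed on.
var t = zeros(3, 4).to(CUDA);

Console.WriteLine(t.device_index);  //  0 -- resolved index
Console.WriteLine(CUDA.index);      // -1 -- "default device" sentinel

// A naive index comparison therefore reports a pending move even though
// the tensor is already on the target device:
bool wouldMove = t.device_type != CUDA.type || t.device_index != CUDA.index;
Console.WriteLine(wouldMove);       // true
```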

@NiklasGustafsson
Contributor

Interesting. No, I haven't seen a way of enumerating the devices or getting metadata on them. -1 is supposed to imply the "best" available device, which isn't necessarily 0, if I understand the logic correctly. On my workstation, I have a P400 and a 2080 SUPER; -1 is supposed to pick the latter.

@shaltielshmid
Contributor Author

shaltielshmid commented Dec 11, 2023

Ah, I see.
So I guess there isn't anything to do except have it copy everything.
It essentially renders the toWillCopy() function useless.

@NiklasGustafsson
Contributor

> Ah, I see. So I guess there isn't anything to do except have it copy everything. It essentially renders the toWillCopy() function useless.

I don't think that's true. There's CPU to consider, too, and for type conversion, if the source and target types are the same...

@shaltielshmid
Contributor Author

Right.
I meant that when calling it with DeviceType.CUDA and index = -1, the function will always return true.
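
Something like this hypothetical stand-in (not the actual TorchSharp implementation) shows both points: the check still pays off for CPU, but degenerates for CUDA with index -1:

```csharp
using System;
using TorchSharp;
using static TorchSharp.torch;

// Hypothetical stand-in for the check under discussion, not the actual
// TorchSharp toWillCopy() implementation.
static bool WillCopy(Tensor t, Device target) =>
    t.device_type != target.type || t.device_index != target.index;

var cpu = zeros(2, 2);
Console.WriteLine(WillCopy(cpu, CPU));   // false -- CPU indices agree, the check helps

var gpu = cpu.to(CUDA);
Console.WriteLine(WillCopy(gpu, CUDA));  // true -- the resolved index (e.g. 0) never equals -1
```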

@shaltielshmid
Contributor Author

I browsed through the libtorch code, and I believe that for CUDA, index -1 gets converted into a real index here.

Do we want to add handling so that when .to() is called with CUDA and index = -1, we pull the index using that function and use it as the basis for checking whether parameters need to be moved?

Alternatively, since the parameters aren't actually being copied, it's not a huge performance issue, and we can just let it be.
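
A sketch of what that handling might look like; currentCudaDevice below is a hypothetical stand-in for a binding to that libtorch call, not an existing TorchSharp API:

```csharp
using System;
using TorchSharp;

// Resolve the -1 sentinel to a concrete index before the move check, so
// "already on that device" is detected correctly. The delegate stands in
// for a (hypothetical) binding to libtorch's current-device query.
static int ResolveIndex(torch.Device target, Func<int> currentCudaDevice) =>
    target.type == DeviceType.CUDA && target.index == -1
        ? currentCudaDevice()
        : target.index;
```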

@NiklasGustafsson
Contributor

> I browsed through the libtorch code, and I believe that for CUDA, index -1 gets converted into a real index here.
>
> Do we want to add handling so that when .to() is called with CUDA and index = -1, we pull the index using that function and use it as the basis for checking whether parameters need to be moved?
>
> Alternatively, since the parameters aren't actually being copied, it's not a huge performance issue, and we can just let it be.

I'm all for following the PyTorch behavior as closely as possible, everywhere, with one exception: if a functionally equivalent alternative is higher-performance, then go with the higher-performance option.

@shaltielshmid
Contributor Author

shaltielshmid commented Dec 12, 2023

The PyTorch behavior is to not allow a CUDA device index of -1, so this would be more of an enhancement.
PyTorch always calls the .to() function.

@shaltielshmid
Contributor Author

For now, the behavior is the same as PyTorch's, so I think it's fine to leave it.
But it's worth pondering for the future whether there's a way to know.
The performance gain wouldn't be significant anyway, since the tensors aren't copied; only a new C# object is created.

@NiklasGustafsson merged commit 6d4d20e into dotnet:main Dec 12, 2023
@shaltielshmid deleted the jit-attributes-to branch December 12, 2023 23:15