-
Notifications
You must be signed in to change notification settings - Fork 368
Closed
Labels
Description
Float8 was moved to torchao in #551, and currently the CI that we have for float8 is running on:
a. CPU nightly (skips all cuda related tests)
b. CUDA nightly (skips all cuda related tests which require torch._scaled_mm, because the default machines used for this do not have a high enough CUDA capability version.
We should enable float8 CI on sm89 machines, which have cuda capability 8.9. The performance will not be representative, but we can at least test correctness.
Pointers:
- Try running some periodic jobs on L4 pytorch#129608 is an example of adding github CI worklows on sm89
- float8 tests currently test for H100s with capability 9.0 (), we should update that everywhere to test for capability 8.9, something like below:
Line 57 in 00b76c4
is_H100 = torch.cuda.is_available() and torch.cuda.get_device_capability() >= (9, 0)
# old
is_H100 = torch.cuda.is_available() and torch.cuda.get_device_capability() >= (9, 0)
# new
is_cuda_8_9 = torch.cuda.is_available() and torch.cuda.get_device_capability() >= (8, 9)