-
Notifications
You must be signed in to change notification settings - Fork 7.2k
Closed
Description
Multi-channel images are common in the fields of satellite remote sensing (GIS) and in medical imaging. Many of the multispectral satellites I work with in my research have 8+ spectral bands, not just RGB. Some hyperspectral satellites have as many as 136 spectral bands. Microscopy often involves 4+ channel images as well.
Currently, torchvision relies on the Python Imaging Library (PIL) for all of its transforms. Unfortunately, pillow does not support multi-channel images: python-pillow/Pillow#3160, python-pillow/Pillow#1888
The way I see it, researchers such as myself have 3 options:
- Fix PIL (unlikely, as Add support for high bit depth multichannel images python-pillow/Pillow#1888 has been stalled for 3 years now)
- Fix torchvision (this would involve implementing all of the current transforms by hand using torch Tensors or numpy arrays)
- Fork torchvision (write all of the transforms we need ourselves)
I'm about to resort to 3 for my research. Do you have any suggestions for users like me?
sibocw, ekagra-ranjan, mdfirman, songkq, youweiliang and 9 more
Metadata
Metadata
Assignees
Labels
No labels