Skip to content

Conversation

@jsquyres
Copy link
Member

Add logic to handle different architectural capabilities
Detect the compiler flags necessary to build specialized
versions of the MPI_OP. Once the different flavors (AVX512,
AVX2, AVX) are built, detect at runtime which is the best
match with the current processor capabilities.

Add validation checks for loadu 256 and 512 bits.
Add validation tests for MPI_Op.

Signed-off-by: Jeff Squyres [email protected]
Signed-off-by: Gilles Gouaillardet [email protected]
Signed-off-by: dongzhong [email protected]
Signed-off-by: George Bosilca [email protected]
(cherry picked from commit 14b3c70)

Add logic to handle different architectural capabilities
Detect the compiler flags necessary to build specialized
versions of the MPI_OP. Once the different flavors (AVX512,
AVX2, AVX) are built, detect at runtime which is the best
match with the current processor capabilities.

Add validation checks for loadu 256 and 512 bits.
Add validation tests for MPI_Op.

Signed-off-by: Jeff Squyres <[email protected]>
Signed-off-by: Gilles Gouaillardet <[email protected]>
Signed-off-by: dongzhong <[email protected]>
Signed-off-by: George Bosilca <[email protected]>
(cherry picked from commit 14b3c70)
@jsquyres jsquyres added this to the v4.1.0 milestone Jul 13, 2020
@jsquyres jsquyres requested a review from bosilca July 13, 2020 20:49
@jsquyres jsquyres changed the title Add supports for MPI_OP using AVX512, AVX2 and MMX v4.1.x:Add supports for MPI_OP using AVX512, AVX2 and MMX Jul 13, 2020
@jsquyres jsquyres changed the title v4.1.x:Add supports for MPI_OP using AVX512, AVX2 and MMX v4.1.x: Add supports for MPI_OP using AVX512, AVX2 and MMX Jul 13, 2020
@jsquyres jsquyres merged commit 91c28f1 into open-mpi:v4.1.x Jul 14, 2020
@jsquyres jsquyres deleted the pr/v4.1.x/avx-for-strength branch July 14, 2020 15:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants