@@ -20366,18 +20366,19 @@ The argument to this intrinsic must be a vector of floating-point values.
 Vector Partial Reduction Intrinsics
 -----------------------------------
 
-Partial reductions of vectors can be expressed using the following intrinsics.
-Each one reduces the concatenation of the two vector arguments down to the
-number of elements of the result vector type.
+Partial reductions of vectors can be expressed using the intrinsics described in
+this section. Each one reduces the concatenation of the two vector arguments
+down to the number of elements of the result vector type.
 
-Other than the reduction operator (e.g. add, fadd) the way in which the
+Other than the reduction operator (e.g. add, fadd), the way in which the
 concatenated arguments is reduced is entirely unspecified. By their nature these
-intrinsics are not expected to be useful in isolation but instead implement the
-first phase of an overall reduction operation.
+intrinsics are not expected to be useful in isolation but can instead be used to
+implement the first phase of an overall reduction operation.
 
 The typical use case is loop vectorization where reductions are split into an
 in-loop phase, where maintaining an unordered vector result is important for
-performance, and an out-of-loop phase to calculate the final scalar result.
+performance, and an out-of-loop phase is required to calculate the final scalar
+result.
 
 By avoiding the introduction of new ordering constraints, these intrinsics
 enhance the ability to leverage a target's accumulation instructions.
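
The two-phase split described in this hunk can be modelled outside of LLVM IR. The following is a minimal Python sketch, not the intrinsic itself: `partial_reduce_add` and `final_reduce` are hypothetical helper names, the strided lane assignment is only one of many permissible orderings (the text leaves the ordering unspecified), and the requirement that the input length be a multiple of the accumulator length is an assumption of the model.

```python
def partial_reduce_add(acc, vec):
    """Model ONE permissible lowering of a partial add reduction.

    Conceptually the concatenation of `acc` and `vec` is reduced down to
    len(acc) lanes. Which input element lands in which lane is unspecified
    by the text; the strided choice here is purely illustrative.
    """
    n = len(acc)
    # Assumption of this model: the wide input splits evenly across the lanes.
    if n == 0 or len(vec) % n != 0:
        raise ValueError("input length must be a positive multiple of the accumulator length")
    out = list(acc)
    for i, x in enumerate(vec):
        out[i % n] += x  # fold element i into lane i mod n
    return out


def final_reduce(v):
    """Out-of-loop phase: collapse the unordered vector result to a scalar."""
    return sum(v)


# In-loop phase: keep a 4-lane accumulator while consuming 8-element chunks.
acc = [0, 0, 0, 0]
for chunk in ([1, 2, 3, 4, 5, 6, 7, 8], [9, 10, 11, 12, 13, 14, 15, 16]):
    acc = partial_reduce_add(acc, chunk)

# Out-of-loop phase: compute the final scalar result.
total = final_reduce(acc)
```

Only the final scalar is independent of the lane assignment in this integer model; the intermediate contents of `acc` would differ under another ordering, which is why the partial result is not meant to be inspected in isolation.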
@@ -20429,9 +20430,7 @@ Semantics:
 
 As the way in which the arguments to this floating-point intrinsic are reduced
 is unspecified, this intrinsic will assume floating-point reassociation and
-contraction, which may result in variations to the results due to reordering or
-by lowering to different instructions (including combining multiple instructions
-into a single one).
+contraction, which may result in variations to the results.
 
 '``llvm.vector.insert``' Intrinsic
 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^