@@ -20318,18 +20318,19 @@ The argument to this intrinsic must be a vector of floating-point values.
 Vector Partial Reduction Intrinsics
 -----------------------------------
 
-Partial reductions of vectors can be expressed using the following intrinsics.
-Each one reduces the concatenation of the two vector arguments down to the
-number of elements of the result vector type.
+Partial reductions of vectors can be expressed using the intrinsics described in
+this section. Each one reduces the concatenation of the two vector arguments
+down to the number of elements of the result vector type.
 
-Other than the reduction operator (e.g. add, fadd) the way in which the
+Other than the reduction operator (e.g. add, fadd), the way in which the
 concatenated arguments is reduced is entirely unspecified. By their nature these
-intrinsics are not expected to be useful in isolation but instead implement the
-first phase of an overall reduction operation.
+intrinsics are not expected to be useful in isolation but can instead be used to
+implement the first phase of an overall reduction operation.
 
 The typical use case is loop vectorization where reductions are split into an
 in-loop phase, where maintaining an unordered vector result is important for
-performance, and an out-of-loop phase to calculate the final scalar result.
+performance, and an out-of-loop phase is required to calculate the final scalar
+result.
 
 By avoiding the introduction of new ordering constraints, these intrinsics
 enhance the ability to leverage a target's accumulation instructions.
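
The two-phase reduction described in this hunk can be modelled with a small Python sketch. It is illustrative only and is not how LLVM lowers the intrinsics: the helper names `partial_reduce_lanewise` and `partial_reduce_adjacent` are hypothetical, and each one picks a single grouping out of the many that the "entirely unspecified" wording permits. Integer wrap-around is ignored.

```python
def partial_reduce_lanewise(acc, inp):
    """Hypothetical strategy: add element i of the concatenation into lane i % n."""
    n = len(acc)
    values = list(acc) + list(inp)
    assert len(values) % n == 0, "concatenation must be a multiple of the result length"
    out = [0] * n
    for i, v in enumerate(values):
        out[i % n] += v
    return out


def partial_reduce_adjacent(acc, inp):
    """Hypothetical strategy: sum groups of adjacent elements of the concatenation."""
    n = len(acc)
    values = list(acc) + list(inp)
    assert len(values) % n == 0, "concatenation must be a multiple of the result length"
    group = len(values) // n
    return [sum(values[i * group:(i + 1) * group]) for i in range(n)]


acc = [1, 2, 3, 4]                       # running in-loop accumulator
inp = [10, 20, 30, 40, 50, 60, 70, 80]   # wider vector produced by one iteration

lanewise = partial_reduce_lanewise(acc, inp)
adjacent = partial_reduce_adjacent(acc, inp)

# Out-of-loop phase: collapse the unordered vector result to the final scalar.
total_lanewise = sum(lanewise)
total_adjacent = sum(adjacent)
```

The two strategies produce different intermediate vectors, yet the out-of-loop phase yields the same scalar for both. This is why the partial result is not meaningful in isolation.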
@@ -20381,9 +20382,7 @@ Semantics:
 
 As the way in which the arguments to this floating-point intrinsic are reduced
 is unspecified, this intrinsic will assume floating-point reassociation and
-contraction, which may result in variations to the results due to reordering or
-by lowering to different instructions (including combining multiple instructions
-into a single one).
+contraction, which may result in variations to the results.
 
 '``llvm.vector.insert``' Intrinsic
 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
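
The floating-point note in the second hunk says that assumed reassociation "may result in variations to the results". A minimal Python sketch of that effect follows. It uses ordinary IEEE-754 double arithmetic rather than any LLVM API, and it shows reassociation only, not contraction.

```python
values = [0.1, 0.2, 0.3]

# Two legal groupings of the same floating-point sum.
left_to_right = (values[0] + values[1]) + values[2]
right_to_left = values[0] + (values[1] + values[2])

# The groupings disagree in the last bits of the result.
difference = abs(left_to_right - right_to_left)
```

Because the grouping used by the floating-point partial reduction is unspecified, code that relies on it should tolerate differences of this size.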