[X86] X86TargetLowering::computeKnownBitsForTargetNode - add X86ISD::VPMADD52L\H handling - again #159230

houngkoungting · 2025-09-17T02:36:39Z

FIX #155386

My LLVM version was too old, so I updated to a newer one.

@RKSimon

RKSimon

a few minors

RKSimon · 2025-09-17T08:18:01Z

llvm/test/CodeGen/X86/knownbits-vpmadd52.ll

@@ -0,0 +1,138 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 5
+; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512ifma,+avx512vl | FileCheck %s --check-prefixes=AVX512VL


we should add avxifma support, even if it means we drop the 512-bit test coverage

RKSimon · 2025-09-17T08:18:39Z

llvm/test/CodeGen/X86/knownbits-vpmadd52.ll

+  %r   = call <2 x i64> @llvm.x86.avx512.vpmadd52h.uq.128(
+             <2 x i64> <i64 1, i64 1>,           ; acc
+             <2 x i64> %mx,                      ; x (masked to 25-bit)
+             <2 x i64> %my)                      ; y (masked to 25-bit)


These per-operand comments really aren't necessary - a short single-line comment above the define, along with a descriptive function name, is all that is necessary.

RKSimon · 2025-09-17T08:22:14Z

llvm/test/CodeGen/X86/knownbits-vpmadd52.ll

+; AVX512VL-NEXT:    # xmm0 = mem[0,0]
+; AVX512VL-NEXT:    retq
+  %mx  = and <2 x i64> %x, <i64 33554431, i64 33554431>
+  %my  = and <2 x i64> %y, <i64 33554431, i64 33554431>


Try to use splat for uniform constants for brevity, and move the "25-bit/26-bit" descriptions out of the comments and into the IR next to the constants - nobody is ever going to go looking far for a description of a constant.

  %mx = and <2 x i64> %x, splat (i64 33554431) ; (1<<25)-1
  %my = and <2 x i64> %y, splat (i64 33554431) ; (1<<25)-1
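The reason the masked test can fold to a constant can be sketched with a scalar model of a single 64-bit lane. This is only an illustration, not the code added to `computeKnownBitsForTargetNode`: the helper names `madd52lo` and `madd52hi` are hypothetical, the lane semantics (multiply the low 52 bits of each operand, then add the low or high 52 bits of the 104-bit product to the accumulator) follow the documented behaviour of VPMADD52LUQ/VPMADD52HUQ as I understand it, and the sketch assumes a compiler with the GCC/Clang `unsigned __int128` extension.

```cpp
#include <cassert>
#include <cstdint>

// Mask selecting the low 52 bits of a 64-bit lane.
constexpr uint64_t kMask52 = (uint64_t{1} << 52) - 1;

// Hypothetical scalar model of one VPMADD52LUQ lane:
// acc + low 52 bits of (x[51:0] * y[51:0]).
constexpr uint64_t madd52lo(uint64_t acc, uint64_t x, uint64_t y) {
  unsigned __int128 prod =
      static_cast<unsigned __int128>(x & kMask52) * (y & kMask52);
  return acc + static_cast<uint64_t>(prod & kMask52);
}

// Hypothetical scalar model of one VPMADD52HUQ lane:
// acc + bits [103:52] of (x[51:0] * y[51:0]).
constexpr uint64_t madd52hi(uint64_t acc, uint64_t x, uint64_t y) {
  unsigned __int128 prod =
      static_cast<unsigned __int128>(x & kMask52) * (y & kMask52);
  return acc + static_cast<uint64_t>((prod >> 52) & kMask52);
}

// The mask used in the test really is (1 << 25) - 1.
static_assert((uint64_t{1} << 25) - 1 == 33554431, "25-bit mask constant");

// Two 25-bit operands give a product below 2^50, so the high 52 bits of the
// product are zero and the high-half instruction returns the accumulator.
static_assert(madd52hi(1, 33554431, 33554431) == 1,
              "high half is zero for 25-bit operands");
```

With both operands masked to 25 bits, the high-half result equals the accumulator for every input, which is consistent with the test above returning a constant. Once the product can reach bit 52 (for example two operands of `1 << 26`), the high half is no longer known to be zero.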

RKSimon

LGTM - cheers

houngkoungting and others added 19 commits August 6, 2025 16:20

80e303c [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-1
24287f7 [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits -2
c8cc2a9 [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits -3
1115256 [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-4
08138a2 [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-5
728b37d [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-6
44609a3 [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-7
2d268fc [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-8
32041fb [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-9
4e1af14 Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-10
c4ea7bd [DAG] Fold trunc(avg(x,y)) for avgceil/floor u/s nodes if they have sufficient leading zero/sign bits-11
6f84361 Merge branch 'main' into main
3729135 Merge branch 'llvm:main' into main
f85579b Merge branch 'llvm:main' into main
a7bbda8 Merge branch 'llvm:main' into main
12b64f6 Merge branch 'llvm:main' into main
fac54ff [X86] X86TargetLowering::computeKnownBitsForTargetNode - add X86ISD::VPMADD52L\H handling-1
c5100dc Remove unintended changes to DAGCombiner.cpp
380155d Merge branch 'main' into main

RKSimon self-requested a review September 17, 2025 08:16

RKSimon requested changes Sep 17, 2025

houngkoungting added 2 commits September 22, 2025 11:24

update test case

27f0f42

update test case: knownbits-vpmadd52.ll

efeb740

RKSimon approved these changes Sep 22, 2025

Merge branch 'main' into main

89555f8

RKSimon enabled auto-merge (squash) September 22, 2025 09:14

RKSimon merged commit dc6a915 into llvm:main Sep 22, 2025
9 checks passed
