merge bench code into benchgc #199


Merged
merged 40 commits into from
Sep 4, 2024

Conversation

@xurui1995 xurui1995 commented Jul 30, 2024

merge bench code into benchgc

  • add mode=P for performance testing
  • driver=pattern, case=mlp for mlp pattern
  • remove old bench code dir

issue: #172

@xurui1995 xurui1995 added the enhancement New feature or request label Jul 30, 2024
@xurui1995 xurui1995 self-assigned this Jul 30, 2024
@xurui1995 xurui1995 added the WIP work in progress label Jul 30, 2024
@xurui1995 xurui1995 force-pushed the xurui/merge_into_benchgc branch from ed55792 to c3c5441 Compare August 27, 2024 02:09
@xurui1995
Contributor Author

Added an MLP case to the correctness check script as an example; more will be added in the future.

#mlp
python3 -m benchgc --verbose 1  --driver pattern --case mlp --batch_size=32 --hidden_size_list=32x16x64 --has_bias=1x1 --act_type=noop --dtype=f32 
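As a rough illustration of what the `--hidden_size_list=32x16x64 --has_bias=1x1` flags describe, here is a minimal sketch of parsing them into per-layer specs. The helper name and the returned dict layout are hypothetical, not benchgc's actual code:

```python
def parse_mlp_flags(hidden_size_list: str, has_bias: str):
    """Turn 'x'-separated flag strings into per-layer (in, out, bias) specs.

    '32x16x64' describes two linear layers, 32->16 and 16->64;
    '1x1' enables bias on both. Hypothetical helper, not benchgc's code.
    """
    sizes = [int(s) for s in hidden_size_list.split("x")]
    bias = [b == "1" for b in has_bias.split("x")]
    assert len(bias) == len(sizes) - 1, "one bias flag per layer"
    return [
        {"in": sizes[i], "out": sizes[i + 1], "bias": bias[i]}
        for i in range(len(sizes) - 1)
    ]

layers = parse_mlp_flags("32x16x64", "1x1")
```

So the example command above benchmarks a two-layer MLP (32->16->64), both layers with bias, in f32 and with no activation.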

@xurui1995 xurui1995 requested a review from ZhennanQin September 2, 2024 05:00
@xurui1995
Contributor Author

@yifeizh2 @zhczhong @crazydemo @niuxiaog @BRUCE11111 The bench tool is now part of benchgc; please follow the new README to run benchmarks: https://github.com/intel/graph-compiler/blob/xurui/merge_into_benchgc/test/benchgc/README.md

Note: The current benchgc does not provide a DLTI attr for the MLIR module; that will be added in the next PR.

@xurui1995 xurui1995 requested a review from ciyongch September 3, 2024 05:57
* 3 : COMPARE_VERBOSE, + print threshold for comparison
* 4 : ERROR_OUTPUT_VERBOSE, + print all error data points if failed
* 5 : OUTPUT_VERBOSE, + print all result including passed tensor
* 6 : INPUT_VERBOSE, + print input torch tensors
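The levels above are cumulative: each one adds to what the previous level prints. A minimal sketch of how such levels could be modeled (the enum and helper names here are illustrative, not benchgc's actual implementation):

```python
from enum import IntEnum

class Verbose(IntEnum):
    """Cumulative verbosity levels, mirroring the list documented above.

    Each level includes everything printed by the levels below it.
    Illustrative sketch, not benchgc's actual enum.
    """
    COMPARE_VERBOSE = 3       # print thresholds used for comparison
    ERROR_OUTPUT_VERBOSE = 4  # + all error data points if failed
    OUTPUT_VERBOSE = 5        # + all results, including passed tensors
    INPUT_VERBOSE = 6         # + input torch tensors

def should_print_inputs(level: int) -> bool:
    # Inputs are only printed at the highest level
    return level >= Verbose.INPUT_VERBOSE
```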
Contributor

Do you think saving the tensor into a file will be better than printing them in the terminal?

Contributor Author

@xurui1995 xurui1995 Sep 3, 2024

I agree with you: if the tensor is large, dumping it to a file sounds better than printing it. The printing itself was not added by this PR; I only documented it in the README. I can discuss with @WangJialei-A, and maybe we could provide another option to dump. For this PR, let's keep the printing.

Contributor

@ciyongch @xurui1995 This part needs more discussion and careful design.

For debugging, we should have a more convenient and flexible way to inspect intermediate results.
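One possible shape for the "dump instead of print" idea discussed here, sketched with the standard library only (the helper name, JSON format, and threshold are assumptions, not anything this PR implements):

```python
import json
from pathlib import Path

def report_tensor(name: str, values: list, max_print: int = 16,
                  dump_dir: str = ".") -> str:
    """Print small tensors; dump large ones to a JSON file.

    Hypothetical sketch of the suggestion above: keep terminal output
    readable by writing big result tensors to disk. Returns the dump
    path, or '' if the tensor was printed inline.
    """
    if len(values) <= max_print:
        print(f"{name}: {values}")
        return ""
    path = Path(dump_dir) / f"{name}.json"
    path.write_text(json.dumps(values))
    print(f"{name}: {len(values)} elements dumped to {path}")
    return str(path)
```

A real implementation would likely use torch's own serialization for tensors; JSON is used here only to keep the sketch dependency-free.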

Comment on lines 145 to 150
assert (
len(module.operation.regions) == 1
), "Expected kernel module to have only one region"
assert (
len(module.operation.regions[0].blocks) == 1
), "Expected kernel module to have only one block"
Contributor

Why do we have such a limitation?

Contributor Author

Removed now. Here I try to find the entry among the top-level functions of a module; in practice, I have not seen a module with more than one region or block.

Contributor

Why did you remove the original get_entry?

Contributor Author

They are almost the same. With the original get_entry, you had to pass the entry name as '"entry"', which is a bit annoying. In addition, a function that looks up a FuncOp by name also covers the entry-only case.
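The lookup-by-name approach described here can be sketched as follows. The dataclasses stand in for the MLIR Python bindings' module/function objects, and the function name is hypothetical; only the control flow reflects the idea in the comment:

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class FuncOp:
    """Stand-in for an MLIR func.FuncOp with a symbol name."""
    name: str

@dataclass
class Module:
    """Stand-in for an MLIR module's list of top-level operations."""
    ops: List[FuncOp] = field(default_factory=list)

def get_func_by_name(module: Module, name: str = "entry") -> FuncOp:
    """Return the top-level function with the given symbol name.

    Generalizes the old get_entry(): any function can be looked up,
    with 'entry' as the default, so call sites no longer need to pass
    the quoted '"entry"' literal. Hypothetical sketch, not the PR code.
    """
    for op in module.ops:
        if op.name == name:
            return op
    raise ValueError(f"no function named {name!r} in module")
```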

Contributor

@ciyongch ciyongch left a comment

LGTM, with a minor comment.

arg.fill_param = [
"matmul",
"wei",
arglist[0].dtype,
Contributor

Shall we use a different index here for wei?

Contributor Author

Thank you for pointing this out. Previously, I tried to reuse a single matmul op's filling strategy in the MLP, but I ran into some issues. I have now aligned the MLP's filling and comparison strategies with the MLP validation script in GC v1. A separate PR may be proposed later to optimize the MLP's filling.
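To illustrate why per-argument fill parameters like `["matmul", "wei", dtype, ...]` matter, here is a dependency-free sketch of a fill strategy seeded by the (op, argument) pair, so `src` and `wei` get different but reproducible data. The function name and seeding scheme are assumptions, not GC v1's actual strategy:

```python
import random
import zlib

def fill_tensor(op: str, arg: str, size: int, seed_base: int = 0):
    """Deterministically fill an argument with op/arg-specific data.

    Seeding the RNG from (op, arg) keeps fills reproducible across runs
    while giving each argument (e.g. 'src' vs 'wei') distinct values.
    Hypothetical sketch, not benchgc's actual filling code.
    """
    seed = zlib.crc32(f"{op}:{arg}".encode()) ^ seed_base
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(size)]
```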

@xurui1995 xurui1995 merged commit 0956de2 into main Sep 4, 2024
6 checks passed
Labels
enhancement New feature or request ready to review
Development

Successfully merging this pull request may close these issues.

Migrate benchmark code into benchgc
4 participants