
Conversation

@yuwenzho
Contributor

Type of Change

Example

Description

Update the ONNXRT example for the new API.

JIRA ticket: ILITV-2468

How has this PR been tested?

Extension test on ONNX models.

Dependency Change?

No

@yuwenzho yuwenzho marked this pull request as ready for review December 7, 2022 03:28
@yuwenzho
Contributor Author

yuwenzho commented Dec 9, 2022

Hi @chensuyue, the PR is ready for the extension test.

@chensuyue
Contributor

Extension test:

  1. Please check the tuning regression.
  2. Please check the benchmark.sh API gap.

@yuwenzho yuwenzho force-pushed the new_api_onnx_example branch from d0882e3 to 15c4863 Compare December 16, 2022 03:03
@yuwenzho
Contributor Author

@chensuyue extension test:
https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3784/artifact/report.html

The performance regression is caused by switching the performance dataset from the dummy dataset to the real dataset.

@chensuyue
Contributor

Please run the extension test for the other examples as well.

@yuwenzho
Contributor Author

https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3877/
Note: the object detection models need new quantization recipe support from the Strategy team and may not pass the extension test yet.

@yuwenzho
Contributor Author


The NLP models failed due to some typos and non-working code changes.
Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3883/

@yuwenzho
Contributor Author


Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3890/
yolov3, yolov4 and tiny_yolov3 will not be enabled in this version because 'onnxrt.graph_optimization.level' is not supported yet.

@yuwenzho
Contributor Author


  1. ssd-12, ssd-12_qdq, faster_rcnn, faster_rcnn_qdq, mask_rcnn and mask_rcnn_qdq will be re-enabled in 2.1 once 'onnxrt.graph_optimization.level' and the quantization recipe are supported. Please ignore them in the extension test.
  2. The Hugging Face model failed with the error: "setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (4,) + inhomogeneous part.", which is caused by a NumPy version update (tracked in a separate issue).

Update:

  • Removed the ssd, faster_rcnn and mask_rcnn models
  • Updated the model config JSON
  • Pinned numpy==1.23.5 in requirements.txt for the Hugging Face model

Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3909/
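For context on the pinned NumPy version: the "inhomogeneous shape" error above comes from building an array out of ragged nested sequences, which NumPy 1.24+ rejects with a ValueError instead of the deprecation warning older releases emitted. A minimal reproduction (variable names are illustrative, not from the example code):

```python
import numpy as np

ragged = [[1, 2, 3], [4, 5]]  # rows of different lengths

try:
    np.array(ragged)  # NumPy >= 1.24 raises ValueError ("inhomogeneous shape")
except ValueError:
    pass  # NumPy 1.23.x only emitted a VisibleDeprecationWarning here

# Explicitly requesting an object array works across NumPy versions:
arr = np.array(ragged, dtype=object)
print(arr.shape)  # (2,)
```

Pinning numpy==1.23.5 sidesteps the hard error until the data pipeline produces uniformly shaped inputs.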

@yuwenzho
Contributor Author

Passed: bert_squad_model_zoo_dynamic, mobilebert_squad_mlperf_dynamic, mobilebert_squad_mlperf_qdq, duc, BiDAF_dynamic and the Hugging Face question answering models.

Failed: gpt2_lm_head_wikitext_model_zoo_dynamic and the Hugging Face text classification models. Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3913/

@yuwenzho
Contributor Author

Passed:
bert_squad_model_zoo_dynamic, mobilebert_squad_mlperf_dynamic, mobilebert_squad_mlperf_qdq, duc, BiDAF_dynamic and the Hugging Face question answering models: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3908/artifact/report.html
gpt2_lm_head_wikitext_model_zoo_dynamic and the Hugging Face text classification models: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3919/artifact/report.html

@mengniwang95 mengniwang95 mentioned this pull request Dec 26, 2022
@chensuyue chensuyue merged commit 97c8e3b into master Dec 27, 2022
@chensuyue chensuyue deleted the new_api_onnx_example branch December 27, 2022 01:50
VincyZhang pushed a commit that referenced this pull request Feb 12, 2023
* SparseLib add vtune support

refine doc about profiling
yiliu30 added a commit that referenced this pull request Apr 5, 2025
Building on the vllm WoQ path, this PR adds support for re-quantizing FP8 weights w/ per-tensor or per-channel scaling.

---------

Co-authored-by: Yi Liu <[email protected]>
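As a rough illustration of the per-tensor vs. per-channel scaling mentioned in the commit message above, here is a hedged NumPy sketch of how such scales could be computed; the function and constant names are hypothetical, not taken from the PR:

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3

def quant_scales(w: np.ndarray, per_channel: bool) -> np.ndarray:
    """Compute scales so that w / scale fits within the FP8 E4M3 range."""
    if per_channel:
        # one scale per output channel (row-wise absolute maximum)
        amax = np.abs(w).max(axis=1, keepdims=True)
    else:
        # a single scale for the whole weight tensor
        amax = np.abs(w).max()
    return amax / FP8_E4M3_MAX

w = np.array([[0.5, -2.0], [4.0, 1.0]], dtype=np.float32)
print(quant_scales(w, per_channel=False))  # scalar scale from the global amax
print(quant_scales(w, per_channel=True))   # shape (2, 1), one scale per row
```

Per-channel scaling preserves more precision for rows with small magnitudes at the cost of storing one scale per channel, which is the usual trade-off behind offering both modes.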
mengniwang95 pushed a commit that referenced this pull request Apr 15, 2025
xin3he pushed a commit that referenced this pull request Apr 22, 2025
XuehaoSun pushed a commit that referenced this pull request May 13, 2025