New API ONNXRT example update #187
Signed-off-by: yuwenzho <[email protected]>
Hi @chensuyue, the PR is ready for the extension test.
Force-pushed: d0882e3 to 15c4863
@chensuyue Extension test: the performance regression is caused by switching the performance dataset from a dummy dataset to the real dataset.
Extension test for the other examples.
examples/onnxrt/object_detection/onnx_model_zoo/tiny_yolov3/quantization/ptq/main.py
https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3877/
The NLP models failed because of some typos and code changes that did not work.
Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3890/
Update:
Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3909/
Passed: bert_squad_model_zoo_dynamic, mobilebert_squad_mlperf_dynamic, mobilebert_squad_mlperf_qdq, duc, BiDAF_dynamic, and the huggingface question answering models.
Failed: gpt2_lm_head_wikitext_model_zoo_dynamic and the huggingface text classification models.
Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3913/
passed:
* SparseLib add vtune support refine doc about profiling
Building on the vllm WoQ path, this PR adds support for re-quantizing FP8 weights w/ per-tensor or per-channel scaling. --------- Co-authored-by: Yi Liu <[email protected]>

Type of Change
example
Description
Update the ONNXRT examples for the new API.
JIRA ticket: ILITV-2468
How has this PR been tested?
Extension test on the ONNX models.
Dependency Change?
no