Conversation

@derekxu (Contributor) commented Jan 6, 2025

Summary:
Add support to export XNNPACK-based static_llama

Differential Revision: D67867190


pytorch-bot bot commented Jan 6, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7535

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit 68298e1 with merge base 68c0208:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 6, 2025
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D67867190

derekxu pushed a commit to derekxu/executorch that referenced this pull request Jan 6, 2025
Summary:

Add support to export XNNPACK-based static_llama
- static_llama is the QNN-backend hybrid (prefill + decode) Llama model that takes the KV cache as an explicit inference input
  - https://www.internalfb.com/code/fbsource/fbcode/executorch/examples/qualcomm/oss_scripts/llama2/model/static_llama.py

Reviewed By: tarun292

Differential Revision: D67867190
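The commit description notes that static_llama takes the KV cache as an explicit inference input rather than as hidden module state, which keeps the exported graph's inputs and outputs fixed-shape. A minimal, stdlib-only sketch of that pattern follows; all names here (`decode_step`, `update_kv_cache`, the stand-in projections) are illustrative assumptions, not the actual static_llama API:

```python
# Hypothetical sketch: a "static" decode step where fixed-size KV buffers
# flow in as inputs and back out as outputs, so tracing/export sees no
# dynamic allocation or mutation of hidden state.

MAX_SEQ_LEN = 8  # fixed cache capacity chosen for this sketch


def update_kv_cache(k_cache, v_cache, pos, new_k, new_v):
    """Return new caches with the key/value written at position `pos`."""
    k_cache = list(k_cache)  # functional update: copy, don't mutate inputs
    v_cache = list(v_cache)
    k_cache[pos] = new_k
    v_cache[pos] = new_v
    return k_cache, v_cache


def decode_step(token, pos, k_cache, v_cache):
    """One decode step: caches are explicit inputs and explicit outputs."""
    new_k = token * 2  # stand-in for the real key projection
    new_v = token * 3  # stand-in for the real value projection
    k_cache, v_cache = update_kv_cache(k_cache, v_cache, pos, new_k, new_v)
    logits = sum(k_cache)  # stand-in for attention + output head
    return logits, k_cache, v_cache


# The caller owns the buffers and threads them through each step.
k = [0] * MAX_SEQ_LEN
v = [0] * MAX_SEQ_LEN
out, k, v = decode_step(5, 0, k, v)
```

Because the caches appear in the signature, the same structure can serve both prefill (writing a range of positions) and decode (writing one position per step), which is what the hybrid prefill+decode model relies on.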


derekxu commented Jan 6, 2025

@pytorchbot label "topic: not user facing"

derekxu pushed a commit to derekxu/executorch that referenced this pull request Jan 6, 2025


@facebook-github-bot facebook-github-bot merged commit a29dc49 into pytorch:main Jan 7, 2025
46 checks passed

Labels

CLA Signed, fb-exported, topic: not user facing
