Qualcomm AI Engine Direct - Optimize performance of pcq embedding by shewu-quic · Pull Request #20686 · pytorch/executorch

shewu-quic · 2026-07-02T03:55:25Z

Summary:

Change the per-channel quantized (PCQ) embedding pattern so that the backend can optimize it.
- Note: this pattern is supported after QNN 2.48.
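
For readers unfamiliar with the term, the sketch below illustrates what a per-channel quantized embedding computes in general. It is not the pattern this PR emits and it does not use any QNN or ExecuTorch API. The helper names `quantize_per_row` and `embedding_lookup`, the symmetric int8 scheme, and the choice of one scale per embedding row are all assumptions made for illustration.

```python
# Illustrative only: per-channel (assumed here to mean per-row) symmetric int8
# quantization of an embedding table, and a lookup that dequantizes the rows
# it gathers. All names are hypothetical and unrelated to the QNN backend.

def quantize_per_row(table, qmax=127):
    """Quantize each float row with its own scale; return (int rows, scales)."""
    q_rows, scales = [], []
    for row in table:
        max_abs = max(abs(v) for v in row)
        # An all-zero row gets a scale of 1.0 to avoid dividing by zero.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        q_rows.append([max(-qmax, min(qmax, round(v / scale))) for v in row])
        scales.append(scale)
    return q_rows, scales


def embedding_lookup(q_rows, scales, indices):
    """Gather quantized rows by index, then dequantize each with its row scale."""
    return [[q * scales[i] for q in q_rows[i]] for i in indices]


table = [
    [0.5, -1.0, 0.25],
    [2.0, 0.0, -4.0],
    [0.0, 0.0, 0.0],
]
q_rows, scales = quantize_per_row(table)
out = embedding_lookup(q_rows, scales, [1, 0])
```

Because every row carries its own scale, a row with small values keeps more resolution than it would under a single table-wide scale; the dequantized values differ from the originals by at most half of that row's scale.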

Test plan

python3 backends/qualcomm/tests/test_qnn_delegate.py TestQNNQuantizedOperator.test_qnn_backend_embedding_per_channel --build_folder build-android --host {HOST} --device {DEVICE} --soc_model SM8850 -a {ARTIFACTS}

pytorch-bot · 2026-07-02T03:55:30Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20686

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 11bd40c with merge base 71a80d7:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-07-02T03:56:20Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


shewu-quic · 2026-07-02T04:12:56Z

Hi @psiddh,
This PR optimizes the performance of PCQ embedding on HTP.
Could you please have a look?
Thanks,
Hutton

shewu-quic requested review from abhinaykukkadapu and psiddh as code owners July 2, 2026 03:55

meta-cla Bot added the CLA Signed label Jul 2, 2026

Qualcomm AI Engine Direct - Optimize performance of pcq embedding

11bd40c

Summary: - Change pcq embedding pattern for backend optimization

shewu-quic force-pushed the dev1/hutton/optimize_pcq_embedding branch from 231b11e to 11bd40c July 2, 2026 04:05

shewu-quic wants to merge 1 commit into pytorch:main from CodeLinaro:dev1/hutton/optimize_pcq_embedding
