[QNN-EP] Parallelize graph finalization #27629

Draft
quic-calvnguy wants to merge 1 commit into microsoft:main from CodeLinaro:dev/quic_calvnguy/parallel_graph_prepare

Description

Currently, graph compilation for each fused node is done sequentially. However, the QNN finalization portion of that work can be done asynchronously.

Motivation and Context

Sequential graph finalization increases the total graph compilation time. Finalizing graphs asynchronously is expected to reduce the total time for graph compilation and ORT session creation.

Make graph finalization asynchronous
Add threadpool, set default to 2
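
For readers skimming the thread, here is a minimal sketch of the idea, not the code in this PR. It assumes each fused node yields an independent, blocking finalize step, and it runs those steps on a small pool of worker threads (2 by default, mirroring the commit above). `GraphJob` and `FinalizeAll` are hypothetical names used only for illustration; the actual QNN finalize call and the ORT thread pool are not shown.

```cpp
#include <algorithm>
#include <atomic>
#include <cstddef>
#include <functional>
#include <string>
#include <thread>
#include <vector>

// Hypothetical stand-in for one fused node's QNN graph (not an ORT type).
struct GraphJob {
  std::string name;
  std::function<bool()> finalize;  // placeholder for the blocking finalize step
};

// Runs every job's finalize step on at most `num_threads` workers and
// returns one success flag per job, in input order.
std::vector<bool> FinalizeAll(const std::vector<GraphJob>& jobs,
                              std::size_t num_threads = 2) {
  // One byte per job so workers never write to the same memory location
  // (std::vector<bool> packs bits and would not be safe here).
  std::vector<char> ok(jobs.size(), 0);
  std::atomic<std::size_t> next{0};

  // Each worker repeatedly claims the next unprocessed job index.
  auto worker = [&]() {
    for (std::size_t i = next.fetch_add(1); i < jobs.size();
         i = next.fetch_add(1)) {
      try {
        ok[i] = jobs[i].finalize() ? 1 : 0;
      } catch (...) {
        ok[i] = 0;  // treat an exception as a failed finalization
      }
    }
  };

  if (num_threads == 0) num_threads = 1;
  const std::size_t n = std::min(num_threads, jobs.size());
  std::vector<std::thread> workers;
  workers.reserve(n);
  for (std::size_t t = 0; t < n; ++t) workers.emplace_back(worker);
  for (auto& w : workers) w.join();  // wait until every graph is finalized

  return std::vector<bool>(ok.begin(), ok.end());
}
```

A caller would build one `GraphJob` per fused node during compilation, call `FinalizeAll(jobs)` once all graphs are composed, and fail session creation if any flag is false. Claiming jobs through an atomic counter keeps the sketch short; a real implementation would more likely reuse an existing thread pool.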

@quic-calvnguy quic-calvnguy Mar 12, 2026

base branch may be behind by a commit or two. will rebase

This will be split up to .h/.cc once the PR is no longer a draft

changes here will be removed.
