
feat: 2.6.0 inference #174

Merged: calvinleng-science merged 20 commits into main from calvin/app-inference on Apr 24, 2026

Conversation

@calvinleng-science
Contributor

Summary

This introduces the deploy-model command, which lets users deploy either a .pt or an .onnx model onto the device.

If the user specifies --quantize and --input-list, the model can be used by the DSP runtime. To enable this, the user must supply the root directory of v2.34 of Qualcomm's QAIRT so that their model can be converted to .dlc; we cannot ship the toolkit ourselves, since Qualcomm does not allow redistribution of their software.

If the user does not specify --quantize and --input-list, the model is simply uploaded as .onnx (we convert .pt models to .onnx) and served with onnxruntime's CPU runtime.

Changes

  • Added a Docker container that the SDK directory gets mounted into to perform model conversion. This was needed because the conversion tooling requires a specific Python version.
  • Added model conversion scripts.
  • Added scripts to deploy the converted model to the target device via SFTP (see the sketch after this list).
  • Updated 'apps build' to also pull over the vcpkg .so's that users may have added to their vcpkg.json; without them, the app cannot resolve the symbols it needs at runtime. The example app now pulls scifi-headstage-shared-libraries, which contains all of the .so's already in our company-wide vcpkg, and 'apps build' uses it to determine which .so's NOT to pull. Copying those .so's onto the device would break updates of scifi-headstage-shared-libraries, since two .debs would then conflict over the same file paths (also sketched below).
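
A rough sketch of the SFTP push and the .so exclusion rule described above. All hostnames, paths, and directory layouts here are illustrative, not the actual script contents:

```bash
# Push the converted model to the device over SFTP (host and remote path are placeholders):
echo "put model.dlc /opt/models/" | sftp admin@headstage

# 'apps build' exclusion rule: bundle only the vcpkg .so's that
# scifi-headstage-shared-libraries does not already ship, so that two .debs
# never claim the same file path on the device.
provided=$(basename -a shared-libs/lib/*.so* 2>/dev/null)   # .so's the shared-libraries .deb owns
for so in vcpkg_installed/*/lib/*.so*; do
  if ! printf '%s\n' $provided | grep -qxF "$(basename "$so")"; then
    cp "$so" pkgroot/usr/lib/                               # app-specific: include in the app .deb
  fi
done
```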

Testing

Students at Neurohack 2026 were able to use our inference pipeline with Synapse Apps and the deploy-model utility. As for the DSP runtime, I have tested it with my own apps and confirmed that we can perform inference through it.

Example DSP runtime user-flow:

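A hedged sketch of what such a session might look like; the flag names come from this PR, while the entry point, argument order, model path, input list, QAIRT location, and device address are illustrative:

```bash
deploy-model ./model.pt \
  --quantize \
  --input-list ./calibration_inputs.txt \
  --snpe-root /opt/qcom/qairt/2.34 \
  --username admin \
  --uri <device-address>
```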

Example CPU runtime user-flow:
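A similarly hedged sketch for the CPU path; with no --quantize/--input-list, the model is uploaded as .onnx and served by onnxruntime:

```bash
deploy-model ./model.pt \
  --username admin \
  --uri <device-address>
```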

Calvin Leng and others added 20 commits April 8, 2026 15:43
Replace host-side SNPE converter invocation with a Docker-based
approach. The container (Python 3.10 + pinned deps) eliminates
Python version and numpy compatibility issues on the host.

- Add model-converter/ with Dockerfile and self-contained convert.py
- Rewrite onnx_to_dlc.py to orchestrate Docker (auto-builds image)
- Bind-mount SNPE SDK at runtime (Qualcomm license compliant)
- Add --snpe-root CLI arg to deploy-model
- Remove unused onnx_transforms.py (logic moved into container)
- Fix -u shorthand conflict between --username and global --uri
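
The docker invocation that onnx_to_dlc.py orchestrates has roughly this shape; the image tag, mount points, and converter arguments are assumptions, not the actual values:

```bash
# Build the pinned Python 3.10 converter image (auto-built on first use):
docker build -t model-converter model-converter/
# Run the conversion with the SNPE SDK bind-mounted read-only at runtime,
# so no Qualcomm software is ever baked into (or redistributed with) the image:
docker run --rm \
  -v "$PWD":/workspace \
  -v "$SNPE_ROOT":/snpe:ro \
  model-converter python convert.py /workspace/model.onnx -o /workspace/model.dlc
```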
…runtime, updated build to package onnxruntime into the deb
…in scifi-headstage-shared-libraries into the resulting app .deb, as that blocks installations of scifi-headstage-shared-libraries. Additionally, tap names will now wrap instead of truncating in the rich UI
calvinleng-science merged commit 4dad3dc into main on Apr 24, 2026
2 checks passed
calvinleng-science deleted the calvin/app-inference branch on April 24, 2026