Skip to content

feat: add PluggableDevice SE API and multi-TF build support#194

Merged
tngchien merged 16 commits into
MooreThreads:mainfrom
tngchien:pluggable_device
May 19, 2026
Merged

feat: add PluggableDevice SE API and multi-TF build support#194
tngchien merged 16 commits into
MooreThreads:mainfrom
tngchien:pluggable_device

Conversation

@tngchien
Copy link
Copy Markdown
Collaborator

No description provided.

@tngchien tngchien force-pushed the pluggable_device branch 2 times, most recently from ac5cae6 to bf3b3f3 Compare April 29, 2026 07:16
@tngchien tngchien force-pushed the pluggable_device branch 3 times, most recently from 30b72b8 to 05e4320 Compare May 12, 2026 10:36
@tngchien tngchien force-pushed the pluggable_device branch 2 times, most recently from 94570aa to 1251ffa Compare May 18, 2026 09:45
timo and others added 16 commits May 18, 2026 18:30
Resolve the CMake merge-conflict leftovers and apply key CODE_REVIEW fixes across
SE runtime, kernel guards, and variable op ordering to stabilize pluggable path behavior.
Also fix SE-only AddV2/compliance test regressions by allowing default-stream fast path
and isolating symbol export checks in subprocess, with tf261 validation tests passing.
Align the test with the optimizer's binary GraphDef dump default so it remains valid after rebasing onto main.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Keep failed SE deallocations non-destructive and expose safer device metadata so PluggableDevice diagnostics have stable state to inspect.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Require PluggableDevice runtime handles and streams before launching basic, metadata, and stream-backed MUSA operators, and cover them through installed-wheel import tests.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Migrate the remaining business kernel runtime paths away from legacy MusaDevice-only access and make the default operator runner validate the installed-wheel PluggableDevice path reliably.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Load custom op wrappers before PluggableDevice registration so installed-wheel tests can call tensorflow_musa.ops without falling back to skips.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Require an explicit muDNN handle before kernel code dereferences runtime handles so PluggableDevice failures surface early instead of falling through to sink handles.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Restore source file modes that were changed while applying the muDNN guard cleanup.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Restore fusion graph fixtures that were unrelated to the muDNN handle guard cleanup.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Align the plugin build, runtime compatibility shims, and operator tests with TensorFlow 2.15 so the installed wheel path can build and pass the full test runner.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Restore TensorList copy behavior and status compatibility after rebasing onto main, and trim public documentation for the open-source package.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@tngchien tngchien force-pushed the pluggable_device branch from 1251ffa to de6b28b Compare May 18, 2026 10:31
@tngchien tngchien merged commit a4e6597 into MooreThreads:main May 19, 2026
63 of 97 checks passed
@tngchien tngchien deleted the pluggable_device branch May 19, 2026 03:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant