Add support for building noarch kernels #319

danieldk · 2025-12-04T11:01:18Z

This change adds support for building noarch kernels. So far we have used the universal variant for kernels that do not have any AoT-compiled code. However, the universal variant has two important issues:

A kernel without AoT-compiled code might still be backend-specific. E.g. NVIDIA CuTe-based kernels are not universal in the sense that they don't work on non-NVIDIA GPUs.
We cannot specify dependencies per backend.

To solve these issues, we introduce the noarch variants to replace universal kernels. Noarch kernels have variants of the shape torch-<backend> (e.g. torch-xpu). This resolves the issues outlined.

To support noarch kernels, we update the build.toml format to v3, making the following changes:

general.universal is removed.
general.backends is introduced. This required option lists the backends that the kernel supports.
general.cuda-{minver,maxver} has been moved to the general.cuda section.

If a kernel supports backend X and has one or more kernel.* sections with backend = "X", then the kernel is an AoT-compiled kernel for that backend. Otherwise, it is a noarch kernel for that backend. Suppose that we have:

[general]
# ...
backends = ["cuda", "xpu"]
# ...

[kernel.mykernel]
backend = "xpu"
# ...

then the XPU kernel will be AoT-compiled (e.g. build/torch29-cxx11-xpu20252-x86_64-linux), whereas the CUDA kernel will be noarch (torch-cuda).
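The rule above can be illustrated with a minimal Rust sketch. This is not the build2cmake implementation: the `Backend` enum, `BuildKind`, and `classify` are hypothetical names introduced here, and the sketch assumes that the backends of all `kernel.*` sections have already been collected into a slice.

```rust
use std::collections::BTreeSet;

/// Backends taken from the noarch variant list in this PR (hypothetical enum).
#[allow(dead_code)]
#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord)]
enum Backend {
    Cpu,
    Cuda,
    Metal,
    Rocm,
    Xpu,
}

impl Backend {
    /// Backend name as it appears in `build.toml` and in `torch-<backend>`.
    fn as_str(self) -> &'static str {
        match self {
            Backend::Cpu => "cpu",
            Backend::Cuda => "cuda",
            Backend::Metal => "metal",
            Backend::Rocm => "rocm",
            Backend::Xpu => "xpu",
        }
    }
}

/// How a kernel is built for one backend (hypothetical type).
#[derive(Debug, PartialEq, Eq)]
enum BuildKind {
    /// At least one `kernel.*` section has this backend.
    Aot,
    /// No `kernel.*` section has this backend; the variant is `torch-<backend>`.
    Noarch { variant: String },
}

/// Classify every backend listed in `general.backends`.
/// `kernel_backends` holds the `backend` value of each `kernel.*` section.
#[allow(dead_code)]
fn classify(general_backends: &[Backend], kernel_backends: &[Backend]) -> Vec<(Backend, BuildKind)> {
    let compiled: BTreeSet<Backend> = kernel_backends.iter().copied().collect();
    general_backends
        .iter()
        .map(|&backend| {
            let kind = if compiled.contains(&backend) {
                BuildKind::Aot
            } else {
                BuildKind::Noarch {
                    variant: format!("torch-{}", backend.as_str()),
                }
            };
            (backend, kind)
        })
        .collect()
}
```

For the example above, `classify(&[Backend::Cuda, Backend::Xpu], &[Backend::Xpu])` yields a noarch entry with variant `torch-cuda` and an AoT entry for XPU. The sketch does not handle a `kernel.*` section whose backend is missing from `general.backends`; how that case is validated is not covered here.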

An older build.toml can be updated automatically with build2cmake update-build build.toml.

Universal kernels still exist in the project writers.

This still generates noarch variants with unnecessary bits (CUDA version, system, etc.).

danieldk · 2025-12-05T09:22:40Z

build2cmake/src/config/v3.rs

+pub struct General {
+    pub name: String,
+
+    pub backends: Vec<Backend>,


v3 is mostly a copy of v2, so I'll mark the changes here.

In the general section, universal was removed and backends was added.

danieldk · 2025-12-05T09:23:19Z

build2cmake/src/config/v3.rs

+    pub minver: Option<Version>,
+    pub maxver: Option<Version>,


cuda-minver and cuda-maxver have moved from general to this new general.cuda section.

danieldk · 2025-12-05T09:26:32Z

flake.nix

              pkgs.linkFarm "packages-for-cache" (
                map (buildSet: {
-                  name = buildName (buildSet.buildConfig);
+                  name = buildSet.torch.variant;


I have moved generation of variant strings torch-... to the Torch derivation itself. This makes it easier to access it from everywhere.

MekkCyber

Nice work! I didn’t follow every detail 😅, but the logic seems sound. Thank you!

MekkCyber · 2025-12-09T08:09:13Z

docs/build-variants.md

+- `torch-cpu`
+- `torch-cuda`
+- `torch-metal`
+- `torch-rocm`
+- `torch-xpu`


Can we add npu to the noarch variants?

MekkCyber · 2025-12-09T15:59:00Z

flake.nix

              pkgs.linkFarm "packages-for-cache" (
                map (buildSet: {
-                  name = buildName (buildSet.buildConfig);
+                  name = buildSet.torch.variant;


Yes makes sense

drbh

changes look good to me!

Small nit regarding the structure of the config module: currently, adding a new version requires changes to the previous version's files. It may be helpful to limit the version files to parsing TOML only, and to maintain a top-level config that is populated by the different versions.
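One way to read this suggestion is sketched below in Rust. It is an illustration only, not the structure of the actual config module: `V2General`, `V3General`, `V3Cuda`, and `Config` are hypothetical types trimmed to the name and the CUDA version bounds, `String` stands in for the real `Version` type, and treating `general.cuda` as optional is an assumption.

```rust
/// v2 on-disk shape: the CUDA bounds live directly in `general`.
struct V2General {
    name: String,
    cuda_minver: Option<String>,
    cuda_maxver: Option<String>,
}

/// v3 on-disk shape of the `general.cuda` section.
struct V3Cuda {
    minver: Option<String>,
    maxver: Option<String>,
}

/// v3 on-disk shape: the CUDA bounds moved into `general.cuda`.
struct V3General {
    name: String,
    cuda: Option<V3Cuda>,
}

/// Version-independent config that the rest of the tool would consume.
#[derive(Debug, PartialEq)]
struct Config {
    name: String,
    cuda_minver: Option<String>,
    cuda_maxver: Option<String>,
}

impl From<V2General> for Config {
    fn from(general: V2General) -> Self {
        Config {
            name: general.name,
            cuda_minver: general.cuda_minver,
            cuda_maxver: general.cuda_maxver,
        }
    }
}

impl From<V3General> for Config {
    fn from(general: V3General) -> Self {
        // A missing `general.cuda` section means no bounds were given.
        let (cuda_minver, cuda_maxver) = match general.cuda {
            Some(cuda) => (cuda.minver, cuda.maxver),
            None => (None, None),
        };
        Config {
            name: general.name,
            cuda_minver,
            cuda_maxver,
        }
    }
}
```

With this layout, adding a later format version would mean adding one more parsing struct and one more `From` implementation, while the existing version files stay untouched.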

danieldk added 22 commits December 5, 2025 07:59

Add v3 that is the same as v2 still

febe6f6

Move cuda-minver/maxver to general.cuda

12921f1

Remove universal from the v3 config.

264d17d

Universal kernels still exist in the project writers.

build2cmake: fix backend handling

333da80

Make update-build update to config v3

3f89453

Nix: work with new build.toml format changes

d843829

This still generates noarch variants with unnecessary bits (CUDA version, system, etc.).

Support no-arch variants

da53c14

Let Torch derivations create the variant names

72df979

Update build variants

67b01f0

Validate build.toml, recommend update when necessary

c99ee16

Update example kernels build.toml

f87f5fe

nix fmt

660571c

docs: update build.toml options for v3

c66e7f3

Rename silu-and-mul-universal kernel to silu-and-mul

8065db0

build2cmake: rename universal templates to noarch

44786f8

Move unused v2::Backend

4a83239

examples/silu_and_mul: fix

5d697ba

build2cmake: update flake.lock for newer Rust

dc4867e

CI: name silu-and-mul output changed

db454c9

Fix variant generation

5d497e1

nix fmt

b3a961b

Fixup CUDA minver/maxver in Nix and docs

01a0e2f

danieldk force-pushed the backend-noarch-kernels branch from 39adce0 to 01a0e2f Compare December 5, 2025 08:07

Update relu-torch-bounds build.toml

ca58719

danieldk marked this pull request as ready for review December 5, 2025 09:20

danieldk commented Dec 5, 2025

MekkCyber approved these changes Dec 9, 2025

drbh approved these changes Dec 9, 2025
