s390x: add `nnp-assist` intrinsics #1996

folkertdev · 2026-01-17T16:01:35Z

Because qemu does not support these (yet), I haven't added any runtime tests

rustbot · 2026-01-17T16:01:40Z

rustbot has assigned @sayantn.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

folkertdev

cc @uweigand

crates/core_arch/src/s390x/vector.rs

folkertdev · 2026-01-17T16:16:43Z

crates/core_arch/src/s390x/vector.rs

+    // On processors implementing the IBM z16 architecture, only the value 0 is supported.
+    static_assert_uimm_bits!(B, 0);
+
+    vclfnls(a, B)


Is this equivalent to

https://godbolt.org/z/dGaf4P7sa

Clearly that optimizes horribly at the moment. If the const value being 0 does the obvious thing, I believe all of these could be implemented in terms of simpler simd primitives.

if currently only 0 is supported, we can just use SIMD primitives, as the assertion will ensure no other value is passed

We could, though currently it seems unspecified what the conversion method is (I think there is only one implementation that actually makes sense, but then why is the IMM argument even there?).

Also currently the SIMD primitives don't optimize into the instruction that this intrinsic should map to.

The AI accelerator unit operates on its own private data types. In particular, it uses a 16-bit floating point type which is neither IEEE-16 nor bfloat16, but a proprietary format. In order to prepare input/output data to be used with the accelerator, applications need to convert standard (IEEE) data types to and from this private data type; for this purpose, the ISA provides these conversion instructions (mapped to compiler intrinsics).

In principle, the accelerator might support multiple different private data types, and the immediate operand of these intrinsics identifies which of those types the conversion should target. This is not specified by the ISA but may differ between processor generations. However, all current processors only support a single private data type, identified by the immediate value 0.

So in practice, the immediate will always be 0 today. I'm not convinced this ought to be enforced by the compiler - if a future processor adds a second type, it might be good if we could use the intrinsic without having to update the compiler.

Either way, whatever the immediate value is, there is no possibility to open-code the conversion with standard LLVM IR - the private floating-point format is unknown to LLVM! This absolutely has to map to the LLVM builtin (and thus the special instruction).

Good to know. I've changed the code to accept the full 0..=15 range there.

Because `qemu` does not support these (yet), I haven't added any runtime tests

folkertdev · 2026-01-21T09:55:48Z

@uweigand does this look good now?

uweigand · 2026-01-21T15:59:00Z

@uweigand does this look good now?

Looks good to me know. There is one question remaining in my mind: for vec_convert_to_fp16 and vec_convert_from_fp16 only one side is the proprietary type (represented as vector_signed_short), while the other side is actually a vector of standard IEEE 16-bit floats. This is also represented as vector_signed_short here, which follows the precedent set by GCC and clang.

That precedent was created at a time when we did not have any _Float16 support in those compilers - but now we do. So in theory we could be more precise and use a proper type here. But I guess this would mean that we'd have to introduce a new vector type as well (vector_float16 ? vector_half ?) Given that we do not actually have any other instructions operating on that type, not even basic arithmetic, in current processors, I'm not sure this makes sense.

folkertdev · 2026-01-21T16:34:08Z

I suspect these functions will continue to be unstable for a while, so we could change this later if vector_half actually gets more serious support.

uweigand · 2026-01-21T16:43:11Z

I suspect these functions will continue to be unstable for a while, so we could change this later if vector_half actually gets more serious support.

Sounds good to me, thanks.

rustbot assigned sayantn Jan 17, 2026

folkertdev mentioned this pull request Jan 17, 2026

Tracking Issue for stdarch_s390x rust-lang/rust#135681

Open

folkertdev commented Jan 17, 2026

View reviewed changes

folkertdev force-pushed the s390x-nnp-assist branch from cbff50d to ea90026 Compare January 17, 2026 16:18

s390x: add nnp-assist intrinsics

0d5c015

Because `qemu` does not support these (yet), I haven't added any runtime tests

folkertdev force-pushed the s390x-nnp-assist branch from ea90026 to 0d5c015 Compare January 19, 2026 13:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

s390x: add `nnp-assist` intrinsics #1996

s390x: add `nnp-assist` intrinsics #1996

Uh oh!

folkertdev commented Jan 17, 2026 •

edited

Loading

Uh oh!

rustbot commented Jan 17, 2026

Uh oh!

folkertdev left a comment

Uh oh!

Uh oh!

folkertdev Jan 17, 2026

Uh oh!

sayantn Jan 17, 2026

Uh oh!

folkertdev Jan 17, 2026

Uh oh!

uweigand Jan 19, 2026

Uh oh!

folkertdev Jan 19, 2026

Uh oh!

folkertdev commented Jan 21, 2026

Uh oh!

uweigand commented Jan 21, 2026

Uh oh!

folkertdev commented Jan 21, 2026

Uh oh!

uweigand commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

s390x: add nnp-assist intrinsics #1996

Are you sure you want to change the base?

s390x: add nnp-assist intrinsics #1996

Uh oh!

Conversation

folkertdev commented Jan 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rustbot commented Jan 17, 2026

Uh oh!

folkertdev left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

folkertdev Jan 17, 2026

Choose a reason for hiding this comment

Uh oh!

sayantn Jan 17, 2026

Choose a reason for hiding this comment

Uh oh!

folkertdev Jan 17, 2026

Choose a reason for hiding this comment

Uh oh!

uweigand Jan 19, 2026

Choose a reason for hiding this comment

Uh oh!

folkertdev Jan 19, 2026

Choose a reason for hiding this comment

Uh oh!

folkertdev commented Jan 21, 2026

Uh oh!

uweigand commented Jan 21, 2026

Uh oh!

folkertdev commented Jan 21, 2026

Uh oh!

uweigand commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

s390x: add `nnp-assist` intrinsics #1996

s390x: add `nnp-assist` intrinsics #1996

folkertdev commented Jan 17, 2026 •

edited

Loading