Add macro-controlled SIMD support (SSE/NEON) to math library by Copilot · Pull Request #64 · bluesky013/SkyEngine

Copilot · 2026-02-16T01:29:34Z

Adds compile-time SIMD acceleration for Vector4 and Matrix4 operations, with platform detection for x86 (SSE) and ARM (NEON) and scalar fallback.

Changes

cmake/options.cmake — SKY_MATH_SIMD option (ON by default)
cmake/configuration.cmake — propagates SKY_MATH_SIMD as compile definition
MathSimd.h — platform detection macros: SKY_SIMD_SSE, SKY_SIMD_NEON, SKY_SIMD_ENABLED
Vector4.h — __m128/float32x4_t as union member alongside float v[4]/{x,y,z,w}
Vector4.inl — SIMD paths for +=, -=, *=, /=, Dot, Normalize (with Newton-Raphson refined rsqrt)
Matrix4.inl — SIMD mat×mat and mat×vec multiply
test/core/MathTest.cpp — tests for Vector4 arithmetic, dot, negate, and Matrix4 multiply

Design

SIMD type lives in the Vector4 union for zero-cost access. Operations use direct intrinsics with three-way #if SSE / #elif NEON / #else branching:

inline Vector4& Vector4::operator+=(const Vector4& rhs) {
#if SKY_SIMD_SSE
    simd = _mm_add_ps(simd, rhs.simd);
#elif SKY_SIMD_NEON
    simd = vaddq_f32(simd, rhs.simd);
#else
    x += rhs.x; y += rhs.y; z += rhs.z; w += rhs.w;
#endif
    return *this;
}

Normalize uses _mm_rsqrt_ps / vrsqrteq_f32 with one Newton-Raphson iteration for precision.

💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

- Add SKY_MATH_SIMD CMake option (ON by default) in cmake/options.cmake - Add SKY_MATH_SIMD compile definition in cmake/configuration.cmake - Create MathSimd.h with platform detection for SSE (x86) and NEON (ARM) - Add __m128/float32x4_t SIMD member to Vector4 union with alignas(16) - Optimize Vector4 operators (+=, -=, *=, /=, Dot, Normalize) with SIMD - Optimize Matrix4 multiply (mat*mat, mat*vec) with SIMD - Add comprehensive math unit tests for Vector4 and Matrix4 operations Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

- Create SFloat4.h: platform-agnostic SIMD float4 type wrapping __m128 (SSE), float32x4_t (NEON), or scalar fallback with Load/Store/Set/Splat, arithmetic operators, HorizontalSum, HorizontalSumSplat, InvSqrt - Refactor Vector4.h: remove inline __m128/float32x4_t from union, include SFloat4.h instead of MathSimd.h directly - Refactor Vector4.inl: replace #if SSE/#elif NEON branches with unified #if SKY_SIMD_ENABLED using SFloat4 operations - Refactor Matrix4.inl: replace #if SSE/#elif NEON branches with unified #if SKY_SIMD_ENABLED using SFloat4 operations Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Restore the SIMD native type as a union member in Vector4 for direct access, avoiding unnecessary Load/Store memory round-trips. Vector4 operations now construct SFloat4 directly from the simd union member and write back via .value assignment. Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Remove SFloat4.h and replace all SFloat4 usage in Vector4.inl and Matrix4.inl with direct #if SSE / #elif NEON / #else SIMD intrinsics. Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Initial plan

32e1de9

Copilot AI assigned Copilot and bluesky013 Feb 16, 2026

Copilot started work on behalf of bluesky013 February 16, 2026 01:29 View session

Copilot AI and others added 4 commits February 16, 2026 01:34

Add Newton-Raphson refinement to SIMD normalize for improved precision

39d23cf

Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Final: SIMD math library support complete

8a65857

Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Remove codeql artifact and add to .gitignore

3d675e3

Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Improve math library to support SIMD with x86 and ARM~~ Add macro-controlled SIMD support (SSE/NEON) to math library Feb 16, 2026

Copilot AI requested a review from bluesky013 February 16, 2026 01:36

Copilot finished work on behalf of bluesky013 February 16, 2026 01:36

Copilot started work on behalf of bluesky013 February 16, 2026 01:57 View session

Copilot AI and others added 2 commits February 16, 2026 02:03

Improve scalar fallback readability in SFloat4.h

16599c6

Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Copilot AI changed the title ~~Add macro-controlled SIMD support (SSE/NEON) to math library~~ Extract SFloat4 SIMD abstraction from Vector4/Matrix4 Feb 16, 2026

Copilot finished work on behalf of bluesky013 February 16, 2026 02:06

Copilot started work on behalf of bluesky013 February 16, 2026 02:08 View session

Copilot AI changed the title ~~Extract SFloat4 SIMD abstraction from Vector4/Matrix4~~ Add macro-controlled SIMD support for math library (SSE/NEON) Feb 16, 2026

Copilot finished work on behalf of bluesky013 February 16, 2026 02:11

Copilot started work on behalf of bluesky013 February 16, 2026 03:06 View session

Delete SFloat4 abstraction layer, use raw SIMD intrinsics directly

f975638

Remove SFloat4.h and replace all SFloat4 usage in Vector4.inl and Matrix4.inl with direct #if SSE / #elif NEON / #else SIMD intrinsics. Co-authored-by: bluesky013 <35895395+bluesky013@users.noreply.github.com>

Copilot AI changed the title ~~Add macro-controlled SIMD support for math library (SSE/NEON)~~ Add macro-controlled SIMD support (SSE/NEON) to math library Feb 16, 2026

Copilot finished work on behalf of bluesky013 February 16, 2026 03:09

bluesky013 approved these changes Feb 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add macro-controlled SIMD support (SSE/NEON) to math library#64

Add macro-controlled SIMD support (SSE/NEON) to math library#64
Copilot wants to merge 9 commits intomainfrom
copilot/improve-math-library-simd-support

Copilot AI commented Feb 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Design

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Feb 16, 2026 •

edited

Loading