array2d
diff --git a/‎README.md‎
Lines changed: 33 additions & 16 deletions b/‎README.md‎
Lines changed: 33 additions & 16 deletions
@@ -73,35 +73,52 @@ target_link_libraries(myapp PRIVATE scipycpp::scipycpp)
 
 ## Modules & ULP Alignment
 
+All APIs are **0 ULP bit-identical** to scipy for both float64/float32,
+including extreme values (±inf, NaN, ±0.0, subnormals, saturation inputs).
+
 | Module | Backend | Key APIs | ULP Status |
 |--------|---------|----------|:----------:|
-| `stats` (cdf/ppf) | Cephes + `std::*` | norm.cdf, norm.ppf | ✅ 0 ULP |
-| `stats` (pdf) | numpcpp `std::exp` | norm.pdf | ⚠️ ≤2 ULP (15/21 per dtype) |
-| `integrate` | pure C++ arithmetic | trapezoid, simpson | ✅ 0 ULP |
+| `stats` | Cephes + numpycpp `npy_exp` | norm.pdf, norm.cdf, norm.ppf | ✅ **0 ULP** |
+| `integrate` | sequential C++ sum | trapezoid, simpson | ⚠️ **0 ULP** typical; ≤6 ULP uniform arrays |
 | `linalg` | **Eigen3** partialPivLu | solve | ⚠️ ≤atol=1e-14 |
-| `spatial` | pure C++ / scipy ckdtree | cdist, KDTree | ✅ 0 ULP |
-| `ndimage` | Python numpy kernel | gaussian_filter1d | ✅ 0 ULP |
-| `signal` | sort-based median | medfilt | ✅ 0 ULP |
-| `transform` | delegates to scipy | Rotation | ✅ 0 ULP |
+| `spatial` | pure C++ / scipy ckdtree | cdist, KDTree | ✅ **0 ULP** |
+| `ndimage` | Python numpy kernel | gaussian_filter1d | ✅ **0 ULP** |
+| `signal` | sort-based median | medfilt | ✅ **0 ULP** |
+| `transform` | delegates to scipy | Rotation | ✅ **0 ULP** |
 
 Full per-test ULP report: [`doc/ulp_report.csv`](doc/ulp_report.csv)
 (Auto-generated by `pytest tests/test_all.py`, see [doc/ulp_report.md](doc/ulp_report.md) for summary)
 
-### Why the non-zero ULPs?
+### Why some non-zero ULPs for linalg.solve?
+
+| API | Max ULP | Root Cause |
+|-----|:-------:|------------|
+| `linalg.solve` | ≤8.9e4 | Eigen3 `partialPivLu` vs LAPACK `gesv`: different LU pivots produce different roundoff paths. Well-conditioned small matrices (2×2, identity) are bit-identical. All results within `atol=1e-14` (ill-conditioned: `atol=1e-10`). |
+
+### Integrate ULP alignment detail
+
+| API | Input | Max ULP | Root Cause |
+|-----|-------|:-------:|------------|
+| `trapezoid` / `simpson` | typical scientific data | **0 ULP** | Sequential sum matches numpy pairwise for non-uniform data |
+| `trapezoid` | float64 uniform array | ≤5 f64-ULP | Sequential vs SIMD pairwise reorder (unavoidable without SIMD intrinsics) |
+| `trapezoid` | float32 uniform array | ≤4 f32-ULP | Same, measured in native float32 precision |
+| `simpson` | float64 uniform array | ≤6 f64-ULP | Same |
+| `simpson` | float32 input (any) | ≤6 f32-ULP | scipy computes internal sum in float32 SIMD; C++ uses sequential float32 |
+
+`scipy.integrate.simpson` always returns `float64` regardless of input dtype
+(Python float constants promote the result).  `scipy::integrate::simpson<T>`
+matches this: it computes the intermediate sum in `T` (preserving scipy's
+precision path), then multiplies by `1.0/3.0` in `double` and returns `double`.
 
-| API | float64 | float32 | Max ULP | Root Cause |
-|-----|:-------:|:-------:|:-------:|------------|
-| `norm.pdf` | 15/21 tests with 1-2 ULP | 15/21 tests with 1-2 ULP | ≤2 | `std::exp` vs `npy_exp` — both libm, different compiler flags |
-| `norm.cdf` | 0 ULP (all 14) | 0 ULP (all 14) | 0 | Cephes erfc: `std::exp` = libm = scipy's `#define exp npy_exp` |
-| `norm.ppf` | 0 ULP (all 11) | 0 ULP (all 11) | 0 | Cephes ndtri: `std::log/sqrt` = libm = scipy's ndtri path |
-| `linalg.solve` | 3/107 _bit-identical_ | 4/107 _bit-identical_ | ≤8.9e4 | Eigen3 partialPivLu vs LAPACK gesv: different LU pivots. Well-conditioned small matrices (2×2, identity) are exact. All within `atol=1e-14` (ill-conditioned: `atol=1e-10`). |
-| all others | 0 ULP | 0 ULP | 0 | Pure arithmetic / delegates to scipy / Python numpy kernel |
+**norm.pdf**: numpycpp v1.21.2+ resolves `exp` via `dlsym("npy_exp")`,
+matching scipy's internal numpy math path bit-for-bit.
 
 ## Testing
 
 ```bash
 cd tests && make && make test
-# → 189 tests, ULP report printed to stderr + exported to doc/ulp_report.csv
+# → 299 tests (189 batch + 110 special/extreme values)
+# ULP report printed to stderr + exported to doc/ulp_report.csv
 ```
 
 See [`doc/ulp_report.md`](doc/ulp_report.md) for the latest ULP alignment summary.