DSB fixes by ilia-kats · Pull Request #165 · scverse/muon

ilia-kats · 2025-08-14T13:46:15Z

make it match the R reference implementation
implement some options that are present in the R reference implementations but were missing here

ilan-gold

Small things, but nice1

muon/_prot/preproc.py

…verse#131)

…erse#42)

ilan-gold · 2025-10-22T08:56:14Z

muon/_prot/preproc.py


    if denoise_counts:
-        bgmeans = np.empty(cells_scaled.shape[0], np.float32)
+        bgmeans = np.empty(cells_scaled.shape[0], cells_scaled.dtype)


Why are we making the bgmeans the same dtype as the input? It looks like bgmeans is filled in with mean values - why not just set it to float64? It seems like you'd have the same type-loss problem as with ints then

At that point, cells_scaled will always be a float. If it's float32, then the GaussianMixture results will also be float32, so no need to waste memory by using a dtype that is too large.

Alternatively, I can move the castback to the very bottom of the function, so we do all our calculations in float64 and only cast back the result if the input was float32.

Nice, ok you're right!

ilan-gold · 2025-10-22T09:14:25Z

muon/_prot/preproc.py


    if denoise_counts:
-        bgmeans = np.empty(cells_scaled.shape[0], np.float32)
+        bgmeans = np.empty(cells_scaled.shape[0], cells_scaled.dtype)


Nice, ok you're right!

ilia-kats requested a review from gtca August 15, 2025 14:36

ilia-kats requested a review from ilan-gold October 9, 2025 16:09

ilan-gold reviewed Oct 10, 2025

View reviewed changes

muon/_prot/preproc.py Show resolved Hide resolved

muon/_prot/preproc.py Show resolved Hide resolved

ilan-gold reviewed Oct 21, 2025

View reviewed changes

muon/_prot/preproc.py Outdated Show resolved Hide resolved

ilia-kats added 3 commits October 21, 2025 15:16

DSB: fix float overflow for large datasets (closes scverse#130)

ca258a5

DSB: use ddof=1 in std calculation to match the R behavior (closes sc…

c948690

…verse#131)

DSB: implement scale_factor and quantile_clipping options (closes scv…

48a4ad2

…erse#42)

ilia-kats force-pushed the dsb_fixes branch from 862554d to 48a4ad2 Compare October 21, 2025 13:19

ilia-kats requested a review from ilan-gold October 22, 2025 08:49

ilan-gold reviewed Oct 22, 2025

View reviewed changes

ilan-gold approved these changes Oct 22, 2025

View reviewed changes

ilia-kats merged commit f03eb69 into scverse:main Oct 22, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DSB fixes#165

DSB fixes#165
ilia-kats merged 3 commits intoscverse:mainfrom
ilia-kats:dsb_fixes

ilia-kats commented Aug 14, 2025

Uh oh!

ilan-gold left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ilan-gold Oct 22, 2025

Uh oh!

ilia-kats Oct 22, 2025

Uh oh!

ilia-kats Oct 22, 2025

Uh oh!

ilan-gold Oct 22, 2025

Uh oh!

ilan-gold Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ilia-kats commented Aug 14, 2025

Uh oh!

ilan-gold left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ilan-gold Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

ilia-kats Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

ilia-kats Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

ilan-gold Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

ilan-gold Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants