update handling of global option of 'use_inf_as_na' by AlrauneZ · Pull Request #126 · MiBiPreT/mibiscreen

AlrauneZ · 2026-02-23T13:24:28Z

The global option 'use_inf_as_na' is deprecated and will be removed in a future version of pandas (from 3.0 onwards).

Thus, modify the function "filter_values()" in transformation.py to not need the global conversation and handling detection of 'inf' values (similar to NaN) differently.

…inf within function

JaroCamphuijsen

Great that you're fixing this, because it is breaking things across other pr's as well. Also nice that we are losing the pandas import and the pd.set_option call in the global scope of this module. It's always a bit tricky to stick things in there and might lead to unexpected behavior, e.g. if you're only using one of the functions in your script and not loading the whole module, it will not execute the global command pd.set_option.
There are some minor issues with the consistency of handling np.inf and its documentation.

JaroCamphuijsen · 2026-02-23T14:54:08Z

mibiscreen/analysis/reduction/transformation.py


    # If there are any rows containing NULL cells, the NULL values will be filtered
    if len(NaN_rows)>0:
        if replace_NaN == 'remove':


Note that here (in the case replace_NaN == 'remove') we drop rows that contain either an by pandas na identified value (as specified here: https://pandas.pydata.org/docs/reference/api/pandas.isna.html#pandas.isna) OR np.inf (both negative and positive). This is because we use the mask that we produce ourselves in line 66, to remove the rows, it includes the np.inf values so also removes these.

However in the following cases ("zero", float|int value, "average", or "median") we use a pandas function (pd.DataFrame.isna()) which only selects the NaN values as identified by pandas.isna and not the np.inf values.

This makes the behavior of this function somewhat unclear and unpredictable. Perhaps we should completely ignore the inf values, or if we want, we can include them but should do that persistently. If you think it is still useful and a much encountered value, you could consider adding another argument to the function (regard_inf_as_na) and just do the replacement (with a DataFrame.replace({np.inf : np.nan, -np.inf : np.nan}) at the start of the function.

Do you think that would be a good solution?

JaroCamphuijsen · 2026-02-23T14:59:26Z

mibiscreen/analysis/reduction/transformation.py

-pd.set_option('mode.use_inf_as_na', True)

 def filter_values(data_frame,
                  replace_NaN = 'remove',


I think that we could use a slight improvement in the wording of what this function does and make it more consistent throughout the function. Currently the way it handles np.inf values is not documented in the docstrings and it is not consistent for all values of the replace_NaN argument.

JaroCamphuijsen · 2026-02-23T15:00:45Z

mibiscreen/analysis/reduction/transformation.py

        data_frame : pd.dataframe
            Tabular data containing variables to be evaluated with standard
            column names and rows of sample data.
        replace_NaN : string or float, default "remove"


Could we make this all lowercase instead? So replace_nan. It makes the function easier to use.

JaroCamphuijsen · 2026-02-23T15:02:17Z

tests/test_transformation.py

+        """
+        data_mod = self.data.copy()
+        data_mod.iloc[2,25]=np.inf
+        data_filter = filter_values(data_mod,


This currently only works if you set the replace_NaN argument to 'remove'. In al the other cases it will not produce the expected result. Or it will actually, because the filtering of inf values is not described in the docstrings.

Perhaps add a test for handling inf values with the other cases as well if we decide to keep the np.inf handling.

sonarqubecloud · 2026-02-24T10:44:21Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

AlrauneZ · 2026-02-24T10:56:18Z

@jaro: good point. I tried first, but it did not work. Now, I managed to get it working with replacing inf by NaN with the command you suggest.

During adapting of the testing, I realized that it is good to include handling of 'inf' values more generally. I adapted the example data now also including one inf-value. This required some adaptions in the testing and in the functions calculating average concentrations and non-zero value counts.

remove use of global option of 'une_inf_as_na' and adapt handling of …

f9405e9

…inf within function

AlrauneZ self-assigned this Feb 23, 2026

AlrauneZ added the bug Something isn't working label Feb 23, 2026

AlrauneZ linked an issue Feb 23, 2026 that may be closed by this pull request

Update handling of use_inf_as_na #124

Open

AlrauneZ added this to MIBIREM Feb 23, 2026

AlrauneZ added 3 commits February 23, 2026 15:01

add test for correct handling of 'inf' values

7e2b906

adapt handling of 'inf'

01f0bb5

linting

cef366d

AlrauneZ requested review from JaroCamphuijsen and raar1 February 23, 2026 14:16

AlrauneZ marked this pull request as ready for review February 23, 2026 14:16

JaroCamphuijsen requested changes Feb 23, 2026

View reviewed changes

AlrauneZ added 5 commits February 24, 2026 09:50

adapt handling of 'inf' values

4cc5615

Update example data including an 'inf' value

2934853

update testing to handling of 'inf'

5906fa2

Include replacement of 'inf' values in data cleaning

80d6441

fix import

f8520ac

AlrauneZ requested a review from JaroCamphuijsen February 24, 2026 10:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update handling of global option of 'use_inf_as_na'#126

update handling of global option of 'use_inf_as_na'#126
AlrauneZ wants to merge 9 commits intomainfrom
124-update-handling-of-use_inf_as_na

AlrauneZ commented Feb 23, 2026 •

edited

Loading

Uh oh!

JaroCamphuijsen left a comment

Uh oh!

JaroCamphuijsen Feb 23, 2026

Uh oh!

JaroCamphuijsen Feb 23, 2026

Uh oh!

JaroCamphuijsen Feb 23, 2026

Uh oh!

JaroCamphuijsen Feb 23, 2026

Uh oh!

sonarqubecloud bot commented Feb 24, 2026

Uh oh!

AlrauneZ commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AlrauneZ commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JaroCamphuijsen left a comment

Choose a reason for hiding this comment

Uh oh!

JaroCamphuijsen Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

JaroCamphuijsen Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

JaroCamphuijsen Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

JaroCamphuijsen Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Feb 24, 2026

Quality Gate passed

Uh oh!

AlrauneZ commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AlrauneZ commented Feb 23, 2026 •

edited

Loading