WIP: FEA Quantile regression for decision trees by cakedev0 · Pull Request #1 · cakedev0/scikit-learn

cakedev0 · 2025-09-14T19:36:19Z

WIP

Follow up to PR scikit-learn#32100 and esp. discussion here: scikit-learn#32100 (comment)

TODO (bottom-up order):

Reference Issues/PRs

scikit-learn#32100

What does this implement/fix? Explain your changes.

Any other comments?

Maths:

We consider a weighted dataset ${(y_i, w_i)}_{i}$ with non-negative weights $w_i$.

For a scalar prediction $q$, the weighted pinball loss is

$$ L_\alpha(q) = \sum_{i} w_i \big( \alpha \max(y_i - q, 0) + (1 - \alpha)\max(q - y_i, 0) \big) $$

Equivalently, splitting by whether $y_i \ge q$ or $y_i < q$:

$$ L_\alpha(q) = \alpha \sum_{i: y_i \ge q} w_i (y_i - q) + (1 - \alpha) \sum_{i: y_i < q} w_i (q - y_i) $$

To evaluate this efficiently, introduce the aggregates

$$ W^+(q) = \sum_{i: y_i \ge q} w_i, \qquad Y^+(q) = \sum_{i: y_i \ge q} w_i y_i, $$

$$ W^-(q) = \sum_{i: y_i < q} w_i, \qquad Y^-(q) = \sum_{i: y_i < q} w_i y_i. $$

Using these, the loss admits the "O(1)" form

$$ L_\alpha(q) = \alpha \big( Y^+(q) - q W^+(q) \big) + (1 - \alpha) \big( q W^-(q) - Y^-(q) \big). $$

Or in the code:

q * (above.weighted_sum - quantile * above.total_weight)
+ (1 - q) * (quantile * below.total_weight - below.weighted_sum)

…dded print everywhere to debug; fixed some bugs

…al PR but not all

…rray

Naming & comments Co-authored-by: Adam Li <adam2392@gmail.com>

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

…to mae-split-optim

cakedev0 and others added 30 commits September 2, 2025 22:40

First draft, needs tests & fixes

f7cf7d6

Merge remote-tracking branch 'upstream/main' into mae-split-optim

7061ff6

fixed compilation errors

f4edaa2

fixed compilation errors

01fd9b2

Moved AE computation in external helper to be able to unit-test it; a…

3f87b99

…dded print everywhere to debug; fixed some bugs

WIP some additional tests that helped me, some will be kept in my fin…

e8adf96

…al PR but not all

tests cleanup

4ed868e

cleanup

83d89a4

cleanup

1ca34bf

Merge remote-tracking branch 'upstream/main' into mae-split-optim

43692f7

WIP fixing linting issues

d463558

fixed linting

fa993d4

fix spelling

cbf5405

Added test that would fail before this PR

a4bd310

added changed logs

f4a0e07

cleanup

a86a190

comments & cleanups

092af65

slight refactor of class inheritance

4a12dea

Merge remote-tracking branch 'upstream/main' into mae-split-optim

b44fb2b

adressed PR comments; simplified dimension of left/right abs errors a…

81728c2

…rray

removed print

7477f4c

heap methods docstring; test: split assertion

8f035d0

unit test for heap

e6bf43b

fix comment

eb2ccf5

Merge remote-tracking branch 'upstream/main' into mae-split-optim

66a2cb6

Apply suggestions from code review

d13a2c5

Naming & comments Co-authored-by: Adam Li <adam2392@gmail.com>

comments & naming

4fc78f4

parameters docstring

220c34f

Update doc about MAE criterion speed

d9b3c35

move precompute

72e15b5

cakedev0 and others added 13 commits September 26, 2025 16:59

new test and fix

77dcb19

fix typo

14014f5

remove np.pow

ad16ae0

Apply suggestion from @ogrisel

1e9c74f

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Apply suggestion from @cakedev0

b21040e

added explanation test; more tests with integer weights

e557f9e

Merge branch 'mae-split-optim' of github.com:cakedev0/scikit-learn in…

f920379

…to mae-split-optim

Merge branch 'mae-split-optim' into quantile-regression

c204c20

Merge remote-tracking branch 'upstream/main' into mae-split-optim

c842e59

Merge branch 'main' into mae-split-optim

bec926a

Merge branch 'mae-split-optim' into quantile-regression

0cdeaaf

Merge branch 'main' into quantile-regression

bf0007f

cleanup, comments updates, renamings, ...

19bf4a6

cakedev0 changed the base branch from mae-split-optim to main December 15, 2025 20:46

cakedev0 added 16 commits December 15, 2025 21:48

remove old changelog

aaa4b2a

Added simple changelog

4023c02

Merge remote-tracking branch 'upstream/main' into quantile-regression

e9c424d

Merge remote-tracking branch 'upstream/main' into quantile-regression

10e7398

renaming & public API

a718a68

userguide

23f0382

added tests

30418a4

support in RF/ExtraTrees

2cfc7f2

add a test with quantile criterion for forests

e06bd6c

fix docstring

a9b26d6

update changelog

6316b6b

cleanup

19a46b8

Merge remote-tracking branch 'upstream/main' into quantile-regression

c1a9af9

Merge remote-tracking branch 'upstream/main' into quantile-regression

3f81df5

fix __reduce__ for MAE criterion

ff2df12

minor public doc udpate

1553d6c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: FEA Quantile regression for decision trees#1

WIP: FEA Quantile regression for decision trees#1
cakedev0 wants to merge 82 commits intomainfrom
quantile-regression

cakedev0 commented Sep 14, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

Conversation

cakedev0 commented Sep 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

WIP

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Maths:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

cakedev0 commented Sep 14, 2025 •

edited

Loading