Speed up quantiles with sorting #1513

SarahG-579462 · 2023-10-26T20:22:21Z

Pull Request Checklist:

This PR addresses an already opened issue (for bug fixes / features)
- This PR will help Efficient Application of Bias Correction #1255
Tests for the changes have been added (for bug fixes / features)
- (If applicable) Documentation has been added / updated (for bug fixes / features)
CHANGES.rst has been updated (with summary of main changes)
- Link to issue (:issue:number) and pull request (:pull:number) has been added

What kind of change does this PR introduce?

nbutils.quantile has a speed-up of more than 2.5x by a combination of changes in nbutils.quantile and nbutils._quantile
This does not cover nbutils.vec_quantiles (used for adapt_freq) but similar principles could be used
It adds the possibility of using fastnanquantile module which is very fast

Does this PR introduce a breaking change?

No

Other information:

The new low-level function to compute quantiles nbutils._quantile is a 1d jitted version of xclim.core.utils._nan_quantile
Manual benchmarking can be performed in the notebook benchmarks/sdba_quantile.ipynb, attached to this PR.

review-notebook-app · 2023-10-26T20:22:25Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

github-actions · 2023-10-26T20:22:36Z

Welcome, new contributor!

It appears that this is your first Pull Request. To give credit where it's due, we ask that you add your information to the AUTHORS.rst and .zenodo.json:

The relevant author information has been added to AUTHORS.rst and .zenodo.json

Please make sure you've read our contributing guide. We look forward to reviewing your Pull Request shortly ✨

for more information, see https://pre-commit.ci

xclim/sdba/nbutils.py

juliettelavoie · 2023-11-01T13:43:30Z

For a 30 years period, over QC, the bias adjustment (first initial DQM + Npdf + restore) with these parameters:

n_iter: 30
nquantiles:50
group: time
n_escore: 1

took 8 hours for this branch and 8h40 for the master. ~~(time for branch faster_mbcn is coming.)~~

EDIT: Not sure if it makes a difference but, the test for this branch ran mostly during work hours and the test for the master ran during the night.

UPDATE: It took 6h30 with branch faster_mbcn. @coxipi your branch does help in my case!

Zeitsperre · 2023-11-16T16:09:57Z

@SarahG-579462

checking consistency... /home/docs/checkouts/readthedocs.org/user_builds/xclim/checkouts/1513/docs/notebooks/sdba-speed.ipynb: WARNING: document isn't included in any toctree

Small FYI, you need to add the notebook to the exclude_patterns in docs/conf.py if you don't want it rendered.

xclim/sdba/nbutils.py

coxipi · 2024-03-12T23:06:09Z

So I hijacked the PR a bit to test general benchmark tools. I needed to install pytest-benchmarkand pytest-benchmark[histogram]. The idea would be to have a benchmarking suite, similar to the structure of the testing suite. For a test similar to your current notebook, I run:

pytest  -n0 --dist no benchmarks/sdba/test_nbutils.py

where test_nbutils.py is

from xclim.sdba import nbutils
import pytest 
import importlib
import os
import numpy as np

nq = 50
i = 0
@pytest.mark.parametrize(
    "use_nanquantile,size",
    [
    (True, 100),
    (True, 200),
    (True, 500),
    (True, 1000),
    (False, 100),
    (False, 200),
    (False, 500),
    (False, 1000),
    ],
)
def test_nanquantile_simple(benchmark,use_nanquantile, size):
    np.random.seed(0)
    arr = np.random.randn(size)
    os.environ["USE_NANQUANTILE"] = "True" if use_nanquantile else ""
    importlib.reload(nbutils)   
    def func():
        return nbutils._choosequantile(arr, np.linspace(0, 1, nq))
    # trigger jit compilation on first run
    global i
    if i==0: 
        func()
        i = i + 1
    benchmark(func)

(I should have named it test__choosequantile but anyways)

I can then have an histogram with the results:

It's a bit annoying that the order of parameters in pytest.mark.parametrize is not respected. I could sort the names alphabetically, but then "1000" points comes before the other smaller samples. Anyways.

Anyways, it seems pretty plug and play. I find the visualization with the notebook is better, so we might want to keep a notebook for benchmarks too? Simpler tests that are not comparing multiple methods would benefit of simple numerical tests just to assert the performance doesn't drop below a certain level perhaps?

coxipi · 2024-03-13T15:56:32Z

I did a test with EmpiricalQuantileMapping, 30 years, 3 locations, dayofyear-31.
Sorting is still advantageous, but the effect is marginal. There is maybe an overhead which is just much greater than the actual computation of quantiles? False/True refers to use_nanquantile

In comparison, group="time" has a better performance for sortquantile, it's about 85-90% of the nanquantile running time. In that case, there must be less overhead (complicated map_blocks structure has a less important role in this case?)

SarahG-579462 · 2024-03-13T16:52:43Z

I did a test with EmpiricalQuantileMapping, 30 years, 3 locations, dayofyear-31. Sorting is still advantageous, but the effect is marginal. There is maybe an overhead which is just much greater than the actual computation of quantiles? False/True refers to use_nanquantile

This is really interesting, thanks. I totally support you taking over this PR, I don't really have time to work on it these days.

…tile

SarahG-579462 · 2024-07-02T21:31:04Z

The present implementation is not right, the call to nan_quantile is not properly "jitterized". Among other things, some operations in nan_quantile use the argument axis in nan_quantile which is unsupported for many functions used with njit. The library fastnanquantile shows how those parts working on axis must be done outside the njit calls. Also, np.asanyarray doesn't seem to be supported. I'm not sure why this is triggered or not in certain contexts.

The function would either need to be properly jitterized, or keep it unjittered and study again the performance. Since sdba and xclim are to be separated, I feel some redundancy would not be bad on this front, I was considering to simply revert some commits and continue to use sortquantile. Thoughts?

During testing, I had made a version of Abel's nan_quantile that supports only 1 dimension, that is jitt'able. Would it be preferable to use this? Abel's unjitted version is really not much slower than the jitted version, because he's using quite pure numpy functions.

… into speed-up-quantile

…-up-quantile

coxipi · 2024-07-09T17:51:26Z

Should be good to go, I added a few tests

coxipi · 2024-07-09T20:10:48Z

Would it be possible for you to add fastnanquantile as a dependency that can be installed via tox (e.g. under deps: fastnanquantile: fastnanquantile) and add that to one of the PyPI/tox CI builds (in main.yml) so that we can test against it?

@Zeitsperre under test-pypi?
EDIT: I will wait your return, exploring this part of the codebase is not good for my cardiac rhythm

github-actions · 2024-07-12T00:15:36Z

Note

It appears that this Pull Request modifies the main.yml workflow.

On inspection, the XCLIM_TESTDATA_BRANCH environment variable is set to the most recent tag (v2023.12.14).

No further action is required.

…-up-quantile

coveralls · 2024-07-12T14:18:18Z

coverage: 90.417% (-0.3%) from 90.679%
when pulling df6963b on speed-up-quantile
into 1b83536 on main.

…-up-quantile

Zeitsperre · 2024-07-19T15:47:36Z

@aulemahal I leave you to do the final review and merge. I won't pretend to be the expert here.

coxipi · 2024-07-19T15:56:35Z

@SarahG-579462 can you inspect a last time? Should be good now

SarahG-579462

Looks good to me! a note about fastmath below.

xclim/sdba/nbutils.py

…-up-quantile

… into speed-up-quantile

SarahG-579462 added 3 commits October 26, 2023 19:16

Improvements to sdba _quantile function

7d752a4

Add to vecquantiles

2de9086

fix vecquantiles?

65ddbdd

github-actions bot added sdba Issues concerning the sdba submodule. docs Improvements to documenation labels Oct 26, 2023

pre-commit-ci bot and others added 7 commits October 26, 2023 20:22

[pre-commit.ci] auto fixes from pre-commit.com hooks

719e9c6

for more information, see https://pre-commit.ci

pre-commit and proper signature orders

a798e8a

Remove changes to vecquantiles

a134bc2

Reformat sdba-speed notebook

749a467

Update AUTHORS.rst

ea57cf5

change criteria for nanquantile algorithm

7393c34

precompilation for most nbutils

8845208

aulemahal reviewed Oct 31, 2023

View reviewed changes

xclim/sdba/nbutils.py Outdated Show resolved Hide resolved

fix escores

6e35ec4

Zeitsperre and others added 3 commits November 16, 2023 14:42

Merge branch 'master' into speed-up-quantile

88aadeb

Merge branch 'master' into speed-up-quantile

500428f

Merge branch 'master' into speed-up-quantile

f81f484

coxipi reviewed Mar 12, 2024

View reviewed changes

xclim/sdba/nbutils.py Outdated Show resolved Hide resolved

coxipi reviewed Mar 12, 2024

View reviewed changes

xclim/sdba/nbutils.py Outdated Show resolved Hide resolved

coxipi added 4 commits April 29, 2024 15:06

allow_sortquantile option

5741c2d

Merge branch 'main' of github.com:Ouranosinc/xclim into speed-up-quan…

006d26e

…tile

time dimensions not stacked = faster quantile

33a79b0

Merge branch 'main' of github.com:Ouranosinc/xclim into speed-up-quan…

b629e7d

…tile

coxipi added 7 commits July 9, 2024 13:19

1d numba compatible _nan_quantile in sdba

980ec94

Merge branch 'speed-up-quantile' of https://github.com/Ouranosinc/xclim…

99c368a

… into speed-up-quantile

test nbu.quantile

ccc8291

conserve arr.dtype

7905a15

Merge branch 'main' of https://github.com/Ouranosinc/xclim into speed…

d5b81c0

…-up-quantile

CHANGELOG formatting

a2682ef

unwanted space

1585fc0

coxipi added 2 commits July 9, 2024 16:28

respect np2 new conventions

10b13dd

extras dependency (fastnanquantile)

53c9f23

github-actions bot added the CI Automation and Contiunous Integration label Jul 12, 2024

coxipi added 2 commits July 12, 2024 09:08

Merge branch 'main' of https://github.com/Ouranosinc/xclim into speed…

48b2e14

…-up-quantile

extra mention of extra

9206b73

coxipi and others added 2 commits July 19, 2024 10:56

Merge branch 'main' of https://github.com/Ouranosinc/xclim into speed…

d876376

…-up-quantile

Merge branch 'main' into speed-up-quantile

0b58a78

Zeitsperre approved these changes Jul 19, 2024

View reviewed changes

Zeitsperre requested a review from aulemahal July 19, 2024 15:25

Merge branch 'main' into speed-up-quantile

6542128

SarahG-579462 commented Jul 19, 2024

View reviewed changes

xclim/sdba/nbutils.py Show resolved Hide resolved

xclim/sdba/nbutils.py Show resolved Hide resolved

coxipi and others added 3 commits July 19, 2024 18:07

Merge branch 'main' of https://github.com/Ouranosinc/xclim into speed…

f93e71f

…-up-quantile

Merge branch 'speed-up-quantile' of https://github.com/Ouranosinc/xclim…

e3d0112

… into speed-up-quantile

Merge branch 'main' into speed-up-quantile

df6963b

coxipi merged commit 05e721b into main Jul 23, 2024
21 checks passed

coxipi deleted the speed-up-quantile branch July 23, 2024 16:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up quantiles with sorting #1513

Speed up quantiles with sorting #1513

SarahG-579462 commented Oct 26, 2023 •

edited by coxipi

Loading

review-notebook-app bot commented Oct 26, 2023

github-actions bot commented Oct 26, 2023 •

edited by Zeitsperre

Loading

juliettelavoie commented Nov 1, 2023 •

edited

Loading

Zeitsperre commented Nov 16, 2023

coxipi commented Mar 12, 2024 •

edited

Loading

coxipi commented Mar 13, 2024 •

edited

Loading

SarahG-579462 commented Mar 13, 2024

SarahG-579462 commented Jul 2, 2024

coxipi commented Jul 9, 2024

coxipi commented Jul 9, 2024 •

edited

Loading

github-actions bot commented Jul 12, 2024 •

edited

Loading

coveralls commented Jul 12, 2024 •

edited

Loading

Zeitsperre commented Jul 19, 2024

coxipi commented Jul 19, 2024

SarahG-579462 left a comment •

edited

Loading

Speed up quantiles with sorting #1513

Speed up quantiles with sorting #1513

Conversation

SarahG-579462 commented Oct 26, 2023 • edited by coxipi Loading

Pull Request Checklist:

What kind of change does this PR introduce?

Does this PR introduce a breaking change?

Other information:

review-notebook-app bot commented Oct 26, 2023

github-actions bot commented Oct 26, 2023 • edited by Zeitsperre Loading

juliettelavoie commented Nov 1, 2023 • edited Loading

Zeitsperre commented Nov 16, 2023

coxipi commented Mar 12, 2024 • edited Loading

coxipi commented Mar 13, 2024 • edited Loading

SarahG-579462 commented Mar 13, 2024

SarahG-579462 commented Jul 2, 2024

coxipi commented Jul 9, 2024

coxipi commented Jul 9, 2024 • edited Loading

github-actions bot commented Jul 12, 2024 • edited Loading

coveralls commented Jul 12, 2024 • edited Loading

Zeitsperre commented Jul 19, 2024

coxipi commented Jul 19, 2024

SarahG-579462 left a comment • edited Loading

Choose a reason for hiding this comment

SarahG-579462 commented Oct 26, 2023 •

edited by coxipi

Loading

github-actions bot commented Oct 26, 2023 •

edited by Zeitsperre

Loading

juliettelavoie commented Nov 1, 2023 •

edited

Loading

coxipi commented Mar 12, 2024 •

edited

Loading

coxipi commented Mar 13, 2024 •

edited

Loading

coxipi commented Jul 9, 2024 •

edited

Loading

github-actions bot commented Jul 12, 2024 •

edited

Loading

coveralls commented Jul 12, 2024 •

edited

Loading

SarahG-579462 left a comment •

edited

Loading