Add persist argument to QuantileDeltaMapping train method #1697

saschahofmann · 2024-04-03T08:45:12Z

Addressing a Problem?

In the case where you want to reuse the trained dataset for multiple adjustments the docs already mention that you can trigger the training by calling .load on the .ds dataset object of the class. This loads the trained model into the memory of the main thread. For bigger, datasets it might be required to leave the data on the worker but still only compute it once.

This can already be done by doing something like

qdm.set_dataset(qdm.ds.persist())

I could imagine that this is a common enough case to add an argument to the train method that does exactly that.

Potential Solution

Extend the train method by adding persist=False optional argument. That if true updates the trained dataset to be a persisted dask array.

Additional context

No response

Contribution

I would be willing/able to open a Pull Request to contribute this feature.

Code of Conduct

I agree to follow this project's Code of Conduct

The text was updated successfully, but these errors were encountered:

aulemahal · 2024-04-16T15:05:50Z

Hi again,

I haven't tried that yet, maybe I should! In our latest large scale workflows, we wrote the training the dataset to disk as it was larger than memory anyway.

I find the qdm.set_dataset(qdm.ds.persist()) line to be simple enough, I'm not totally convinced this warrants an implementation in xclim ? However, that implementation would also be very simple, I suggest adding a persist method to ParametrizableWithDataset here. Would that solve the issue for you?

QDM = QuantileDeltaMapping.train(ref, hist, **kwargs)
QDM.persist()
QDM.adjust(sim, **kwargs)

saschahofmann · 2024-04-16T15:21:56Z

Ah yes that would be another way to do it. I agree maybe it doesn't warrant an extra step especially if the more common use case is to persist on disk! Feel free to close

saschahofmann added the enhancement New feature or request label Apr 3, 2024

saschahofmann closed this as completed Jul 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add persist argument to QuantileDeltaMapping train method #1697

Add persist argument to QuantileDeltaMapping train method #1697

saschahofmann commented Apr 3, 2024

aulemahal commented Apr 16, 2024 •

edited

Loading

saschahofmann commented Apr 16, 2024

Add persist argument to QuantileDeltaMapping train method #1697

Add persist argument to QuantileDeltaMapping train method #1697

Comments

saschahofmann commented Apr 3, 2024

Addressing a Problem?

Potential Solution

Additional context

Contribution

Code of Conduct

aulemahal commented Apr 16, 2024 • edited Loading

saschahofmann commented Apr 16, 2024

aulemahal commented Apr 16, 2024 •

edited

Loading