Intake conversion Fig3-GlobalTimeseries #344

rbeucher · 2024-06-06T05:41:37Z

Following the discussion in issue #313, we propose converting the recipes to use Intake, given that the Cookbook is no longer supported and the ACCESS-NRI Intake catalog is now available.

A few months ago, @max-anu began working on this transition. This pull request contains the changes @max-anu made to the notebook specified in the title.

julia-neme · 2024-07-29T01:12:44Z

What is exptdata? I can't run this because the module is not available. Maybe it is outdated. Do I need to find another way of opening and concatenating all cycles?

AndyHoggANU · 2024-07-29T01:37:32Z

It is an old system we used for the GMD paper.
But I simplified that in this example:
https://github.com/COSIMA/cosima-recipes/blob/main/ACCESS-OM2-GMD-Paper-Figs/Fig4-DrakePassageTransport.ipynb
So maybe follow my lead there to replace exptdata methods?

adele-morrison · 2024-07-29T01:40:08Z

Do we actually think it's worth converting all these notebooks from the Kiss et al. 2020 paper to intake? I thought they were more here for completeness and historical records. I don't think they're really still used?

julia-neme · 2024-07-29T01:44:41Z

I don't think they are in use, but I think that if we choose to keep them in the repo they should be runnable.

adele-morrison · 2024-07-29T01:45:47Z

Is it worth considering archiving them to zenodo or similar?

AndyHoggANU · 2024-07-29T01:56:58Z

I think they are good as extended contributed examples, so would be nice to keep some of them. Especially as we start to evaluate OM3 ...

julia-neme · 2024-07-29T04:25:32Z

I am not finding this conversion to intake easy... Could someone give me a hand understanding these errors? @rbeucher

RuntimeError: NetCDF: Not a valid ID

anton-seaice · 2024-07-29T04:49:54Z

> RuntimeError: NetCDF: Not a valid ID

This normally means we need to set threads_per_worker=1 for the dask client. See #409
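For reference, a minimal sketch of what that client setup might look like, assuming dask.distributed is installed. The helper names and the worker count are illustrative assumptions, not taken from the notebook.

```python
# Sketch only: build a dask client in which every worker runs a single thread,
# so each worker process reads its netCDF files from one thread at a time.

def client_kwargs(n_workers=4):
    """Keyword arguments for dask.distributed.Client.

    n_workers=4 is an arbitrary illustrative default; pick it to suit the
    size of the session you are running in.
    """
    return {"n_workers": n_workers, "threads_per_worker": 1}


def start_client(n_workers=4):
    """Start a local dask client with one thread per worker."""
    # Imported here so that the helper above has no third-party dependencies.
    from dask.distributed import Client

    return Client(**client_kwargs(n_workers))
```

In a notebook this would be called once, e.g. client = start_client(), before any data are opened.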

rbeucher · 2024-07-29T05:02:59Z

Sorry @julia-neme . Hopefully we can make it easier soon.

julia-neme · 2024-07-29T07:09:36Z

I have managed to load all the scalars via intake. However, the figure I am getting is not the same as in Kiss et al. (2020). I'm not sure what I'm doing wrong: I've just changed the loading and added one more cycle, and everything else is the same. It is also extremely slow to run, even though I am using XXlarge.

Maybe @aekiss @AndyHoggANU @rbeucher have some idea why?

dougiesquire · 2024-07-30T03:22:11Z

Hi @julia-neme. The old (cosima-cookbook) and new (intake) versions of this recipe are loading different data, which is why the plots are different. Someone who knows the history of these experiments will need to comment on this (@aekiss, @AndyHoggANU). For their reference:

The cc version uses experiments:

1deg_jra55v13_iaf_spinup1_B1
025deg_jra55v13_iaf_gmredi6
01deg_jra55v13_iaf

giving

The intake version uses experiments:

1deg_jra55_iaf_omip2_cycle.*
025deg_jra55_iaf_omip2_cycle.*
01deg_jra55v140_iaf.*

giving

Regarding the timing, the relevant baseline is the time taken before changing to the intake catalog. For me, the old version took ~7 mins to run (bearing in mind that this PR also changes the experiments being used, so comparing the timing of the old and new versions is not really comparing apples with apples).

The main reason your recipe is slow at the moment is that the chunking of the data has not been considered. By default, both the cookbook and the intake catalog open the data using the netCDF chunking. This chunking is particularly sub-optimal for this recipe because the scalar data are chunked in time. Simply adding chunks={"time": -1} when opening the data speeds things up a lot. When I add this to xarray_open_kwargs in the to_dask call in this PR, I can run the recipe to generate the plots in ~5.5 mins (remembering that this new version also includes more 01deg data than the old version).
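The chunking change described above can be sketched roughly as follows. This is an illustration under assumptions rather than the code in the PR: the function name open_scalar is hypothetical, and it presumes the ACCESS-NRI catalog is available as intake.cat.access_nri (for example, on Gadi).

```python
# Sketch only: open a scalar diagnostic from the ACCESS-NRI intake catalog
# with the whole time axis held in a single dask chunk.

# chunks={"time": -1} asks xarray for one chunk along the time dimension
# instead of following the chunking stored in the netCDF files.
XARRAY_OPEN_KWARGS = {"chunks": {"time": -1}}


def open_scalar(expt, variable):
    """Return a lazily loaded dataset for one experiment and one variable.

    `expt` is an experiment name in the catalog and `variable` is the name of
    a scalar diagnostic; both are supplied by the caller.
    """
    # Imported here because the catalog is only available where it is installed.
    import intake

    datastore = intake.cat.access_nri[expt]
    return datastore.search(variable=variable).to_dask(
        xarray_open_kwargs=XARRAY_OPEN_KWARGS
    )
```

A call might look like open_scalar("01deg_jra55v140_iaf", "temp_global_ave"), where both the experiment and the variable name are given only as examples.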

julia-neme · 2024-07-30T03:45:42Z

That's great, @dougiesquire, thank you so much. I'll try to keep in mind all the intake tips I'm getting, haha. Apologies for all my problems.

The intake catalog doesn't have the original experiments. So I guess maybe keeping these notebooks here, in a current runnable version is not possible...

dougiesquire · 2024-07-30T04:20:51Z

@julia-neme would you like me to push the changes I made to the notebook in this PR to generate the above plots in 5.5 mins?

julia-neme · 2024-07-30T04:24:45Z

On one hand yes, but on the other hand I think this is supposed to be a reproduction of the Kiss et al. (2020) figures. If we can't do that reproduction anymore because the experiments are not in intake, I'm not sure we want to update these at all... or even whether it makes sense to have them.

@AndyHoggANU @adele-morrison @navidcy what do you think?

dougiesquire · 2024-07-30T04:25:18Z

> The intake catalog doesn't have the original experiments. So I guess maybe keeping these notebooks here, in a current runnable version is not possible...

We can add those experiments to the catalog if they're important to people? Data requests can be made here

anton-seaice · 2024-07-30T04:26:08Z

> The intake catalog doesn't have the original experiments. So I guess maybe keeping these notebooks here, in a current runnable version is not possible...

I think it makes sense to move to the newer experiments, as those are the ones we use and refer to now.

I'm not sure the goal is to literally replicate the exact results in the paper, I think the goal is to have the same figures available (and up to date)?

aekiss · 2024-07-30T05:28:37Z

Agreed, I think it's worthwhile having this as an example, even if it doesn't reproduce the paper figure - thanks for getting it working!

aekiss · 2024-07-30T06:04:30Z

It would be good to widen the y axis limits in panel (a) so we can see the complete 0.25° and 0.1° timeseries.

aekiss · 2024-07-30T06:16:43Z

The difference between the 0.25° runs in panel (a) is a bit surprising. The initial condition is nearly 0.1°C warmer in the new run and the drift also seems a bit stronger.

I haven't thought of a plausible explanation for this.

The new run fixed an initial condition bug, but the resulting temperature change seems too small to explain it, and also doesn't explain why the initial difference at 1° is smaller and of the opposite sign.

The new config uses the updated bathymetry from input_20201102 (which is itself not the latest - see COSIMA/access-om2#265), so the initial state is based on WOA13 data interpolated to a different set of points. But the new 0.25° is generally somewhat deeper, which I'd expect would reduce the global mean temperature.

AndyHoggANU · 2024-07-30T09:14:59Z

Yeah, I think we should use more recent runs, but retain the style of plot that we used in the original GMD paper. The key is that people may want to use these as examples in the future, so it’s important that they work — and it’s not important for the intake catalog to include very old experiments that we won’t use for science any more …

navidcy · 2024-07-30T09:28:00Z

We can update the description of this directory in the README to say something along the lines of “reproduce figures in the same style as the Kiss et al. (2020) paper using different model output”

dougiesquire · 2024-07-30T10:16:16Z

I've just pushed the minor changes I made to speed things up and finish making the plots. Note, I also added a .load() to the global_scalar function - these data are tiny, so it doesn't help to delay the calculations here. This brought the run time down to ~3.5 mins, so ~2x faster than the original.
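As a rough illustration of the eager load mentioned above (a hypothetical stand-in, not the notebook's actual global_scalar function), the idea is to pull the small timeseries into memory as soon as it is selected:

```python
# Sketch only: a hypothetical helper that returns one scalar timeseries
# already loaded into memory.

def global_scalar(ds, variable):
    """Select `variable` from an opened xarray.Dataset and load it eagerly.

    For tiny scalar diagnostics there is little benefit in keeping the data
    as lazy dask arrays, so .load() computes them immediately. xarray's
    .load() reads the values into memory and returns the same object.
    """
    return ds[variable].load()
```

Any later arithmetic or plotting on the returned timeseries then works on in-memory arrays rather than building up a dask task graph.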

converting from cc to intake

9d46144

rbeucher added the 🕹️ hackathon 4.0 label Jun 6, 2024

Merge branch 'main' into INTAKE_Fig3-GlobalTimeseries

871aa3e

julia-neme self-requested a review July 22, 2024 06:11

failure to reproduce

12347d9

navidcy mentioned this pull request Jul 30, 2024

Converting notebooks from COSIMA Cookbook to ACCESS-NRI intake catalog #313

Open

42 tasks

Load scalar data, finish plots

041e0a7

Merge branch 'main' into INTAKE_Fig3-GlobalTimeseries

2dd537d

julia-neme approved these changes Aug 4, 2024

julia-neme merged commit 1907aaf into main Aug 4, 2024
2 of 3 checks passed

julia-neme deleted the INTAKE_Fig3-GlobalTimeseries branch August 4, 2024 23:54
