Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement ClimaLand global calibration #794

Open
2 of 4 tasks
AlexisRenchon opened this issue Sep 25, 2024 · 12 comments
Open
2 of 4 tasks

Implement ClimaLand global calibration #794

AlexisRenchon opened this issue Sep 25, 2024 · 12 comments
Assignees

Comments

@AlexisRenchon
Copy link
Member

AlexisRenchon commented Sep 25, 2024

Tasks

@AlexisRenchon AlexisRenchon self-assigned this Sep 25, 2024
@AlexisRenchon AlexisRenchon changed the title Steps toward global calibration Implement ClimaLand global calibration Sep 25, 2024
@AlexisRenchon
Copy link
Member Author

Observations:

  • first, one per variable (same location), e.g., LHF, GPP with combine_observation
  • Then, combine them for multiple locations with Observationseries

@AlexisRenchon
Copy link
Member Author

Data (LHF and SHF that we calibrate on) needs to be in the same format as model output

@AlexisRenchon
Copy link
Member Author

Note: we should change single site observation calibration a bit, it takes 2 hours right now (300 simulations run), we could save the figure but run it with less simulations to ensure it passes CI

@AlexisRenchon
Copy link
Member Author

Land model:
What loss function?
LHF, SHF, emissivity, SW, LW, (SIF, NEE, C fluxes)

@AlexisRenchon
Copy link
Member Author

AlexisRenchon commented Oct 23, 2024

  • Format: EKP vs. ClimaCalibrate
  • Models: Bucket vs. LandModel
  • Data: Observations vs. Perfect Model
  • Scale: Single site vs. Multi-columns vs. Global

Possible KR:

  • Create pipeline with EKP or ClimaCalibrate
  • Demonstrate parameters recovery with Perfect Model
  • Demonstrate X with observations

What we mean by Loss:

  • Different averaging period or other reduction of variables
  • Different configuration, e.g., initialisation

Parameters:

  • Correlations between parameters
  • Parameters selection and distributions

Next to do:

  • Multiple variables stacked (ETKI, see comment above)
  • Multiple columns (e.g., 100s, see above)
  • Using netcdf diagnostics read and write

What we can qualify / quantify (and remember, stochastic method -> change all the time):

  • PM: collapse of ensemble toward original parameters
  • prior spread vs. final iteration spread
  • correlation between params

random notes:

  • Julia does not preserve seed between version!

@AlexisRenchon
Copy link
Member Author

(to do: add 2*sqrt(noise) as a band around target)

@AlexisRenchon
Copy link
Member Author

Options: limits the amount of data by averaging over larger steps

@AlexisRenchon
Copy link
Member Author

Think of informative features of the data that you want to capture such as seasonal amplitude, seasonal shifts, diurnal amplitude, etc. that are controlled by parameters...

Then you can make THAT your target as a function of params, and calibrate

@AlexisRenchon
Copy link
Member Author

observational noise is standard deviation, i.e., covariance

@AlexisRenchon
Copy link
Member Author

next steps:

  • ClimaCalibrate for Bucket
  • g(params) for full land model, global calibration

@AlexisRenchon
Copy link
Member Author

when doing random locations on land, do we sample good representation of all landscapes

@AlexisRenchon
Copy link
Member Author

only write LHF and SHF diagnostics

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant