Testing refactor #612

mwaxmonsky · 2024-11-11T04:48:42Z

Refactoring of testing infrastructure

Changes include:

Refactored python tests to leverage unittest framework.
Replaced shell tests with equivalent wrapper from Python.
Reduces value duplication from fortran/shell/python files
to just fortran and python file.
Adds re-useable CMake infrastructure for capgen, advection,
and var_compatability tests.
Enabling re-use of openmp and mpi configuration for both library
and tests.
Removing PGI compiler options and updating intel flags to ifx
equivalents.
Replaces manually calling test binaries with calling ctest
Removed a few un-used files

User interface changes?: No

Fixes: None

Testing:
test removed: None
unit tests:
system tests:
manual testing:

… with same binary name.

mwaxmonsky · 2024-11-11T05:12:19Z

Greetings all, apologies for throwing this out there without an issue but I've been trying to keep up with the changes in the framework and the equivalent tests and was getting lost in the details and noticed a few things that I thought would make things a little bit easier overall.

My main goal was making it more explicit what was being tested and reducing the amount of duplication we have but that necessitated quite a few changes at various levels of the infrastructure.

I started going down a rabbit hole of how the framework is built and tested but before going much further, I figured I would share some of the things I've updated and check in to see if these make sense and are useful for the rest of the team.

There are a few technical details that need to be evaluated as well but I wanted to check if, overall, this is moving things in a reasonable direction or if this is making things more complicated?

gold2718 · 2024-11-11T08:11:03Z

Hi @mwaxmonsky, thanks for tackling this.

So far, most of this looks like much needed cleanup!

There is one change I do not understand. The old (clumsy, wordy) run_test script tested both the Python and the bash interfaces to ccpp_datafile.py. Do the new tests test the bash interface? These are (or at least were) used by some build systems.

climbfuji

There is a lot of great stuff in this PR, thanks @mwaxmonsky.

My main concern with this PR is that it touches code/build files that aren't just used by the unit tests, but also by production code (e.g. the top-level CMakeLists.txt file). For instance, this PR enfores -O0 and certain compiler flags for the CCPP framework. We definitely do not want -O0 for production environments. There must be different options to set compilers for unit testing and for host models like the UFS, CAM/SIMA, NEPTUNE, and there must be a way for the host model to define compiler flags that overwrite any defaults (if we want to set defaults for cases other than running the tests).

There are two comments below that need fixing.

We also need to consider targeting the main branch for this PR due to the impact on the host modeling systems. This PR may have to come straight after the current develop was merged into main. And it must be written in a way that it allows ccpp-prebuild to continue to work, otherwise this PR (or the develop branch, if this PR gets merged into develop) will be blocked until all host models have moved to capgen. We don't want this, it was a heavy lift last time to get feature/capgen being merged into main after a long time of parallel development.

climbfuji · 2024-11-11T12:58:59Z

CMakeLists.txt

+  ADD_COMPILE_OPTIONS(-ggdb)
+  ADD_COMPILE_OPTIONS(-ffree-line-length-none)
+  ADD_COMPILE_OPTIONS(-cpp)
+elseif(${CMAKE_Fortran_COMPILER_ID} MATCHES "IntelLLVM")


ifort will be staying around for quite a while longer, we need this as an option, too.

Ah, okay, I'll add those back in and differentiate between ifort and ifx.

climbfuji · 2024-11-11T13:00:33Z

CMakeLists.txt

+option(BUILD_SHARED_LIBS "Build a static library" OFF)
+set(CCPP_VERBOSITY "0" CACHE STRING "Verbosity level of output (default: 0)")
+
+if(CCPP_FRAMEWORK_ENABLE_TESTS)


If CCPP_FRAMEWORK_ENABLE_TESTS is ON, then there needs to be a find_package call for pFUnit with the REQUIRED keyword. pfunit must come from the user-provided software stack. No implicit installs during cmake calls/build steps.

Understood, happy to make that change. I'm actually looking to make a container with all the dependencies/packages already installed so the environment better mimics our development environments to make the cmake call clean as well (we can pass in the pfunit path at configure time but it's clunky without modules).

climbfuji · 2024-11-11T13:11:49Z

test/capgen_test/test_reports.py

@@ -12,48 +12,22 @@
 """


Is every test directory in test/ going to have a copy of this gigantic file?

I will be templating it so each test/test_reports.py file only sets the field variables and inherits the tests from a shared parent file. That's a detail that's on my list but wanted to get feedback on the present state before refactoring the other directories.

mwaxmonsky · 2024-11-11T15:54:02Z

Hi @mwaxmonsky, thanks for tackling this.

So far, most of this looks like much needed cleanup!

There is one change I do not understand. The old (clumsy, wordy) run_test script tested both the Python and the bash interfaces to ccpp_datafile.py. Do the new tests test the bash interface? These are (or at least were) used by some build systems.

@gold2718 Yup! I made sure to be as faithful to the current coverage and made each of the shell tests their own unit tests at the bottom of the python file using the subprocess.run(...) API. If I missed anything on that front, I'm happy to add and address anything that might be missing.

mwaxmonsky · 2024-11-11T23:19:10Z

There is a lot of great stuff in this PR, thanks @mwaxmonsky.

My main concern with this PR is that it touches code/build files that aren't just used by the unit tests, but also by production code (e.g. the top-level CMakeLists.txt file). For instance, this PR enfores -O0 and certain compiler flags for the CCPP framework. We definitely do not want -O0 for production environments. There must be different options to set compilers for unit testing and for host models like the UFS, CAM/SIMA, NEPTUNE, and there must be a way for the host model to define compiler flags that overwrite any defaults (if we want to set defaults for cases other than running the tests).

There are two comments below that need fixing.

We also need to consider targeting the main branch for this PR due to the impact on the host modeling systems. This PR may have to come straight after the current develop was merged into main. And it must be written in a way that it allows ccpp-prebuild to continue to work, otherwise this PR (or the develop branch, if this PR gets merged into develop) will be blocked until all host models have moved to capgen. We don't want this, it was a heavy lift last time to get feature/capgen being merged into main after a long time of parallel development.

@climbfuji Happy to remove the -O0 flag and the other flags (I meant to make those target specific but wanted to get feed back on everything else before going further with the build), that was just in there from debugging and we definitely don't want that in there for production environments.

In terms of options from different compilers/flags, I'll look into different APIs for cmake and see what makes the most sense.

As for targeting main, I'm happy to pivot there but the tests for prebuild run successfully the same as on main. Is the issue that users would have to update their model code and their build integration with the framework? Just trying to understand the issue of targeting develop instead of main.

climbfuji · 2024-11-12T03:08:07Z

There is a lot of great stuff in this PR, thanks @mwaxmonsky.
My main concern with this PR is that it touches code/build files that aren't just used by the unit tests, but also by production code (e.g. the top-level CMakeLists.txt file). For instance, this PR enfores -O0 and certain compiler flags for the CCPP framework. We definitely do not want -O0 for production environments. There must be different options to set compilers for unit testing and for host models like the UFS, CAM/SIMA, NEPTUNE, and there must be a way for the host model to define compiler flags that overwrite any defaults (if we want to set defaults for cases other than running the tests).
There are two comments below that need fixing.
We also need to consider targeting the main branch for this PR due to the impact on the host modeling systems. This PR may have to come straight after the current develop was merged into main. And it must be written in a way that it allows ccpp-prebuild to continue to work, otherwise this PR (or the develop branch, if this PR gets merged into develop) will be blocked until all host models have moved to capgen. We don't want this, it was a heavy lift last time to get feature/capgen being merged into main after a long time of parallel development.

@climbfuji Happy to remove the -O0 flag and the other flags (I meant to make those target specific but wanted to get feed back on everything else before going further with the build), that was just in there from debugging and we definitely don't want that in there for production environments.

In terms of options from different compilers/flags, I'll look into different APIs for cmake and see what makes the most sense.

As for targeting main, I'm happy to pivot there but the tests for prebuild run successfully the same as on main. Is the issue that users would have to update their model code and their build integration with the framework? Just trying to understand the issue of targeting develop instead of main.

Yes, if you change the top-level CMakeLists.txt, you will impact the host models that use the ccpp-framework. UFS, SCM and NEPTUNE all use the ccpp-framework CMakeLists.txt.

mwaxmonsky · 2024-11-12T15:12:53Z

Also, this might be better saved for a discussion thread but I was wondering about the references to MPI and OpenMP. I don't see any openmp pragmas or types and currently the only reference to MPI is the mpi_comm in ccpp_types and even then we only declare a public reference but never use it internally.

I see a note in common.py about MPI:

# Maximum number of concurrent CCPP instances per MPI task
CCPP_NUM_INSTANCES = 200

and this value is used in the write function in mkstatic but it doesn't look like this is directly tied to the framework being build with MPI support.

If we don't need MPI or openmp at build time for the framework, would it make sense removing these references in the CMake and ccpp_types or are these actively being used and I'm just missing a key detail?

gold2718 · 2024-11-12T19:31:54Z

Currently, capgen generates omp_get_thread_num calls in the suite caps.
I don't think there is any MPI code generated currently by capgen.

climbfuji · 2024-11-12T19:50:46Z

ccpp-prebuild doesn't use any OpenMP calls in the auto-generated caps (everything comes from the host model via ccpp_t and its components, but the auto-generated caps and the code in src/ccpp_types.F90 need MPI. Removing the MPI dependency may break host models currently using ccpp_prebuild.

mwaxmonsky added 19 commits October 26, 2024 09:30

Re-adding pylint test.

bcf4097

Simplifying arg parser.

340c2f9

First pass refactor of tests into integrated stack.

1285ea9

Updating MPI cmake find with components.

c707b20

Adding mpi library.

fd66d0f

Updating mpi library.

278b2bd

Adding missing cmake.

9ce27bf

Actually building.

68f2575

Adding pytest dependency.

c256bed

Updating pytest binary name.

e7f84f4

Updating pytest install.

c886348

Updating pytest binary name.

20feac2

Updating test binary name in ctest to distinguish between other tests…

efff984

… with same binary name.

Minor CMake cleanup and temporarily removing MPI dependency.

84cdd96

Re-adding MPI for testing.

cc43fda

Resolving merge conflicts merging develop into testing-refactor branch.

890ea0e

Re-adding MPI linking and minor CMake cleanup.

cfe3921

Fixing CI job failure.

9b66008

Removing pylint workflow for different PR.

df44e3a

climbfuji requested changes Nov 11, 2024

View reviewed changes

climbfuji reviewed Nov 11, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Testing refactor #612

Testing refactor #612

mwaxmonsky commented Nov 11, 2024

mwaxmonsky commented Nov 11, 2024

gold2718 commented Nov 11, 2024

climbfuji left a comment

climbfuji Nov 11, 2024

mwaxmonsky Nov 11, 2024

climbfuji Nov 11, 2024

mwaxmonsky Nov 11, 2024

climbfuji Nov 11, 2024

mwaxmonsky Nov 11, 2024

mwaxmonsky commented Nov 11, 2024

mwaxmonsky commented Nov 11, 2024

climbfuji commented Nov 12, 2024

mwaxmonsky commented Nov 12, 2024

gold2718 commented Nov 12, 2024

climbfuji commented Nov 12, 2024

Testing refactor #612

Are you sure you want to change the base?

Testing refactor #612

Conversation

mwaxmonsky commented Nov 11, 2024

mwaxmonsky commented Nov 11, 2024

gold2718 commented Nov 11, 2024

climbfuji left a comment

Choose a reason for hiding this comment

climbfuji Nov 11, 2024

Choose a reason for hiding this comment

mwaxmonsky Nov 11, 2024

Choose a reason for hiding this comment

climbfuji Nov 11, 2024

Choose a reason for hiding this comment

mwaxmonsky Nov 11, 2024

Choose a reason for hiding this comment

climbfuji Nov 11, 2024

Choose a reason for hiding this comment

mwaxmonsky Nov 11, 2024

Choose a reason for hiding this comment

mwaxmonsky commented Nov 11, 2024

mwaxmonsky commented Nov 11, 2024

climbfuji commented Nov 12, 2024

mwaxmonsky commented Nov 12, 2024

gold2718 commented Nov 12, 2024

climbfuji commented Nov 12, 2024