GitHub - ReHoss/singularity-hydrogym: Combine singularity containers with hydrogym and stable-baselines3

Installation

Hydrogym uses a Computational Fluid Dynamics (CFD) library calledFiredrake. Firedrake is a Python library that solves partial differential equations using the finite element method. It is not easy to install on platforms where the user does not have root access, such as HPC clusters.

Depending on the user needs, the following softwares may be required:

Docker
Singularity
MLFlow: Experiment tracker that is installed by default from pyproject.toml file.
Stable Baselines3: Reinforcement learning library that is also installed by default.
spython: to convert Dockerfile to Singularity definition files .def

Local development

Locally, you can install this codebase by following these steps:

Install Firedrake: https://www.firedrakeproject.org/download.html
The firedrake installation necessarily creates a virtual environment (e.g. venv_firedrake).
Activate the virtual environment: source venv_firedrake/bin/activate
Install the singularity-hydrogym package: pip install -e .

Docker

For local development, Docker can be used.

To build the Docker image, run the following script (from the project root):

./singularity-hydrogym/bash_scripts/local/docker/build_container.sh

This scripts builds the Docker image from the Dockerfile located in docker/. It passes the --build-arg flag to the Docker build command to specify the UID of the host user running the script. This permits to give the same UID to the firedrake user in the container, so that writing permissions from the container to the host are granted.

To run the container run the following script (from the project root):

./singularity-hydrogym/bash_scripts/local/docker/run_container.sh

Now, inside the container you can perform the integration with this following example script:

python src/integration/main.py --yaml /home/firedrake/mount_dir/project_root/configs/cavity/cavity_reynolds-7500.yaml

Note that while src/ is copied to the container file-system during the build phase of the Docker image, the data/ and configs/ directories are mounted in the container when running it.

Usually, Docker is not supported on HPC clusters. However, an equivalent solution is to use Singularity. The Docker images available in this repository are analogous to the Singularity images available in the singularity/ folder. It can be used for local development and testing through development environments such as VSCode or PyCharm.

Singularity

The Singularity containers are the ones that should be used on HPC clusters. To build a Singularity container, scripts from bash_scripts/local/singularity/ can be used.

Details on the images

The Docker images are based on Firedrake official images. Those images create a user called firedrake. The Firedrake python environment is located at /home/firedrake/firedrake/bin/activate. In order to let users in the container to activate this environment and manipulate files in /home/firedrake, the firedrake user is modified to have the same UID as the user running the containers during image generation. Such images are defined in docker/Dockerfile and singularity/definition_files/hydrogym-firedrake.def. However, user account modifications through usermod may be not allowed in some HPC clusters, thus the images may not work for security reasons.

Consequently, an alternative solution implemented in singularity/definition_files/hydrogym-firedrake_nousernamespace.def is to make the \home\firedrake directory writable and executable by all users. It is the preferable way to go.

Codebase description:

bash_scripts/ - Contains scripts for running containers and others.
congigs/ - Contains configuration files for the src/ python scripts.
data/ - Contains data needed by the src/ python scripts.
docker/ - Contains the Dockerfiles for the containers.
singularity/ - Contains the Singularity files for the containers.
src/ - The directory venv may content your own virtual environment, to run MLFlow and other tools or a version of the Firedrake environment.

Data generated from src/integration/main.py will be saved in the data/mlruns/ folder. The data/ folder is mounted in the containers (in read-write mode), so the data will be available in the containers.

Initial vector field

Initial vector fields are needed to start the simulations. Only the one for the Cavity Flow problem is provided in the data/ folder for now but more can be added very easily.

TODO:

Issue update the repo for Firedrake warnings
PATH_VENV="$PATH_CONTENT_ROOT"/venv/"$V_ENV_NAME"/bin/activate
Talk about initial data
Make the Steady State solver script
Containers run on Ruche but not on local now ?
Sync with cluster
Add pip install -e . in the Dockerfile

Ressources:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Local development

Docker

Singularity

Details on the images

Codebase description:

Initial vector field

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
bash_scripts		bash_scripts
configs/cavity		configs/cavity
data		data
docker		docker
singularity		singularity
src		src
venv		venv
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

ReHoss/singularity-hydrogym

Folders and files

Latest commit

History

Repository files navigation

Installation

Local development

Docker

Singularity

Details on the images

Codebase description:

Initial vector field

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages