The SBI-FAIR repository hosts metadata for surrogate AI models and datasets, facilitating their use and reuse by providing a unified interface for training, evaluation, and deployment.
This repository aims to:
- Offer a consistent interface for reproducible training, evaluation, and deployment of surrogate models.
- Provide Docker and Apptainer container definitions for each surrogate model, and reusable scripts and SCIF recipes to build derivatives (for example to use in specialized environments such as HPC clusters)
- Facilitate dataset access by referencing external URLs for downloads and providing loaders (as scripts or Docker containers) for datasets requiring post-processing or generation from referenced data.
- Supply patches to ensure models conform to the unified interface.
The repository does not store datasets or models' code but focuses on modifications
needed for standardization. Orchestration is to be handled by external tools like cloudmesh
, SABATH
, or MLCube
.
Currently the following surrogate models have at least training implemented. Here are the detailed instructions how to build containers and run the training using the provided examples:
The documentation about the repository structure and the general examples how to build and execute the models that may be usefull for integration with other tools will be provided here