This repository contains schemas and models for working with the BioImage Archive's MIFA (Metadata, Incentives, Formats and Accessibility) metadata implementation. This metadata model is intended to promote the reusability of AI-ready biological image datasets and it is based on the community recommendations summarised here.
The MIFA metadata model provides classes to work with the metadata related to the dataset annotations, version, and general study information. For the biological images metadata, the BioImage Archive follows the REMBI (Recommended Metadata for Biological Images) model. You can find the models and tools for working with the BioImage Archive's REMBI implementation here: https://github.com/BioImage-Archive/bia-rembi-models
You can browse the MIFA metadata model here: https://BioImage-Archive.github.io/bia-mifa-models
The main source of truth is bia_mifa_models.yaml, a relatively simple to edit YAML file created with LinkML.
The yaml definition is used to derive the following files:
- Excel
- GraphQL
- JSON-LD context
- JSON Schema
- OWL
- Python dataclasses
- Prefix map
- ProtoBuf definitions
- SHACL
- RDF Shape Expressions
- SQL schema
- examples/ - example data from real BioImage Archive submissions
- project/ - project files, including all artefacts derived from the yaml definition (do not edit these)
- src/ - source files (edit these)
- bia_mifa_models
- schema -- LinkML schema - yaml definition (edit this)
- datamodel -- generated Python datamodel
- pydantic_model -- generated Pydantic datamodel
- bia_mifa_models
- tests/ - Python tests
Prerequisites: Python 3.7+ and poetry
make all
: make everythingmake deploy
: deploys site