This guide can be used as a reference point when trying to add/customize your own documentation for individual models.
This section should contain a brief description/introduction of the model. Pretty much every model will have something along the lines of this provided. Additionally, you should aim to provide the following information as part of this document:
- Developed By
- Code Repository
- GitHub/GitLab if provided
- Model Repository
- Link to the repository page of the model (i.e. HuggingFace)
- Relevant Papers (if provided)
- Release Date (if provided)
- Licensing
- Link to the license (i.e. link to the Apache 2 website)
This section should contain the intended use of the model. Most suppliers of the models will have a section like this on the repository page for the model.
This document should aim to provide users with the information required to consume the model. For example, if the model has a Python package that can be imported then it would be important to provide examples of different actions you can perform.
You are able to format Markdown to properly highlight code based on language by adding the language next to your 3 back-ticks.
Example:
```python
<code block>
```
This training section should preferably contain information such as:
- The infrastructure the model was trained on
- Data the model was trained on
- Performance data if supplied
During research it was found that a good deal of models that provided performance/benchmarking data did so for the entire model suite (all sizes), in order to provide relevant information it may be necessary to trim the data to only provide information for the model the Tech Doc is for.
Some models may also come supplied with information about the environmental costs of training this model, if that is supplied it would also be valid to add as part of this section.
This is an optional section that should be added if the model provider has this as part of their documentation. From my findings a majority of large companies/model providers have included this section in their model repositories/descriptions.
This section contains the license the model falls under as part of a Markdown file. This allows it to be easily referenced through the Tech Doc instead of needing to follow and external link for viewing.
When looking to add Tech Docs to your resource there are a few practices you should adhere to; this will help ensure that everything is rendered correctly:
- All documentation should be under
/docs
with image assets such as PNGs being stored under/docs/images
and Markdown files containing code samples under/docs/code
- All Markdown files should be referenced within mkdocs.yaml, for file path you should reference the .md files as if mkdocs.yaml is the root:
site_name: 'Documentation'
nav:
- Model Information: index.md
- Model Usage: code/usage-examples.md
- Training Information: training.md
- Ethics: ethics.md
- License: license.md
In order to add your resource (along with the documentation) to your Catalog you will need to add them to your <catalog-name>.yaml
file. Reference this is the Red Hat AI Catalog.
For your Tech Docs to properly render and be referenced, you will need to add the following annotation to your catalog entry.
metadata:
name: <resource-name>
annotations:
backstage.io/techdocs-ref: dir:<path-to-resource>
You are able to view the differing templates/structures in action as part of developer-model-service and ollama-model-service.