This repository contains the Guidelines of the SegmOnto controlled vocabulary for layout analysis and segmentation.
There are two classes of regions: zones (an area, i.e., polygon, on the page) and lines (an area, plus a baseline).
Zones or lines can be caracterised by:
- a type (mandatory, controlled values)
- a subtype (optional, suggested open list of values)
- a number (optional)
of the form
Region(:subtype)?(#\d)?
e.g.
MainZone
or
MainZone#1
,MainZone:column
MainZone:column#1
.
The repository contains the text of the Guidelines and definitions, with somes examples, as markdown files stored in two folders:
- Zones
- Lines