dhSegment-text-torch

This repository contains an add-on to dhSegment torch to use it with text embeddings maps.

For more details about text embeddings map, please see the following publication:

Barman, Raphaël, Ehrmann, Maud, Clematide, Simon, Ares Oliveira, Sofia, and Kaplan, Frédéric  (2020).
Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers.
Journal of Data Mining and Digital Humanities. https://arxiv.org/abs/2002.06144

Usage

This repository introduces new input code for using text embeddings maps as well as new networks.

Using dhSegment torch training script, the following config parameters must be changed:

train_dataset and val_dataset should now be of type image_text_csv (note that patches datasets are not supported).
train_loader and val_loader must be set to text_data_loader.
model type should be set to text_segmentation_model.
Either the encoder or decoder should be set to the text_ variant (currently supported architectures are text_resnet50 and text_unet).
The text_ encoder or decoder should have the following additional parameters:
- "embeddings_encoder": {"target_embeddings_size": 300} set to the size of the embeddings (here 300).
- "embeddings_level": 0 set to the level in the network where the embeddings map should be input (here 0)

An example config file can be found in example_conf.json.

In addition to these changes to the config file, the training script should be modified to by adding import dh_segment_text_torch to the top.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
dh_segment_text		dh_segment_text
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
example_conf.json		example_conf.json
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dhSegment-text-torch

Usage

About

Releases

Packages

Languages

License

dhlab-epfl/dhSegment-text-torch

Folders and files

Latest commit

History

Repository files navigation

dhSegment-text-torch

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages