PGxCorpus, a manually annotated corpus, designed for the extraction of pharmacogenomic relations from text.

practikpharma/PGxCorpus

PGxCorpus

PGxCorpus is a manually annotated corpus designed for the extraction of pharmacogenomic relations from text. It comprises 945 manually annotated sentences drawn from 911 distinct PubMed abstracts. Annotation was carried out by 11 annotators, including 5 senior annotators. Each sentence was annotated independently by 2 annotators in a first phase, and then reviewed by a third, senior annotator in a second phase.

Annotation guidelines

The annotation guidelines were provided to the annotators to reduce heterogeneity in the annotation task.

Source code of a baseline experiment

The source code of the baseline experiment reported in [1] is available in ./baseline_experiment/

In preparation.

License

PGxCorpus is released under the Creative Commons BY-NC 4.0 license.

Acknowledgments

PGxCorpus is supported by the PractiKPharma project (http://practikpharma.loria.fr/), funded by the French National Research Agency (ANR) under grant ANR-15-CE23-0028.
