Skip to content

opencitations/cec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Citation Extraction and Classifier

The Citation Extraction and Classifier is a software that performs the automatic annotation of in-text citations in academic papers provided in PDF.

It is developed within the GraspOS EU project.

It works by applying two steps, described as follows:

  • PDF Parsing. The software analyses the PDF paper provided as input and extracts its basic bibliographic metadata, all the bibliographic references with all its metadata marked up, the citation sentences that contain in-text reference pointers , and other structural information such as sections, when possible
  • Citation Function Classification. The software uses the output of the previous step to classify the semantics emerging from each citation sentence that will be used for characterising the function of the citation defined by the authors of the citing paper (i.e., the input PDF) by means of the related in-text reference pointer.

Please find more details in the README files of each module: extractor and classifier.