Summaformers

Code for the Paper 'Summaformers @ LaySumm 20, LongSumm 20' at EMNLP 2020, Scholarly Document Processing Workshop

Abstract

Automatic text summarization has been widely studied as an important task in natural language processing. Traditionally, various feature engineering and machine learning based systems have been proposed for extractive as well as abstractive text summarization. Recently, deep learning based systems, specifically Transformer-based ones, have become immensely popular. Summarization is a cognitively challenging task: extracting summary-worthy sentences is laborious, and expressing semantics briefly in abstractive summarization is complicated. In this paper, we specifically look at the problem of summarizing scientific research papers from multiple domains. We differentiate between two types of summaries, namely, (a) LaySumm: a very short summary that captures the essence of the research paper in layman's terms, avoiding overly specific technical jargon, and (b) LongSumm: a much longer, detailed summary aimed at providing specific insights into the various ideas touched upon in the paper. While leveraging the latest Transformer-based models, our systems are simple, intuitive and based on how specific paper sections contribute to human summaries of the two types described above. Evaluations against gold standard summaries using ROUGE metrics demonstrate the effectiveness of our approach. On blind test corpora, our system ranks first and third for the LongSumm and LaySumm tasks respectively.
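The abstract reports evaluation against gold standard summaries using ROUGE metrics. As a rough illustration of what ROUGE-1 measures, here is a minimal sketch of unigram-overlap precision, recall and F1. It is not the evaluation code from this repository or the official ROUGE toolkit: the function name `rouge_1` is hypothetical, and the sketch assumes lowercased whitespace tokenization with no stemming or stopword handling, so its scores will generally differ from those produced by standard ROUGE implementations.

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> dict:
    """Unigram-overlap precision, recall and F1 (illustrative only).

    Assumption: tokens are lowercased and split on whitespace; real ROUGE
    implementations typically apply further normalization.
    """
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()

    # Clipped overlap: a token is matched at most as many times as it
    # occurs in both the candidate and the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    scores = rouge_1(
        "the model summarizes papers",
        "the model summarizes scientific papers",
    )
    print(scores)
```

In the usage example, every candidate token appears in the reference while one reference token is missed, so precision is higher than recall. For reported numbers, an established ROUGE implementation should be used instead of this sketch.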

BibTeX to cite our work

@inproceedings{ghosh-roy-etal-2020-summaformers,
    title = "Summaformers @ {L}ay{S}umm 20, {L}ong{S}umm 20",
    author = "Ghosh Roy, Sayar  and
      Pinnaparaju, Nikhil  and
      Jain, Risubh  and
      Gupta, Manish  and
      Varma, Vasudeva",
    booktitle = "Proceedings of the First Workshop on Scholarly Document Processing",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.sdp-1.39",
    doi = "10.18653/v1/2020.sdp-1.39",
    pages = "336--343"
}

Folders and files

LaySumm_Generation
Loaders_and_Budget
LongSumm_Processing
LongSumm_Training_Inference
README.md
poster.pdf
