README

This repository contains the code and data for "Understanding Jargon: Combining Extraction and Generation for Definition Modeling" (EMNLP '22).

Introduction

We propose combining extraction and generation for jargon definition modeling: we first extract self- and correlative definitional information about the target jargon from the Web, then generate the final definition by incorporating the extracted information. The framework is simple but effective: in our experiments it generates high-quality definitions for jargon and significantly outperforms state-of-the-art models, e.g., raising the BLEU score from 8.76 to 22.66 and the human-annotated score from 2.34 to 4.04.
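As a rough illustration of the extract-then-generate idea described above, the Python sketch below keeps the highest-scoring candidate sentences for a term and concatenates them with the term into a single text input that a generation model could consume. Everything in it is an assumption made for illustration: the `Candidate` class, the scores, the `select_top` and `build_generator_input` helpers, and the ` [SEP] ` separator are hypothetical and are not taken from the code in ./extraction/ or ./generation/.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    """A sentence retrieved for a term, with a hypothetical relevance score."""
    sentence: str
    score: float  # assumed to come from some extractor; higher is better


def select_top(candidates: List[Candidate], k: int = 2) -> List[str]:
    """Keep the k highest-scoring candidate sentences (the extraction step)."""
    ranked = sorted(candidates, key=lambda c: c.score, reverse=True)
    return [c.sentence for c in ranked[:k]]


def build_generator_input(
    term: str,
    self_info: List[str],
    correlative_info: List[str],
    sep: str = " [SEP] ",  # hypothetical separator, not the repo's format
) -> str:
    """Join the term with the extracted sentences into one generator input."""
    return sep.join([term, *self_info, *correlative_info])


if __name__ == "__main__":
    # Toy candidates about the target term itself (self-definitional).
    self_candidates = [
        Candidate("A hash table maps keys to values.", 0.9),
        Candidate("Hash tables were discussed in the lecture.", 0.1),
    ]
    # Toy candidates about a related term (correlative definitional).
    related_candidates = [
        Candidate("A hash function maps data to fixed-size values.", 0.8),
    ]
    text = build_generator_input(
        "hash table",
        select_top(self_candidates, k=1),
        select_top(related_candidates, k=1),
    )
    print(text)
```

The resulting string would then be passed to whatever sequence-to-sequence model performs the generation step; the actual input format and models are documented in the README.md files under ./extraction/ and ./generation/.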

Usage

Please refer to the detailed README.md files in ./extraction/ and ./generation/.

Data

Data can be downloaded from Google Drive

Generated definitions

The generated definitions are stored in ./sample/generated_definition_for_cs_term.txt

Citation

The details of this repo are described in the following paper. If you find this repo useful, please kindly cite it:

@inproceedings{huang2022understanding,
  title={Understanding Jargon: Combining Extraction and Generation for Definition Modeling},
  author={Huang, Jie and Shao, Hanyin and Chang, Kevin Chen-Chuan and Xiong, Jinjun and Hwu, Wen-mei},
  booktitle={Proceedings of EMNLP},
  year={2022}
}
