Complex Assignments

Introduction

This repository contains the code for our paper "Finding Black Cat in a Coal Cellar - Keyphrase Extraction & Keyphrase-Rubric Relationship Classification from Complex Assignments", an empirical study of keyphrase extraction and generic/specific keyphrase-rubric relationship classification.

Dependencies

Python 3 (tested on Python 3.7)
Jupyter Notebook
Scikit-learn
Transformers (Hugging Face)
pke
eli5

Installation

First, clone the repository:

git clone https://github.com/manikandan-ravikiran/complex-assignments.git
cd complex-assignments

Install the requirements:

pip install -r requirements.txt
pip install git+https://github.com/boudinfl/pke.git
pip install spacy
python -m nltk.downloader stopwords
python -m nltk.downloader universal_tagset
python -m spacy download en # download the English model
pip install en-core-web-sm
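
After installation, a quick check can confirm that the listed dependencies are importable. The following is a minimal sketch, not part of the repository; the import names in REQUIRED_MODULES are assumptions derived from the Dependencies list (for example, Scikit-learn is imported as sklearn) and may need adjusting for your environment.

```python
import importlib.util

# Import names assumed from the Dependencies list; adjust as needed.
REQUIRED_MODULES = ["sklearn", "transformers", "pke", "eli5", "nltk", "spacy"]


def find_missing(modules):
    """Return the top-level module names that cannot be found on the import path."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules were found.")
```

The check only locates each module without importing it, so it does not verify versions or downloaded language models.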

NOTE: The datasets are already processed; features have been extracted and pickled for replication purposes. Due to privacy restrictions, we do not release any datasets in raw format. If you need the data for research purposes, please send an email to mravikiran3@gatech.edu with details of your research.
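
To inspect one of the pickled feature files outside the notebooks, something like the sketch below may help. It is illustrative only: the helper names load_pickle and describe are hypothetical, and the structure of the stored objects is not documented here, so the summary is deliberately generic.

```python
import pickle
from pathlib import Path


def load_pickle(path):
    """Load a single pickled object from disk. Only unpickle files you trust."""
    with Path(path).open("rb") as handle:
        return pickle.load(handle)


def describe(obj):
    """Summarize a loaded object by its type name and, where defined, its length."""
    try:
        size = len(obj)
    except TypeError:
        size = None
    return {"type": type(obj).__name__, "length": size}
```

For example, describe(load_pickle(path)) could be called with path pointing at a pickle file inside one of the data subfolders; the actual file names are not listed here and must be looked up in the repository.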

The code is provided as IPython (Jupyter) notebooks, which you can launch directly in Binder by clicking the Binder build icon. Because of GPU requirements and dataset privacy, Binder can run only a few of the RQ2.2 experiments; for a full run, use a separate machine with a GPU.

Code Organization

The code is organized by the research questions in the paper "Finding Black Cat in a Coal Cellar - Keyphrase Extraction & Keyphrase-Rubric Relationship Classification from Complex Assignments".
Each RQx folder (RQ1.1, RQ1.2, ..., RQ3.2) answers one or more of the research questions and contains two subfolders, code and data. The code in each RQx folder can be run on its own, with no dependencies on other folders in the project, so each folder is self-contained.
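
The layout described above can be explored with a short script. This is a sketch under the assumption that the notebooks sit directly inside each folder's code subfolder; the function name notebooks_by_question is hypothetical and not part of the repository.

```python
from pathlib import Path


def notebooks_by_question(repo_root):
    """Map each RQ* folder name to the sorted notebook file names in its code/ subfolder."""
    root = Path(repo_root)
    result = {}
    for folder in sorted(root.glob("RQ*")):
        code_dir = folder / "code"
        # Skip anything matching RQ* that lacks a code/ subfolder.
        if code_dir.is_dir():
            result[folder.name] = sorted(p.name for p in code_dir.glob("*.ipynb"))
    return result


if __name__ == "__main__":
    for question, notebooks in notebooks_by_question(".").items():
        print(question, notebooks)
```

Run from the repository root, it prints one line per RQx folder with the notebooks found there.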

Result Reproducibility & Execution

The table below maps the result tables in "Finding Black Cat in a Coal Cellar - Keyphrase Extraction & Keyphrase-Rubric Relationship Classification from Complex Assignments" to the code that produces them.
Each notebook can be run individually, with no cross-dependencies. To reproduce a result, execute the corresponding Jupyter notebook listed below. Each notebook also includes comments and the information necessary for execution.

Results from Paper	Code Folder
Table 4 (KEA/WINGNUS)	Execute this
Table 4 (KPMINER/YAKE)	Execute this
Table 4 (Ranking)	Execute this
Table 4 (KEA)	Execute this
Table 4 (Multipartite)	Execute this
Table 7 (K-Means)	Execute this
Table 8 (Agglomerative)	Execute this
Table 9 (Spectral)	Execute this
Table 10 (Latent Dirichlet Allocation)	Execute this
Table 12 (BOW/TF-IDF)	Execute this
Table 12 (Language Models)	Execute this
Table 14 (Interpretability - BERT)	Execute this
Table 15 (Interpretability - SVM+TFIDF)	Execute this
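
The notebooks listed above can also be executed without opening the Jupyter interface, which may be convenient on a remote GPU machine. The sketch below wraps the standard jupyter nbconvert command line; the helper names and the default output file name are hypothetical, and the notebook path must be replaced with a real one from the relevant RQx folder.

```python
import subprocess


def nbconvert_command(notebook_path, output_name="executed.ipynb"):
    """Build the command that executes a notebook and saves the executed copy."""
    return [
        "jupyter", "nbconvert",
        "--to", "notebook",
        "--execute", str(notebook_path),
        "--output", output_name,
    ]


def run_notebook(notebook_path, output_name="executed.ipynb"):
    """Execute the notebook; raises CalledProcessError if execution fails."""
    subprocess.run(nbconvert_command(notebook_path, output_name), check=True)
```

Executing a copy rather than overwriting the original keeps the committed notebook outputs available for comparison with the reproduced results.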

Cite

If you find this repo useful in your research, please consider citing the following papers:

@article{Ravikiran2020FindingBC,
  title={Finding Black Cat in a Coal Cellar - Keyphrase Extraction & Keyphrase-Rubric Relationship Classification from Complex Assignments},
  author={Manikandan Ravikiran},
  journal={ArXiv},
  year={2020},
  volume={abs/2004.01549}
}

@article{Ravikiran2020KeyPC,
  title={Key Phrase Classification in Complex Assignments},
  author={Manikandan Ravikiran},
  journal={ArXiv},
  year={2020},
  volume={abs/2003.07019}
}

