Official code repository of the paper "Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning" (PyTorch implementation). For a description of the models and experiments, see our paper: https://aclanthology.org/2021.conll-1.27/ (published at CoNLL 2021).
- Python 3.5+
- PyTorch (tested with version 1.3.1)
- scikit-learn (tested with version 0.24.2)
- tqdm (tested with version 4.55.1)
- numpy (tested with version 1.17.4)
- RAPIDS (tested with version 0.19)
- seaborn (tested with version 0.11.2)
- Download the CharacterBERT model [1] using the official repository.
- Run the `data_preprocessing.py` script to process the data file `ade_full_spert.json`.
- Run the `offline_graph_creation.py` script to extract the graphs for each sentence.
- Train the CLDR and CLNER models by running the `main.py` script under the corresponding folder. There are two modes: the tuning mode and the final run. If you want to execute only the final run, consult the `final_run_epochs.xlsx` file to find the number of training epochs per split (cross-validation).
- Run the `embeddings_RE_NER_jointly.py` script to extract the trained embeddings.
- Solve the NER and RE tasks by running the KNN classifiers under the `/classification/` folder. The "final run" mode is implemented (a minimal KNN sketch follows this list).
- Run the `evaluation.py` script to extract the evaluation metrics per task. Strict evaluation [2] is used (see the strict-matching sketch after this list).
- For the t-SNE [3] analysis, first run the `dataset_creation.py` script to extract the dataset in a particular format (an .hdf5 file). Then run the `tSNE_*.py` scripts to create the plots (see the plotting sketch after this list).
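As a reference for the classification step, here is a minimal sketch of a KNN classifier over the extracted embeddings, using scikit-learn. The file names `embeddings.npy` and `labels.npy`, the array layout, and the KNN hyperparameters are hypothetical; the scripts under `/classification/` define the actual input format and splits.

```python
# Minimal KNN classification sketch with scikit-learn.
# Assumptions: embeddings and labels are stored as NumPy arrays
# (hypothetical file names); the scripts under /classification/
# define the real input format and the cross-validation splits.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split

X = np.load("embeddings.npy")  # (n_samples, dim) extracted embeddings
y = np.load("labels.npy")      # (n_samples,) NER or RE labels

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

knn = KNeighborsClassifier(n_neighbors=5, metric="cosine")
knn.fit(X_train, y_train)
print("accuracy:", knn.score(X_test, y_test))
```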
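Strict evaluation [2] counts a predicted relation as correct only if the boundaries and types of both argument entities and the relation type all match a gold relation exactly. A sketch of this matching logic, using a hypothetical tuple representation and illustrative ADE-style labels, is:

```python
# Sketch of strict relation-extraction scoring in the sense of [2]:
# a prediction counts as a true positive only if entity boundaries,
# entity types, and the relation type all match the gold annotation.
# The tuple layout (head_span, head_type, tail_span, tail_type, rel_type)
# is a hypothetical representation for this illustration.

def strict_scores(gold, pred):
    gold, pred = set(gold), set(pred)
    tp = len(gold & pred)
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if tp else 0.0
    return precision, recall, f1

gold = [((0, 1), "Drug", (4, 6), "Adverse-Effect", "Adverse-Effect")]
pred = [((0, 1), "Drug", (4, 6), "Adverse-Effect", "Adverse-Effect")]
print(strict_scores(gold, pred))  # (1.0, 1.0, 1.0)
```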
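For the t-SNE plots, a minimal sketch with scikit-learn and seaborn is shown below. Reading the .hdf5 file with `h5py`, the file name, and the dataset keys `embeddings` and `labels` are assumptions for illustration; the `tSNE_*.py` scripts define the actual format.

```python
# Minimal t-SNE [3] plotting sketch with scikit-learn and seaborn.
# Assumptions: the .hdf5 file written by dataset_creation.py can be
# read with h5py and exposes "embeddings" and "labels" datasets
# (hypothetical keys and file name).
import h5py
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.manifold import TSNE

with h5py.File("dataset.hdf5", "r") as f:
    X = f["embeddings"][:]
    labels = [l.decode() if isinstance(l, bytes) else l for l in f["labels"][:]]

X_2d = TSNE(n_components=2, random_state=42).fit_transform(X)
sns.scatterplot(x=X_2d[:, 0], y=X_2d[:, 1], hue=labels, s=10)
plt.savefig("tsne_plot.png", dpi=300)
```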
*** For each execution step, the corresponding config file (under `/configs/`), if any, should be updated accordingly. Importantly, change the `split_id` number before each cross-validation run.
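For example, assuming the config files are JSON with a top-level `split_id` key (an assumption for this sketch; check the actual files under `/configs/` for the exact layout), the split can be updated like this:

```python
# Sketch of updating split_id in a config file before each run.
# Assumptions: the config is a JSON file with a top-level "split_id"
# key; verify against the actual files under /configs/.
import json

config_path = "configs/config.json"  # hypothetical file name
with open(config_path) as f:
    config = json.load(f)

config["split_id"] = 3  # set the current cross-validation split

with open(config_path, "w") as f:
    json.dump(config, f, indent=2)
```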
Please cite our work when using this software.
Theodoropoulos, C., Henderson, J., Coman, A. C., & Moens, M. F. (2021, November). Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning. In Proceedings of the 25th Conference on Computational Natural Language Learning (pp. 337-348).
BibTeX:
@inproceedings{theodoropoulos-etal-2021-imposing,
    title = {Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning},
    author = {Theodoropoulos, Christos and Henderson, James and Coman, Andrei Catalin and Moens, Marie-Francine},
    booktitle = {Proceedings of the 25th Conference on Computational Natural Language Learning},
    month = {nov},
    year = {2021},
    address = {Online},
    publisher = {Association for Computational Linguistics},
    url = {https://aclanthology.org/2021.conll-1.27},
    doi = {10.18653/v1/2021.conll-1.27},
    pages = {337--348}
}

@software{CLDR_CLNER_models,
    author = {Theodoropoulos, Christos},
    title = {Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning},
    url = {https://github.com/christos42/CLDR_CLNER_models},
    version = {main},
    year = {2021}
}
[1] Hicham El Boukkouri, et al. 2020. "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters." In Proceedings of the 28th International Conference on Computational Linguistics, pages 6903–6915.
[2] Bruno Taillé, et al. 2020. "Let's Stop Incorrect Comparisons in End-to-end Relation Extraction!" In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), pages 3689–3701.
[3] Laurens Van der Maaten and Geoffrey Hinton. 2008. "Visualizing Data using t-SNE." Journal of Machine Learning Research, 9(11).