PyTorch implementation combining the Tree-LSTM network from "Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks" (http://arxiv.org/abs/1503.00075, Kai Sheng Tai et al.) with the self-attention mechanism from "A Structured Self-Attentive Sentence Embedding" (https://arxiv.org/pdf/1703.03130.pdf, Zhouhan Lin et al.).
The model achieves roughly the same test accuracy (86.85%) on the SICK dataset as the Tree-LSTM alone (86.76%), while additionally providing a way to inspect how the network learns semantics and to compress information via the learned attention weights. State-of-the-art results reach 88.5% using transfer learning (https://arxiv.org/pdf/1705.02364.pdf).
A write up of this work can be found at https://journals.mcmaster.ca/mjep/article/view/1627/1230.
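As a rough illustration of how the two pieces fit together, the sketch below applies the multi-hop structured self-attention of Lin et al. over the hidden states produced at the nodes of a Tree-LSTM. The module name, dimensions (`hidden_dim`, `att_dim`, `num_hops`), and the assumption that per-node states are stacked into a single matrix are placeholders for illustration, not the exact code in this repository.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class StructuredSelfAttention(nn.Module):
    """Multi-hop self-attention (Lin et al., 2017) over the hidden states
    produced at each node of a Tree-LSTM. Names and dimensions are
    illustrative placeholders."""

    def __init__(self, hidden_dim, att_dim, num_hops):
        super(StructuredSelfAttention, self).__init__()
        self.ws1 = nn.Linear(hidden_dim, att_dim, bias=False)  # W_s1 in the paper
        self.ws2 = nn.Linear(att_dim, num_hops, bias=False)    # W_s2 in the paper

    def forward(self, node_states):
        # node_states: (num_nodes, hidden_dim), one row per tree node (H in the paper)
        scores = self.ws2(torch.tanh(self.ws1(node_states)))   # (num_nodes, num_hops)
        # A = softmax over nodes, one attention distribution per hop
        attention = F.softmax(scores.t(), dim=1)                # (num_hops, num_nodes)
        # M = A H, the fixed-size sentence embedding matrix
        embedding = attention.mm(node_states)                   # (num_hops, hidden_dim)
        return embedding, attention
```

Lin et al. additionally penalize ||A A^T - I||_F^2 so that different hops attend to different nodes; the returned attention matrix is also what makes the per-node inspection and information compression mentioned above possible.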
- Python 2.7 (tested on 2.7.12)
- PyTorch (tested on 0.1.12)
- tqdm
- Java >= 8 (for Stanford CoreNLP utilities)
- First run the script `./fetch_and_preprocess.sh`, which, as the name suggests, does two things:
    - Fetch data, such as:
        - SICK dataset (semantic relatedness task)
        - GloVe word vectors (Common Crawl 840B) -- Warning: this is a 2GB download!
        - Stanford Parser and Stanford POS Tagger
    - Preprocess data, i.e. generate dependency parses using the Stanford Neural Network Dependency Parser (one way to read those parses back as trees is sketched after this list).
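For orientation, here is a minimal sketch of turning one preprocessed dependency parse back into a tree that the child-sum Tree-LSTM can recurse over. The assumed file format, one sentence per line of space-separated 1-based parent indices with 0 marking the root, is only an illustration; check the files the script actually writes.

```python
class Tree(object):
    """Minimal node type for holding a dependency parse."""
    def __init__(self, idx):
        self.idx = idx          # token position in the sentence
        self.parent = None
        self.children = []

def read_tree(parent_line):
    """Build a tree from one line of space-separated parent indices,
    e.g. '2 0 2' -> token 2 is the root, tokens 1 and 3 attach to it.
    (The format is an assumption for illustration.)"""
    parents = [int(p) for p in parent_line.split()]
    nodes = [Tree(i) for i in range(len(parents))]
    root = None
    for i, p in enumerate(parents):
        if p == 0:              # 0 marks the root token
            root = nodes[i]
        else:                   # otherwise attach to its (1-based) parent
            nodes[i].parent = nodes[p - 1]
            nodes[p - 1].children.append(nodes[i])
    return root
```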
Thanks to Riddhiman Dasgupta and Haoyue Shi for their open-source PyTorch implementations of the dependency Tree-LSTM and the structured self-attention mechanism, respectively:
- https://github.com/dasguptar/treelstm.pytorch
- https://github.com/ExplorerFreda/Structured-Self-Attentive-Sentence-Embedding
MIT