GPADRlex: Grouped Phrasal Adverse Drug Reaction lexicon

This repository contains supporting data and code used in our paper titled "GPADRlex: Grouped Phrasal Adverse Drug Reaction lexicon" that is published at "2019 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC 2019) ".

Files/Folders details

Data/grouped_phrasal_ADR_lexicon:

GPADRlex consists of 19,585 phrasal ADRs compiled from FAERS and we grouped the phrases representing similar ADRs together based on semantic similarity using agglomerative hierarchical clustering. This folder contains GPADRlex for different values of k (number of clusters).

Data/benchmark_clusters.txt

This file contains benchmark lexicon used in this study. Benchmark lexicon lexicon consist of 890 hand curated groups derived from a subset of ADR lexicon compiled by Nikfarjam et al.. Each line consist of all the phrases that represent the similar ADR and are grouped together.

Data/FAERS_all_ADRs.csv

This file contains a large phrasal ADR lexicon containing 19,585 phrasal ADRs, that we have compiled ADR reported in FAERS.

src/get_nclusters.py

This file contains a python script for generating user-defined number of clusters.

Instruction for using src/get_nclusters.py

Perform the following steps to run the get_nclusters.py

clone the GPADRlex with git clone https://github.com/HammadFarooq/GPADRlex.git
Download the distance matrix (3.1 GB) from the drive link in GPADRlex/Data folder
move to "GPADRlex/src" folder with cd GPADRlex/src
run the command python get_n_clusters.py "single" 100. where "single" is the linkage criteria and 100 is the value of k (number of clusters). Linkage can be one of the following: 'single', 'complete', 'average' ,'weighted', 'centroid', 'median' and 'ward'.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Data		Data
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPADRlex: Grouped Phrasal Adverse Drug Reaction lexicon

Files/Folders details

Data/grouped_phrasal_ADR_lexicon:

Data/benchmark_clusters.txt

Data/FAERS_all_ADRs.csv

src/get_nclusters.py

Instruction for using src/get_nclusters.py

About

Releases

Packages

Languages

HammadFarooq/GPADRlex

Folders and files

Latest commit

History

Repository files navigation

GPADRlex: Grouped Phrasal Adverse Drug Reaction lexicon

Files/Folders details

Data/grouped_phrasal_ADR_lexicon:

Data/benchmark_clusters.txt

Data/FAERS_all_ADRs.csv

src/get_nclusters.py

Instruction for using src/get_nclusters.py

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages