Skip to content

A Natural Language Enabled for Pharmaceutical research data. We aim to easily and efficiently find any search results using a word2vec encoder

Notifications You must be signed in to change notification settings

prabhav-pandya/PharmaSearch

 
 

Repository files navigation

TitleBanner

Introduction

Python-3.8 gensim-3.8.3 NLTK Flask PR PR

Dataset

Research articles from various journals like for example, International Journal of Pharmaceutical Sciences, The National Medical Journal of India, La Revue de Médecine Interne, MDPI Journals, RPS Journal and so on

Approach

  • Create shared vector space among word2vec representations of articles and search phrases
  • Make a seq2seq model to summarize and encode Research text documents
  • Find way to map research paper vectors to search phrase vectors
  • Create search engine using 1, 2 and 3
  • Build UI to house the search engine

Technology Used

  • Python
  • gensim
  • NLTK
  • Gensim: Gensim is a library used to develop scalable word2vec or doc2vec models which we would need to create a shared vector space for the input strings as well as the documents fed to it. It also comes packaged with several standard word2vec models which we would need for general vocabulary in our search.

About Us

  • Omkar Prabhune
  • Prabhav Pandya
  • Pritesh Pawar
  • Pranav Tambaku

About

A Natural Language Enabled for Pharmaceutical research data. We aim to easily and efficiently find any search results using a word2vec encoder

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.7%
  • Python 0.3%