Information-Retrieval-System

Introduction
PRD: Information Retrieval System
API Documentation
- Search
How to run and Install
Technologies and Tools

Introduction

This is a simple information retrieval system for searching documents in a corpus. It is based on the positional index data structure and is built with Node.js and Express.js. It was developed as part of the course CS F313 Data Storage and Retrieval in the Software Engineering program at Helwan University, 2023/2024.
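To make the positional index idea concrete, here is a minimal sketch of how such an index could be built and used for phrase matching. It is not taken from this repository's source: the names `tokenize`, `buildIndex`, and `phraseSearch`, the tiny stop-word list, and the in-memory `Map` layout are all assumptions for illustration, and Porter stemming is left out to keep the example dependency-free.

```typescript
// Hypothetical names throughout; not the repository's actual implementation.
// Layout: term -> docId -> list of token positions in that document.
type PositionalIndex = Map<string, Map<number, number[]>>;

// Tiny illustrative stop-word list (an assumption, not the project's list).
const STOP_WORDS = new Set(["a", "an", "the", "is", "of", "and", "to", "in"]);

// Split on whitespace, lowercase, strip punctuation, drop stop words.
// Stemming is intentionally omitted in this sketch.
function tokenize(text: string): string[] {
  return text
    .split(/\s+/)
    .map((t) => t.toLowerCase().replace(/[^a-z0-9]/g, ""))
    .filter((t) => t.length > 0 && !STOP_WORDS.has(t));
}

// Record, for every term, the positions at which it occurs in each document.
function buildIndex(docs: string[]): PositionalIndex {
  const index: PositionalIndex = new Map();
  docs.forEach((doc, docId) => {
    tokenize(doc).forEach((term, pos) => {
      if (!index.has(term)) index.set(term, new Map());
      const postings = index.get(term)!;
      if (!postings.has(docId)) postings.set(docId, []);
      postings.get(docId)!.push(pos);
    });
  });
  return index;
}

// Return ids of documents where the query terms occur at consecutive positions.
function phraseSearch(index: PositionalIndex, query: string): number[] {
  const terms = tokenize(query);
  if (terms.length === 0) return [];
  const first = index.get(terms[0]);
  if (!first) return [];
  const hits: number[] = [];
  first.forEach((positions, docId) => {
    // A start position matches if term i is found at start + i for every i.
    const match = positions.some((start) =>
      terms.every(
        (term, offset) =>
          index.get(term)?.get(docId)?.includes(start + offset) ?? false
      )
    );
    if (match) hits.push(docId);
  });
  return hits;
}
```

Because positions are assigned after stop words are removed, a phrase matches when its remaining terms are adjacent in the filtered token stream. A real implementation would also stem both the documents and the query before indexing and lookup.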

PRD: Information Retrieval System

✅ Tokenization should be done using the following rules:
- ✅ Splitting on whitespace.
- ✅ Removing punctuation.
- ✅ Removing stop words.
- ✅ Stemming using the Porter Stemmer algorithm.
✅ Auxiliary structure(s) (a positional index) should be constructed to speed up the search process.
✅ Phrase query search should be supported using the positional index.
✅ Term frequency and inverse document frequency should be used to rank the documents in the corpus.
✅ IDF smoothing should be applied when ranking the documents.
✅ The TF-IDF matrix should be normalized using the cosine normalization technique.
✅ Similarity between query and each document should be computed using the cosine similarity measure.
✅ Boolean query search should be supported using the positional index.
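
The ranking requirements above (TF-IDF weighting, IDF smoothing, cosine normalization, cosine similarity) can be sketched as follows. This is an illustration under stated assumptions, not the project's code: the function names are hypothetical, term frequency is taken as a raw count, and the smoothing variant ln((1 + N) / (1 + df)) + 1 is one common choice that the project may or may not use. Inputs are assumed to be already tokenized.

```typescript
// Hypothetical helper names; weighting choices are assumptions (see lead-in).
type Vector = Map<string, number>;

// Raw term counts for a list of already-tokenized terms.
function termFrequencies(tokens: string[]): Vector {
  const tf: Vector = new Map();
  tokens.forEach((t) => tf.set(t, (tf.get(t) ?? 0) + 1));
  return tf;
}

// Smoothed IDF: ln((1 + N) / (1 + df)) + 1 (assumed variant).
function smoothedIdf(docs: string[][]): Vector {
  const df: Vector = new Map();
  docs.forEach((doc) => {
    new Set(doc).forEach((t) => df.set(t, (df.get(t) ?? 0) + 1));
  });
  const idf: Vector = new Map();
  df.forEach((n, t) => idf.set(t, Math.log((1 + docs.length) / (1 + n)) + 1));
  return idf;
}

// TF-IDF weights scaled to unit Euclidean length (cosine normalization).
// Terms absent from the corpus get weight 0 and are dropped.
function normalizedTfIdf(tokens: string[], idf: Vector): Vector {
  const weights: Vector = new Map();
  termFrequencies(tokens).forEach((f, t) => {
    const w = f * (idf.get(t) ?? 0);
    if (w > 0) weights.set(t, w);
  });
  const norm = Math.sqrt(
    Array.from(weights.values()).reduce((s, w) => s + w * w, 0)
  );
  weights.forEach((w, t) => weights.set(t, w / norm));
  return weights;
}

// For unit-length vectors, the dot product is the cosine similarity.
function cosine(a: Vector, b: Vector): number {
  let dot = 0;
  a.forEach((w, t) => {
    dot += w * (b.get(t) ?? 0);
  });
  return dot;
}

// Score every document against the query, highest similarity first.
function rank(
  docs: string[][],
  query: string[]
): { docId: number; score: number }[] {
  const idf = smoothedIdf(docs);
  const q = normalizedTfIdf(query, idf);
  return docs
    .map((doc, docId) => ({ docId, score: cosine(q, normalizedTfIdf(doc, idf)) }))
    .sort((x, y) => y.score - x.score);
}
```

For example, `rank([["fox", "quick"], ["dog", "lazi"], ["fox", "fox", "dog"]], ["fox"])` places document 2 first, since "fox" dominates its vector, then document 0, and gives document 1 a score of 0.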

API Documentation

Search

`GET` /search

body: {
    query: string
}

How to run and Install

# Clone the repo
git clone https://github.com/Adosh74/Information-Retrieval-System

# Install dependencies
yarn install
or
npm install

# Run the server
yarn start

# Run the server in development mode
yarn start:dev

After the server starts, the API is available at http://localhost:3001, and all PRD tables are printed to the console.

Technologies and Tools

Node.js
Express.js
TypeScript
