Data Science Institute x Disability Research Network: A UTS HASS-DSI Research Project

Introduction

This repository contains work conducted in collaboration with the Data Science Institute (DSI) and the Disability Research Network (DRN) at the University of Technology Sydney (UTS).

The project involves preprocessing textual data from two Royal Commissions, "Aged Care Quality and Safety" and "Violence, Abuse, Neglect and Exploitation of People with Disability", and using natural language processing (NLP) techniques to improve document search functionality. As a first step, we attempted to build a document-fetching algorithm designed to minimise the time a user spends searching for relevant information.
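
To give a rough idea of how a document-fetching step can rank text against a query, here is a minimal BM25 sketch in Python. It is an illustration only and is not taken from this repository's BM25 script: the function names, the whitespace tokenizer, the sample documents and the parameter values `k1=1.5` and `b=0.75` are all assumptions.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: simple lowercase whitespace tokenisation, no stemming or stop-word removal.
    return text.lower().split()


def bm25_scores(query, documents, k1=1.5, b=0.75):
    """Return one BM25 score per document for the given query."""
    if not documents:
        return []
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    avg_len = sum(len(doc) for doc in tokenized) / n_docs

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for doc in tokenized:
        doc_freq.update(set(doc))

    query_terms = tokenize(query)
    scores = []
    for doc in tokenized:
        term_freq = Counter(doc)
        score = 0.0
        for term in query_terms:
            if term not in term_freq:
                continue
            df = doc_freq[term]
            # Smoothed inverse document frequency (always non-negative).
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            tf = term_freq[term]
            # Term-frequency saturation with document-length normalisation.
            score += idf * tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(doc) / avg_len))
        scores.append(score)
    return scores


def rank_documents(query, documents, top_k=3):
    """Return (index, score) pairs for the top_k highest-scoring documents."""
    scores = bm25_scores(query, documents)
    order = sorted(range(len(documents)), key=lambda i: scores[i], reverse=True)
    return [(i, scores[i]) for i in order[:top_k]]


# Hypothetical sample documents, for illustration only.
sample_docs = [
    "aged care quality and safety hearing",
    "disability abuse and neglect submissions",
    "weather report for sydney",
]
ranked = rank_documents("aged care", sample_docs)
```

Here `ranked` lists the documents in descending order of score, so the document that mentions both query terms comes first and documents sharing no terms with the query score zero.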

Our research spans several implementations of NLP techniques on this data, including common deep-learning models such as BERT and GPT-3. Most of our work is showcased in this repository so that you can browse it and understand both the advantages and drawbacks of applying such models to this particular use case.

We hope that, with further research and development, these automated tools will help legal professionals and the general public access legal information more efficiently. A warm thank you to Adam Berry and Linda Steel, who co-supervised this area of research and kindly gave permission to make these findings publicly available.

Feel free to test the current version of this experiment (created using Streamlit). We recommend uploading a data file that we have already processed so that our code can read it successfully. You can also adjust the temperature of the GPT-3 response, which controls how much randomness is in the output. Note that we are not responsible for the output of the GPT-3 model; there have been reports of the model generating inappropriate content.
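
To make the temperature setting more concrete, the sketch below shows how temperature is commonly applied in language models: the model's raw scores (logits) are divided by the temperature before being turned into probabilities. This is a general illustration, not the GPT-3 API or this repository's code; the function name and the example logits are hypothetical.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert raw scores into probabilities, rescaled by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [value / temperature for value in logits]
    # Subtract the maximum before exponentiating for numerical stability.
    highest = max(scaled)
    exps = [math.exp(value - highest) for value in scaled]
    total = sum(exps)
    return [value / total for value in exps]


# Hypothetical scores for three candidate next words.
example_logits = [2.0, 1.0, 0.1]

low_temp = softmax_with_temperature(example_logits, temperature=0.2)
high_temp = softmax_with_temperature(example_logits, temperature=2.0)
```

With a low temperature, almost all of the probability goes to the highest-scoring candidate, so the output is more predictable. With a high temperature, the probabilities move closer together, so sampling from them gives more varied output.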

Contents

Data Preprocessing
Deep Learning Implementation
Importance of Data Preprocessing
BM25 (Retrieval Function).py
Exploratory Data Analysis (EDA)
LICENSE
README.md
application.py
requirements.txt
test_final.txt
