This repo contain a system that was made for a sign language translation model. The model was created using Grey Level Co-ocurrence Level Matrix and CNN Algorithm (GLCM-CNN) for feature extraction and classification respectively. This code is part of a Final Year Project Development Catagory (link to be added or not)
Throughout the development of this repo the following system has been used:
- OS : Windows 10 Home
- Processor : Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz 1.80 GHz
- RAM : 20.0 GB (4.0 GB Built-in | 16.0 Installed)
- GPU : Intel UHD Graphixs 620
- Storage : 1 TB HDD & 500 GB SSD
The following is needed to run this system:
- Python 3.9
- HTML, CSS, Javascript
- Flask (as the server*)
- Python venv (The list of installed dependencies is in requirement.txt**)
* Only use for development server, not suitable for a production server deployment.
** There are some that I end up not that I haven't removed so yeah...
This system contains only 2 feature:
- Sign to Text Translator (Input: Upload)
- Fingerspelling
The way how the translation work
- The image is uploaded
- It is passed to the flask server and is run through the GLCM Preprocessing which output an 1D array of the 24 GLCM value.
- This is then passed back to the Javascript and is fed into the Model prediction method.
- The returning value act as an index to print out what Alphabet it was matched to
Home Page Navigation bar Fingerspelling page Fingerspelling page(select sign) Translate Page Translate Page(Translating the Gesture A) About Page
- Can only make prediction by upload because it is unable to make inference in real time due to the need of sending the video feed to the server to preprocess before passing it through the tensoflow.js model prediction method. Might be able to solve if we integrate the GLCM into a preprocessing layer.
- Due to the dataset being a video that is just chopped into the frame, there is not much of a diversity in picture / background. Thus the model is bad at making prediction with picture that has a complex or a different background. Maybe a larger dataset with more variety might help.
- The model can only detect the sign if it is the main focus of the picture. Maybe adding a regression to detect the position might solve this problem.
- The fingerspelling asset are directly in the folder, might cause problem once the rage of sign is expanded.
- Have added a dark mode button but haven't really stylized the whole page for dark mode
- The model itself (TBA)
- Tensorflow.js