Pre-process Arabic text (remove diacritics, punctuation, and repeating characters)
Usage: clean_arabic_text.py [-h] -i INFILE -o OUTFILE
Pre-process arabic text (remove diacritics, punctuations, and repeating
characters).
optional arguments:
  -h, --help            show this help message and exit
  -i INFILE, --infile INFILE
                        input file.
  -o OUTFILE, --outfile OUTFILE
                        out file.
python clean_arabic_text.py -i infile.txt -o outfile.txt
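The three cleaning steps named above (diacritics, punctuation, repeating characters) could look roughly like the sketch below. This is not the code in `clean_arabic_text.py`; the function names, the character ranges, and the rule of collapsing every run of a repeated character to a single one are all assumptions made for illustration.

```python
import re
import string

# Assumed character sets; the actual script may use different ones.
# U+064B..U+0652 are the common tashkeel marks, U+0640 is the tatweel (kashida).
ARABIC_DIACRITICS = re.compile(r"[\u064B-\u0652\u0640]")
# Arabic comma, semicolon, and question mark, added to ASCII punctuation.
ARABIC_PUNCTUATION = "\u060C\u061B\u061F"
PUNCTUATION_TABLE = str.maketrans("", "", string.punctuation + ARABIC_PUNCTUATION)
# Any character followed by one or more copies of itself.
REPEATED_CHARS = re.compile(r"(.)\1+")


def remove_diacritics(text):
    """Delete tashkeel marks and tatweel (hypothetical helper)."""
    return ARABIC_DIACRITICS.sub("", text)


def remove_punctuation(text):
    """Delete ASCII and common Arabic punctuation (hypothetical helper)."""
    return text.translate(PUNCTUATION_TABLE)


def remove_repeating_chars(text):
    """Collapse each run of a repeated character to one (hypothetical helper)."""
    return REPEATED_CHARS.sub(r"\1", text)


def clean_text(text):
    """Apply the three steps in order: diacritics, punctuation, repeats."""
    text = remove_diacritics(text)
    text = remove_punctuation(text)
    return remove_repeating_chars(text)
```

Note that collapsing every run this aggressively also shortens legitimately doubled letters and repeated spaces, so a real pipeline may prefer a gentler rule, such as only shortening runs of three or more.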
Contributions that improve the code are welcome. Please follow the steps below.
- Fork the project.
- Modify the code and test it to make sure that it works.
- Make a pull request.
Please consult the GitHub documentation if you need help with these steps.