Skip to content

gaganmanku96/CPAR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CPAR Dataset

Introduction

CPAR Dataset consists of Devanagri(Hindi) Characters and Digits. This Dataset was collected by Mr. Rajiv Kumar. It took almost 3 years to collect this dataset. He collected data from writers belonging to diverse population strata. They belonged to different age groups (from 6 to 77 years), genders, educational backgrounds (from 3rd grade to postgraduate levels), professions (software engineers, professors, students, accountants, housewives and retired persons), regions (Indian states: Bihar, Uttar Pradesh, Haryana, Punjab, National Capital Region (NCR), Madhya Pradesh, Karnataka, Kerala, Rajasthan, and countries: Nigeria, China and Nepal).

Quick Tour

CPAR contains 48 Characters and 9 Digits.

alt rating_to_insight alt rating_to_insight

Installation

Using PIP via PyPI

$ pip install cpar

Manually via GIT

$ git clone git://github.com/gaganmanku96/cpar
$ cd cpar
$ python setup.py install

Usage

Character Dataset

$ from cpar.char import load_data
$ (x_train, y_train), (x_test, y_test) = load_data()

Digit Dataset

$ from cpar.digit import load_data
$ (x_train, y_train), (x_test, y_test) = load_data()

Citations

  1. Kumar Rajiv, Amresh Kumar, and Pervez Ahmed. "A benchmark dataset for devnagari document recognition research." In 6th International Conference on Visualization, Imaging and Simulation (VIS'13), Lemesos, Cyprus, pp. 258-263. 2013.
  2. Kumar, Rajiv, and Kiran Kumar Ravulakollu. "HANDWRITTEN DEVNAGARI DIGIT RECOGNITION: BENCHMARKING ON NEW DATASET." Journal of theoretical & applied information technology 60, no. 3 (2014).

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages