Junaidi, Akmal A Semi-Supervised Ensemble Learning Approach for Character Labeling with Minimal Human Effort. 2011 International Conference on Document Analysis and Recognition.


Download (4MB) | Preview


One of the major issues in handwritten character recognition is the efficient creation of ground truth to train and test the different recognizers. The manual labeling of the data by a human expert is a tedious and costly procedure. In this paper we propose an efficient and low-cost semi-automatic labeling system for character datasets. First, the data is represented in different abstraction levels, which is clustered after in an unsupervised manner. The different clusters are labeled by the human experts and finally an unanimity voting is considered to decide if a label is accepted or not. The experimental results prove that labeling only less than 0.5% of the training data is sufficient to achieve 86.21% recognition rate for a brand new script (Lampung) and 94.81% for the MNIST benchmark dataset, considering only a K-nearest neighbor classifier for recognition.

Item Type: Article
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Fakultas Matematika dan Ilmu Pengetahuan Alam (FMIPA) > Prodi Ilmu Komputer
Depositing User: Mr. Akmal Junaidi
Date Deposited: 06 Dec 2021 07:03
Last Modified: 06 Dec 2021 07:03
URI: http://repository.lppm.unila.ac.id/id/eprint/36585

Actions (login required)

View Item View Item