Junaidi, Akmal A Semi-Supervised Ensemble Learning Approach for Character Labeling with Minimal Human Effort. 2011 International Conference on Document Analysis and Recognition.
|
Text
vajda2011-ASS_semi_sup_char_label_penges.pdf Download (4MB) | Preview |
Abstract
One of the major issues in handwritten character recognition is the efficient creation of ground truth to train and test the different recognizers. The manual labeling of the data by a human expert is a tedious and costly procedure. In this paper we propose an efficient and low-cost semi-automatic labeling system for character datasets. First, the data is represented in different abstraction levels, which is clustered after in an unsupervised manner. The different clusters are labeled by the human experts and finally an unanimity voting is considered to decide if a label is accepted or not. The experimental results prove that labeling only less than 0.5% of the training data is sufficient to achieve 86.21% recognition rate for a brand new script (Lampung) and 94.81% for the MNIST benchmark dataset, considering only a K-nearest neighbor classifier for recognition.
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Fakultas Matematika dan Ilmu Pengetahuan Alam (FMIPA) > Prodi Ilmu Komputer |
Depositing User: | Mr. Akmal Junaidi |
Date Deposited: | 06 Dec 2021 07:03 |
Last Modified: | 06 Dec 2021 07:03 |
URI: | http://repository.lppm.unila.ac.id/id/eprint/36585 |
Actions (login required)
View Item |