Junaidi, Akmal Lampung – A New Hanwdritten Character Benchmark: Database, Labeling and Recognition. 2011 Proceedings of Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data.
|
Text
junaidi2011-LAN_lampung_char_benchmark.pdf Download (10MB) | Preview |
Abstract
This research paper deals with our effort of creation and recognition of isolated Lampung characters, a script originated from Indonesia. The aim is to describe this new script with all its peculiarities, propose a labeling scheme to manage a large isolated character dataset and finally a recognition scheme based on water reservoir concept. The Lampung script originally descending from Brahmi script is used in Lampung Province and it is close to extinction if no such initiative as ours will direct the focus to this cultural heritage. The collected dataset contains isolated characters coming from fairy tales transcriptions and were annotated with a semi-automatic labeling method using a limited human effort. Our attention is focused not only on the database collection but on recognition as well. For this purpose a water reservoir based feature set is proposed exploiting the different cavities and the subsequent measures of the character shapes. The experimental results (94.27%) prove the efficiency of the method considering a brand new script and feature set.
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Fakultas Matematika dan Ilmu Pengetahuan Alam (FMIPA) > Prodi Ilmu Komputer |
Depositing User: | Mr. Akmal Junaidi |
Date Deposited: | 13 Dec 2021 09:39 |
Last Modified: | 13 Dec 2021 09:39 |
URI: | http://repository.lppm.unila.ac.id/id/eprint/37282 |
Actions (login required)
View Item |