Permata, Permata and Abidin, Zaenal and Ariyani, Farida (2020) Efek Peningkatan Jumlah Paralel Korpus Pada Penerjemahan Kalimat Bahasa Indonesia ke Bahasa Lampung Dialek Api. Jurnal Komputasi, 8 (2). pp. 41-49. ISSN 2541-0350
|
Text (Article)
2613-6365-1-PB.pdf - Published Version Download (264kB) | Preview |
|
|
Text (Similarity Report)
Efek Peningkatan Jumlah Paralel Korpus Pada Penerjemahan Kalimat Bahasa Indonesia ke Bahasa Lampung Dialek Api.pdf Download (2MB) | Preview |
Abstract
Experimental observations of the effect of the number of parallel corpus on Indonesian translation into the Lampung dialect api were carried out using the statistical machine translation (SMT) method. SMT utilizes a parallel Indonesian corpus and its translation in the Lampung dialect api as a material for training data. The research strategy was carried out in three ways, namely first strategy with a corpus parallel number of 1000 sentences, the second strategy with a corpus parallel number of 2000 and the third strategy with a corpus parallel number of 3000 sentences. The research starts from the preprocessing phase followed by the training phase, namely the parallel corpus processing phase to obtain a language model and translation model. Then the testing phase, and ends with the evaluation phase. SMT testing uses 25 single sentences without out-of-vocabulary (OOV), 25 single sentences with OOV, 25 compound sentences without OOV and 25 compound sentences with OOV. The test results of translating Indonesian sentences into Lampung dialectic api are shown through the accuracy value of Bilingual Evaluation Undestudy (BLEU) obtained in testing 25 single sentences without out-of-vocabulary (OOV) in the first strategy, the second and the third are 21.49%, 59.58% and 73.21%. In testing 25 single sentences with out-of-vocabulary (OOV) obtained in the first strategy, the second and the third are 23.22%, 44.33% and 68.72%. In testing 25 compound sentences without out-of-vocabulary (OOV) obtained in the first strategy, the second and the third are 18.22%, 39.4% and 69.18%. In testing 25 compound sentences with out-of-vocabulary (OOV) obtained in the first strategy, the second and the third are 25.94%, 28.22% and 71.94%.
Item Type: | Article |
---|---|
Subjects: | P Language and Literature > P Philology. Linguistics |
Divisions: | Fakultas Keguruan dan Ilmu Pendidikan (FKIP) > Prodi Magister Pendidikan Bahasa dan sastra Daerah |
Depositing User: | Dr. Farida Ariyani, M.Pd. |
Date Deposited: | 06 Mar 2023 01:57 |
Last Modified: | 06 Mar 2023 01:57 |
URI: | http://repository.lppm.unila.ac.id/id/eprint/48773 |
Actions (login required)
View Item |