Detail Cantuman
Advanced SearchArtikel Jurnal
Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia
Abstract
Here a development of an Acoustic and Language Model is presented. Low Word Error Rate is an early good sign of a good Language and Acoustic Model. Although there are still parameters other than Words Error Rate, our work focused on building Bahasa Indonesia with approximately 2000 common words and achieved the minimum threshold of 25% Word Error Rate. There were several experiments consist of different cases, training data, and testing data with Word Error Rate and Testing Ratio as the main comparison. The language and acoustic model were built using Sphinx4 from Carnegie Mellon University using Hidden Markov Model for the acoustic model and ARPA Model for the language model. The models configurations, which are Beam Width and Force Alignment, directly correlates with Word Error Rate. The configurations were set to 1e-80 for Beam Width and 1e-60 for Force Alignment to prevent underfitting or overfitting of the acoustic model. The goals of this research are to build continuous speech recognition in Bahasa Indonesia which has low Word Error Rate and to determine the optimum numbers of training and testing data which minimize the Word Error Rate.
Ketersediaan
JUTISI2-014 | JUTISI V6N2 Agustus 2020 | Perpustakaan FT UPI YAI | Tersedia |
Informasi Detil
Judul Seri |
JUTISI : Jurnal Teknik Informatika dan Sistem Informasi
|
---|---|
No. Panggil |
JUTISI V6N2 Agustus 2020
|
Penerbit | Maranatha University Press : Bandung., 2020 |
Deskripsi Fisik |
hlm : 301-310
|
Bahasa |
English
|
ISBN/ISSN |
2443-2210
|
Klasifikasi |
JUTISI
|
Tipe Isi |
-
|
Tipe Media |
-
|
---|---|
Tipe Pembawa |
-
|
Edisi |
Volume 6 Nomor 2 Agustus 2020
|
Subyek | |
Info Detil Spesifik |
-
|
Pernyataan Tanggungjawab |
-
|
Versi lain/terkait
Tidak tersedia versi lain