MACHINE TRANSLATION WITH JAVANESE SPEECH LEVELS’ CLASSIFICATION


Abstract

A hybrid corpus-based machine processing has been developed to produce a proper Javanese speech level translation. The developed statistical memory-based machine translation shows significantly accurate results. Integration of an automatic text classifier and an expert system is proposed to help Javanese in classifying the speech levels used for a specific interlocutor. Javanese rule-based expert system is designed while naive Bayes classifier is selected after outperforming simple logic probability approach. As a result, the average of translation accuracy (72.3%) indicates that the integrated intelligent interfaces could effectively solve the Javanese language pragmatic translation problems.


Keywords

expert system; hybrid corpus-based machine translation; Javanese speech levels; text classifier

Ahrenberg L., Andersson M., Merkel M.: A simple hybrid aligner for generating lexical corespondences in parallel text. Proc. 36th Annual Meeting of the Association for Computational Linguistics Montreal, Quebec, Canada, 1998, 29–35.

Arias-Aranda D., et al.: A fuzzy expert system for business management. Expert Systems with Applications 37 (12), 2010, 7570–7580.

Bilbao V. M. D., Lopes J. G. P., Ildefonso T.: Measuring the impact of cognates in parallel text alignment. Proc. EPIA 2005, 338–343.

Bond F., Ogura K.: Reference in Japanese-English Machine Translation. Machine Translation 13 (2/3), 1998, 107–134.

Chen Y., et al.: Constructing a nutrition diagnosis expert system. Expert Systems with Applications 39 (2), 2012, 2132–2156.

Hutchins J.: Example-based machine translation: a review and commentary. Machine Translation 19 (3), 2005, 197–211.

Jian De W., Zhao Xiong C., He Yan H.: Intelligent Case Based Machine Translation System Computational Linguistics and Intelligent Text Processing, in A Gelbukh, Editor, Springer, Berlin/Heidelberg, 2001, 197–205.

John K.: Expert systems and mass appraisal. Journal of Property Investment & Finance 29 (4/5), 2011, 529–550.

Koutsantonis D., Panayiotopoulos J-c.: Expert system personalized knowledge retrieval. Operational Research 11 (2), 2011, 215–227.

Le S., et al.: PECAT: a computer-aided translation tool based on bilingual corpora. Proc. Systems, Man, and Cybernetics, vol. 2, 2001, 927–932.

Li J., Wang B.: The automatic extraction of translation patterns and matching algorithm in an English-Chinese machine translation. Proc. Natural Language Processing and Knowledge Engineering. Wuhan, 2005, 839–843.

Liu X., Fang C., Xiao D.: Intrusion diagnosis and prediction with expert system. Security and Communication Networks 4 (12), 2011, 1483–1494.

Lopez A.: Statistical machine translation. ACM Comput. Surv. 40 (3), 2008, 1–49.

Mitchell T.M.: Machine Learning. McGraw-Hill, New York, 1997.

Poedjosoedarmo G.: The effect of Bahasa Indonesia as a lingua franca on the Javanese system of speech levels and their functions. International Journal of the Sociology of Language 177 (1), 2006, 111–121.

Poedjosoedarmo S.: Javanese Speech Levels. Indonesia (6), 1968, 54–81.

Purwadi P., Mahmudi M., Setijaningrum E.: Javanese language structure. Media Abadi,Yogyakarta, 2005.

Quinn G.: Teaching Javanese Respect Usage to Foreign Learners. Electronic Journal of Foreign Language Teaching 8, 2011, 362–370.

Setiyanto A. B.: Parama Satra: Javanese Language. Panji Pustaka,Yogyakarta, 2010.

Smith-Hefner N. J.: Language Shift, Gender, and Ideologies of Modernity in Central Java, Indonesia. Journal of Linguistic Anthropology 19 (1), 2009, 57–77.

Somers H.: Review Article: Example-based Machine Translation. Machine Translation. 14 (2), 1999, 113–157.

Subroto D. E., Rahardjo M. D., Setiawan B.: Endangered krama and krama Inggil varieties of the Javanese language. Linguistik Indonesia 26 (1), 2008, 89–96.

Susilowati E., et al.: Practicing Javanese (Gladhen Basa Jawa) for Junior High School. CV Pustaka Grafika, Malang, 2010.

Terashima R., Echizen-ya H., Araki K.: Learning method for extraction of partial correspondence from parallel corpus. Proc International Conference on Asian Language Processing. Singapore, 2009, 293–298.

Van Hecke T.: Fuzzy Expert System to Characterize Students. PRIMUS 21 (7), 2011, 651–658.

Way A., Gough N.: Comparing example-based and statistical machine translation. Natural Language Engineering 11 (3), 2005, 295–309.

Wedhawati W., et al.: Struktur Nahasa Jawa Mutakhir. Kanisius, Yogyakarta, 2006.

Wibawa A. P., et al.: Edit Distance Algorithm to Increase Storage Efficiency of Javanese Corpora. Proc International Conference on Computer, Electrical, and Systems Sciences, and Engineering. Singapore, 2012, 1056–1060.

Wibawa A. P., et al.: Hybrid machine translation for Javanese speech levels. Proc. 5th International Conference on Knowledge and Smart Technology (KST). Burappa, Thailand, 2013.

Wibawa S.: Efforts to maintain and develop Javanese language politeness. Proc. International Seminar of Javanese Language. Paramaribo, Suriname, 2005, 1–10.

Yazdchi M. V., Faili H.: Generating English-Persian parallel corpus using an automatic anchor finding sentence aligner. Proc Natural Language Processing and Knowledge Engineering (NLP-KE), 2010, 1–6.

Zhao J., Liu F., Liu D.: Two-phase base noun phrase alignment in Chinese-English parallel corpora. Proc. Natural Language Processing and Knowledge Engineering. Wuhan, 2005, 360–365.

Download

Published : 2016-02-04


Nafalski, A., & Wibawa, A. P. (2016). MACHINE TRANSLATION WITH JAVANESE SPEECH LEVELS’ CLASSIFICATION. Informatyka, Automatyka, Pomiary W Gospodarce I Ochronie Środowiska, 6(1), 21-25. https://doi.org/10.5604/20830157.1194260

Andrew Nafalski  nafalski@unisa.edu.au
University of South Australia, School of Engineering, Australia  Australia
Aji Prasetya Wibawa 
Universitas Negeri Malang, Department of Electrical Engineering, Indonesia  Indonesia