Abstract—This paper aims to classify texts in Indonesian language into emotion expression classes. The data were taken from 6 basic emotion classes whose training documents and test documents were obtained from articles in www.kompas.com, www.suaramerdeka.com, and www.detik.com. The text weighing was processed by using TFID method which is an integration of Term Frequency (TF) and Inverse Document Frequency (IDF). In the classification process, K-Nearest Neighbor (K-NN) was used to see how far this method could classify emotion expression of Indonesian language. The test shows that the classification of the Indonesian texts for the six basic emotion classes by using KNN method results in accurateness percentage of 71.26%, obtained at k=40 as the optimum value.
Index Terms—Basic emotions, K-Nearest Neighbor, TFIDF Indonesian language.
Arifin is with the Dian Nuswantoro University, Semarang, CO 50139INDONESIA (e-mail: arifin.firdan@gmail.com)
Ketut Eddy Purnam a is with the Electrical Engineering Department, Institute of Technology Sepuluh Nopember Surabaya Kampus ITS, Sukolilo, Surabaya CO 60111 INDONESIA (e-mail: ketut@ee.its.ac.id).
Cite: Arifin and Ketut Eddy Purnama, "Classification of Emotions in Indonesian Texts Using K-NN Method," International Journal of Information and Electronics Engineering vol. 2, no. 6, pp. 899-903, 2012.