The Classification of Documents in Malay and Indonesian Using the Naive Bayesian Method Uses Words and Phrases as a Training Set
Malay and Indonesian are two closely related languages that share much in word meanings and grammar. Because of this similarity, automatically classifying documents in the two languages is a challenge. A classification method widely used today is the Naive Bayesian method, but it needs to be implemented in a particular way to raise classification accuracy. This study uses a new approach: the training set consists of words and phrases rather than words alone. With this approach, the classification accuracy for the two languages increases.
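The idea of training on words and phrases can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how phrases are extracted, so the sketch assumes that a "phrase" is a pair of adjacent words (a bigram), uses a multinomial Naive Bayes model with add-one smoothing, and trains on a handful of made-up toy sentences. The names `features`, `NaiveBayes`, and `use_phrases` are hypothetical.

```python
import math
from collections import Counter, defaultdict


def features(text, use_phrases=True):
    """Return word features, plus adjacent-word pairs when use_phrases is set.

    Treating a phrase as a bigram is an assumption made for this sketch.
    """
    words = text.lower().split()
    feats = list(words)
    if use_phrases:
        feats += [" ".join(pair) for pair in zip(words, words[1:])]
    return feats


class NaiveBayes:
    """Multinomial Naive Bayes with additive (Laplace) smoothing."""

    def __init__(self, alpha=1.0, use_phrases=True):
        self.alpha = alpha
        self.use_phrases = use_phrases

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.feat_counts = defaultdict(Counter)
        self.vocab = set()
        for doc, label in zip(docs, labels):
            feats = features(doc, self.use_phrases)
            self.feat_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def log_scores(self, doc):
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.feat_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for feat in features(doc, self.use_phrases):
                if feat in self.vocab:  # features never seen in training are skipped
                    score += math.log((counts[feat] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, doc):
        scores = self.log_scores(doc)
        return max(scores, key=scores.get)


# Toy training sentences, invented for illustration only (not data from the study).
train_docs = [
    "saya pergi ke pejabat dengan kereta",
    "dia boleh datang kerana ada wang",
    "ibu berada di hospital",
    "saya pergi ke kantor dengan mobil",
    "dia bisa datang karena ada uang",
    "ibu berada di rumah sakit",
]
train_labels = ["malay"] * 3 + ["indonesian"] * 3

model = NaiveBayes(use_phrases=True).fit(train_docs, train_labels)
print(model.predict("dia bisa pergi ke kantor"))
print(model.predict("ibu pergi ke hospital dengan kereta"))
```

With `use_phrases=True`, a multi-word expression such as "rumah sakit" becomes a single feature alongside its component words, which is one plausible way phrase-level evidence could help separate two languages with heavily overlapping vocabularies. Setting `use_phrases=False` gives the words-only baseline for comparison.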
Document type: Peer-reviewed document
Document version: Final PDF version
Source document: Mendel. 2020, vol. 26, no. 2, pp. 23-28. ISSN 1803-3814