Performance of Classifiers in Bangla Text Categorization

Document Type

Conference Article

Publication Title

International Conference on Innovations in Science, Engineering and Technology, ICISET 2018

Abstract

Automated text categorization or text classification has become an important text mining task especially with the speedy development and increase of the number of on-line documents. Automatic text classification system aims to assign the text documents to their predefined categories based on some linguistic characteristics. Although research has progressed significantly for languages like English, Arabic, Chinese, etc., there has not been much development for the Indian Languages especially for Bangla which is one of the most popular languages of India and Bangladesh. One reason for this is the inherent complexity of Bangla which is accompanied by the unavailability of standard datasets and resources. In this paper, the performance of different classifiers is presented for the task of text classification based on 'term association' and 'term aggregation' feature extraction methods and an accuracy of 98.68% has been obtained on dataset of 8000 Bangla text documents procured from various web sources.

First Page

168

Last Page

173

DOI

10.1109/ICISET.2018.8745621

Publication Date

10-1-2018

This document is currently not available here.

Share

COinS