Journal of the Association for Information Systems

A Hybrid Attribute Selection Approach for Text Classification

Chen-Huei Chou, College of CharlestonFollow
Atish P. Sinha, University of Wisconsin - MilwaukeeFollow
Huimin Zhao, University of Wisconsin - MilwaukeeFollow

Abstract

The application of text mining in organizations is growing. Text classification, an important type of text mining problem, is characterized by a large attribute space and entails an efficient and effective attribute selection procedure. There are two general attribute selection approaches: the filter approach and the wrapper approach. While the wrapper approach is potentially more effective in finding the best attribute subset, it is cost-prohibitive in most text classification applications. In this paper, we propose a hybrid attribute selection approach that is both efficient and effective for text classification problems. We apply the proposed approach to detect and prevent Internet abuse in the workplace, which is becoming a major problem in modern organizations. The empirical evaluations we conducted using a variety of classification algorithms, indexing schemes, and attribute selection methods demonstrate the utility of the proposed approach. We found that combining the filter and wrapper approaches not only boosts the accuracies of text classifiers but also brings down the computational costs significantly.

Recommended Citation

Chou, Chen-Huei; Sinha, Atish P.; and Zhao, Huimin (2010) "A Hybrid Attribute Selection Approach for Text Classification," Journal of the Association for Information Systems, 11(9), .
DOI: 10.17705/1jais.00236
Available at: https://aisel.aisnet.org/jais/vol11/iss9/1

DOI

10.17705/1jais.00236

Download

COinS

When commenting on articles, please be friendly, welcoming, respectful and abide by the AIS eLibrary Discussion Thread Code of Conduct posted here.

A Hybrid Attribute Selection Approach for Text Classification

Authors

Abstract

Recommended Citation

DOI

Share

Search