Preserving the user’s preference in document-category management is essential because it affects his/her search efficiency, cognitive processing load, and satisfaction. Prior research has investigated automated document category evolution by using lexicon-based documentcategory evolution techniques which take into account the document categories previously created by the user. However, comparing documents at the lexical level cannot solve word mismatch or ambiguity problems effectively. To address such problems inherent to the lexicon-based approach, we propose an ONtology-based Category Evolution (ONCE) technique, which uses an appropriate ontology to support document-category evolution at the conceptual level rather than at the lexical level. Specifically, we develop an Ontology Enrichment (OE) technique for automatic leaning of concept descriptors in the adopted ontology. We empirically evaluate the effectiveness of the proposed ONCE technique, using a lexicon-based document-category evolution technique (i.e., CE2) and the hierarchical agglomerative clustering (HAC) technique for benchmark purposes. According to our empirical results, ONCE appears more effective than CE2 and HAC, and achieves higher clustering recall and precision.
Lee, Yen-Hsien; Wei, Chih-Ping; and Hu, Paul Jen-Hwa, "Preserving User Preferences in Document-Category Management: An Ontology-based Evolution Approach" (2007). PACIS 2007 Proceedings. 64.