Automatically Finding Significant Topical Terms from Documents

Abstract

With the spread of digital textual data, text mining is becoming increasingly important for gaining competitive advantage. One factor in successful text mining applications is the ability to find significant topical terms that help reveal interesting patterns or relationships. Document keyphrases are the phrases that carry the most important topical concepts of a given document. In many applications, keyphrases are better suited to text mining than single words and can provide more discriminating power. This paper describes an automatic keyphrase identification program (KIP). KIP’s algorithm examines the composition of noun phrases and calculates their scores by consulting a domain-specific glossary database; the noun phrases with the highest scores are extracted as keyphrases. KIP’s learning function can enrich its glossary database by automatically adding newly identified keyphrases. KIP’s personalization feature allows users to build a glossary database tailored to their own areas of interest.
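The scoring idea described in the abstract can be illustrated with a minimal Python sketch. It is not KIP's actual implementation: the abstract gives neither the scoring formula nor the glossary schema, so the glossary weights, the rule for combining whole-phrase and component-word scores, and the names `GLOSSARY`, `score_phrase`, `extract_keyphrases`, and `learn` are all assumptions made for illustration. The sketch also assumes that candidate noun phrases have already been identified.

```python
# Hypothetical glossary mapping terms to weights. The real KIP glossary
# schema and weights are not given in the abstract; these are invented.
GLOSSARY = {
    "text mining": 0.9,
    "keyphrase": 0.8,
    "glossary": 0.5,
    "database": 0.4,
    "mining": 0.3,
}


def score_phrase(phrase, glossary):
    """Score a noun phrase from a whole-phrase lookup plus lookups of its
    component words (an assumed combination rule, not KIP's formula)."""
    phrase = phrase.lower()
    words = phrase.split()
    score = glossary.get(phrase, 0.0)
    if len(words) > 1:
        # Examine the phrase's composition: add each component word's weight.
        score += sum(glossary.get(word, 0.0) for word in words)
    return score


def extract_keyphrases(noun_phrases, glossary, top_n=2):
    """Return the top_n candidate noun phrases ranked by score."""
    ranked = sorted(
        noun_phrases,
        key=lambda phrase: score_phrase(phrase, glossary),
        reverse=True,
    )
    return ranked[:top_n]


def learn(keyphrases, glossary, default_weight=0.1):
    """Add newly identified keyphrases to the glossary (assumed default
    weight), leaving existing entries unchanged."""
    for phrase in keyphrases:
        glossary.setdefault(phrase.lower(), default_weight)


if __name__ == "__main__":
    candidates = ["text mining", "glossary database", "competitive advantages"]
    top = extract_keyphrases(candidates, GLOSSARY)
    print(top)
    learn(top, GLOSSARY)
```

In this sketch, a personalized glossary would simply be a different dictionary passed as the `glossary` argument.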

Recommended Citation

Li, Quanzhi; Wu, Yi-Fang Brook; Bot, Razvan Stefan; and Chen, Xin, "Automatically Finding Significant Topical Terms from Documents" (2005). AMCIS 2005 Proceedings. 120.
https://aisel.aisnet.org/amcis2005/120
