Track 8: New Topics in IS Development

Going Deeper than Supervised Discretisation in Processing of Stylometric Features

Urszula Stanczyk, Silesian University of Technology, Department of Graphics Computer Vision and Digital Systems, Gliwice, Poland
Beata Zielosko, University of Silesia in Katowice, Faculty of Science and Technology, Institute of Computer Science, Sosnowiec, Poland
Grzegorz Baron, Silesian University of Technology, Department of Graphics Computer Vision and Digital Systems, Gliwice, Poland

Abstract

Rough set theory is employed in cases where data are incomplete and inconsistent and an approximation of concepts is needed. The classical approach works for discrete data and allows only nominal classification. To induce the best rules, access to all available information is advantageous, but it can be endangered when discretisation is a necessary step in the data preparation stage. Discretisation, even when executed with class labels of instances taken into account, brings some information loss. The research methodology illustrated in this paper is dedicated to extended transformations of continuous input features into categorical ones, with the goal of enhancing the performance of rule-based classifiers constructed with rough set data mining. The experiments were carried out in the stylometry domain, with its key task of authorship attribution. The obtained results indicate that supporting supervised discretisation with elements of unsupervised transformations can lead to enhanced predictions, which shows the merits of the proposed research framework.

Recommended Citation

Stanczyk, U., Zielosko, B., & Baron, G. (2023). Going Deeper than Supervised Discretisation in Processing of Stylometric Features. In A. R. da Silva, M. M. da Silva, J. Estima, C. Barry, M. Lang, H. Linger, & C. Schneider (Eds.), Information Systems Development, Organizational Aspects and Societal Trends (ISD2023 Proceedings). Lisbon, Portugal: Instituto Superior Técnico. ISBN: 978-989-33-5509-1. https://doi.org/10.62036/ISD.2023.32

Paper Type

Full Paper

DOI

10.62036/ISD.2023.32

