Abstract
The paper discusses a series of related techniques that prepare and transform raw linguistic data for advanced processing in order to unveil hidden grammatical patterns. It identifies XML as a suitable mark-up language to build an exploitable data bank of multi-dimensional data in the Hebrew text of the Old Testament. This concept is illustrated by tagging a transcription of Gen. 1:1-2:3 and manipulating this data bank. Transferring the data into a three-dimensional array allows advanced processing of the data in order to either confirm existing knowledge or to mine for new, yet undiscovered, linguistic features. Visualisation is discussed as a technique that enhances interaction between the human researcher and the computerised technologies supporting this process of knowledge creation. The empirical study is a small experiment that illustrates the viability and usefulness of the proposed expert devices as well as the benefits of applying information system techniques to linguistic databases.
Recommended Citation
Kroeze, Jan H.; Bothma, Theo J.D.; and Matthee, Machdel C., "From Tags to Topic Maps: Using Marked-up Hebrew Text to Discover Linguistic Patterns" (2008). CONF-IRM 2008 Proceedings. 30.
https://aisel.aisnet.org/confirm2008/30