Abstract

One of the challenges in cross-lingual information retrieval is retrieving relevant information for a query expressed in a native language. While retrieving relevant documents is comparatively easy, analyzing the relevance of the retrieved documents and presenting the results to users are non-trivial tasks. This paper presents a method for information retrieval for a query expressed in a native language. The method uses insights from data mining and intelligent search to formulate the query and parse the results, and heuristic methods to categorize documents by relevance. Our approach complements the search engine’s built-in methods for identifying and displaying query results. A prototype has been developed for analyzing Tamil-English corpora. Initial results show that the approach is suitable for on-the-fly retrieval of documents.
