Information Retrieval for Unstructured Text Documents in Serbian into the Crime Domain
Conference object (Published version)
MetadataShow full item record
The reform and modernisation of public sector based on wide application of information-communication technologies (ICT) is considered as one the key elements of futher development of information society in the Republic of Serbia. The trends of development of many e-Government services for many countries in the world, indicate the necessity of application natural language processing - NLP and in e-Government services Republic of Serbia. When performing many of the natural language processing tasks it is needed that all forms of a word with the same meaning has the same form. In general, many documents that are available e-Government services are not structured, so from them is very difficult to isolate some forms (knowledge) that exist in them. The process of finding useful information from such documents in Serbian language, which normally represents highly inflectional language, is one of the key elements of the modern information society in the Republic of Serbia.
Keywords:Information Retrieval / Natural Language Processing / unstructured documents / Apach Lucene
Source:2015 16th IEEE international symposium on computational intelligence and informatics (CINTI), 2015, 267-271
- IEEE, New York