Authors
Cem Rifki Aydin1, Ali Erkan1, Tunga Gungor1 and Hidayet Takci2, 1Bogazici University, Turkey and 2Cumhuriyet University, Turkey
Abstract
In this study, a dictionary-based method is used to extract expressive concepts from documents. So far, there have been many studies concerning concept mining in English, but this area of study for Turkish, an agglutinative language, is still immature. We used dictionary instead of WordNet, a lexical database grouping words into synsets that is widely used for concept extraction. The dictionaries are rarely used in the domain of concept mining, but taking into account that dictionary entries have synonyms, hypernyms, hyponyms and other relationships in their meaning texts, the success rate has been high for determining concepts. This concept extraction method is implemented on documents, that are collected from different corpora.
Keywords
Concept mining, Morphological analysis, Morphological disambiguation