Authors
Alok Ranjan Pal (College of Engineering and Management, India) and Diganta Saha (Jadavpur University, India)
Abstract
The proposed approach detects jargon words in electronic data carried over communication media such as the internet and mobile services. In real-world usage, however, jargon words do not always appear in their complete forms. They are often written in abbreviated forms, such as sound-alike spellings and taboo morphemes. The proposed approach also detects these abbreviated forms, using a semi-supervised learning methodology that derives the probability that a suspicious word is a jargon word from synset and concept analysis of the text.
Keywords
Natural Language Processing (NLP), Jargon word, Suspicious word, Synset, Concept.