Abstract
Nowadays, although many researches is being done in the field of word sense disambiguation in some languages like English, still some other languages like Persian have many things to be done. Some difficulties are in this way which might have made it less interactive for researchers. For example, Persian WordNet or FarsNet is newly developed and there is no sense tagged corpus developed based on it yet. So we propose a semi-supervised approach for extending FarsNet with some new relations and then use it for WSD. Also a method to extract semantic keywords or key-concepts from textual documents is used. As the key-concepts are extracted exploiting FarsNet, we call them Key-synsets. In fact Key-synsets of a document are those synsets which are semantically related to the main subjects of that document. This method is exploited to improve the precision of the proposed WSD. Although our approach is tested on Persian it can be easily adopted for other languages such as English.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Miller, G.A., Leacock, C., Tengi, R., Bunker, R.T.: A semantic concordance. In: Proc. of the ARPA Workshop on Human Language Technology, pp. 303–308 (1993)
Navigli, R.: Word Sense Disambiguation: A Survey. ACM Computing Surveys 41(2), Article 10 (February 2009)
Tsatsaronis, G., Varlamis, I., Nørvåg, K.: An Experimental Study on Unsupervised Graph-based Word Sense Disambiguation. In: Gelbukh, A. (ed.) CICLing 2010. LNCS, vol. 6008, pp. 184–198. Springer, Heidelberg (2010)
Yarowsky, D.: Word-sense disambiguation using statistical models of roget’s categories trained on large corpora. In: Proc. of COLING, pp. 454–460 (1992)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In: Proc. of the 5th SIGDOC, pp. 24–26 (1986)
Agirre, E., Soroa, A.: Personalizing pagerank for word sense disambiguation. In: Proc. of EACL, pp. 33–41 (2009)
Brody, S., Navigli, R., Lapata, M.: Ensemble methods for unsupervised wsd. In: Proc. of COLING/ACL, pp. 97–104 (2006)
Agirre, E., MartÃnez, D.: Exploring automatic word sense disambiguation with decision lists and the Web CoRR cs.CL/0010024 (2000)
Tang, X., Chen, X., Qu, W., Yu, S.: Semi-Supervised WSD in Selectional Preferences with Semantic Redundancy. In: COLING (Posters), pp. 1238–1246 (2010)
Brody, S.: Closing the Gap in WSD: Supervised Results with Unsupervised Methods. Doctor of Philosophy thesis, Institute for Communicating and Collaborative Systems. School of Informatics. University of Edinburgh (2009)
Yarowsky, D., Radu, F.: Evaluating sense disambiguation across diverse parameter spaces. Natural Language Engineering 9(4), 293–310 (2002)
Tsatsaronis, G., Vazirgiannis, M., Androutsopoulos, I.: Word sense disambiguation with spreading activation networks generated from thesauri. In: Proc. of IJCAI, pp. 1725–1730 (2007)
Tran, A., Bowes, C., Brown, D., Chen, P., Choly, M., Ding, W.: TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain. In: Proc. of the 5th International Workshop on Semantic Evaluation, ACL 2010, Uppsala, Sweden, July 15-16, pp. 396–401 (2010)
Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proc. of the 18th International Joint Conference on Artificial Intelligence, IJCAI, Acapulco, Mexico, pp. 805–810 (2003)
Saedi, C., Shamsfard, M.: Translating Persian documents into English using knowledge based WSD. In: ICDIM 2009, pp. 229–234 (2009)
Faili, H.: An Experiment of Word Sense Disambiguation in a Machine Translation System. In: Proc. of 2008 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2008), pp. 28–35 (2008)
Xu, S., Yang, S., Lau, F.: Keyword Extraction and Headline Generation Using Novel Word Features. In: Proc. of the Twenty-Fourth AAAI Conference on Artificial Intelligence (2010)
Ercan, G., Cicekli, I.: Using lexical chains for keyword extraction. Information Processing and Management 43(6), 1705–1714 (2007)
Hulth, A.: Automatic Keyword Extraction. VDM Verlag Dr. Mueller, E.K. Binding: Paperback (2008) ISBN: 363903855X, ISBN-13: 9783639038552
Barker, K., Cornacchia, N.: Using Noun Phrase Heads to Extract Document Keyphrases. In: Canadian Conference on AI 2000, pp. 40–52 (2000)
Turney, P.D.: Learning algorithms for keyphrase extraction. Information Retrieval 2(4), 303–336 (2000) (NRC #44105)
Wartena, C., Brussee, R., Slakhorst, W.: Keyword Extraction Using Word Co-occurrence. In: Workshops on Database and Expert Systems Applications, Bilbao, Spain, August 30-September 03 (2010) ISBN: 978-0-7695-4174-7
Shamsfard, M., Jafari, H., Ilbeygi, M.: STeP-1: A Set of Fundamental Tools for Persian Text Processing. In: LREC (2010)
Shamsfard, M., Hesabi, A., Fadaei, H., Mansoory, N., Famian, A., Bagherbeigi, S., Fekri, E., Monshizadeh, M., Assi: Semi Automatic Development of FarsNet; The Persian WordNet. In: Assi: Semi Automatic Development of FarsNet; The Persian WordNet. 5th Global WordNet Conference (GWA 2010), Mumbai, India (2010)
Fragos, K., Maistros, I., Skourlas, C.: Word Sense Disambiguation using WordNet relations. In: Proc. of 1st Balkan Conference in Informatics, October 20-22 (2003)
AleAhmad, A., Amiri, H., Darrudi, E., Rahgozar, M., Oroumchian, F.: Hamshahri: A standard Persian text collection. Journal of Knowledge-Based Systems 22(5), 382–387 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Haghollahi, M., Shamsfard, M. (2011). A Semi-supervised Approach for Key-Synset Extraction to Be Used in Word Sense Disambiguation. In: Salem, M.V.M., Shaalan, K., Oroumchian, F., Shakery, A., Khelalfa, H. (eds) Information Retrieval Technology. AIRS 2011. Lecture Notes in Computer Science, vol 7097. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25631-8_54
Download citation
DOI: https://doi.org/10.1007/978-3-642-25631-8_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25630-1
Online ISBN: 978-3-642-25631-8
eBook Packages: Computer ScienceComputer Science (R0)