Skip to main content

A Semi-supervised Approach for Key-Synset Extraction to Be Used in Word Sense Disambiguation

  • Conference paper
Information Retrieval Technology (AIRS 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7097))

Included in the following conference series:

  • 1363 Accesses

Abstract

Nowadays, although many researches is being done in the field of word sense disambiguation in some languages like English, still some other languages like Persian have many things to be done. Some difficulties are in this way which might have made it less interactive for researchers. For example, Persian WordNet or FarsNet is newly developed and there is no sense tagged corpus developed based on it yet. So we propose a semi-supervised approach for extending FarsNet with some new relations and then use it for WSD. Also a method to extract semantic keywords or key-concepts from textual documents is used. As the key-concepts are extracted exploiting FarsNet, we call them Key-synsets. In fact Key-synsets of a document are those synsets which are semantically related to the main subjects of that document. This method is exploited to improve the precision of the proposed WSD. Although our approach is tested on Persian it can be easily adopted for other languages such as English.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Miller, G.A., Leacock, C., Tengi, R., Bunker, R.T.: A semantic concordance. In: Proc. of the ARPA Workshop on Human Language Technology, pp. 303–308 (1993)

    Google Scholar 

  2. Navigli, R.: Word Sense Disambiguation: A Survey. ACM Computing Surveys 41(2), Article 10 (February 2009)

    Google Scholar 

  3. Tsatsaronis, G., Varlamis, I., NørvÃ¥g, K.: An Experimental Study on Unsupervised Graph-based Word Sense Disambiguation. In: Gelbukh, A. (ed.) CICLing 2010. LNCS, vol. 6008, pp. 184–198. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  4. Yarowsky, D.: Word-sense disambiguation using statistical models of roget’s categories trained on large corpora. In: Proc. of COLING, pp. 454–460 (1992)

    Google Scholar 

  5. Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In: Proc. of the 5th SIGDOC, pp. 24–26 (1986)

    Google Scholar 

  6. Agirre, E., Soroa, A.: Personalizing pagerank for word sense disambiguation. In: Proc. of EACL, pp. 33–41 (2009)

    Google Scholar 

  7. Brody, S., Navigli, R., Lapata, M.: Ensemble methods for unsupervised wsd. In: Proc. of COLING/ACL, pp. 97–104 (2006)

    Google Scholar 

  8. Agirre, E., Martínez, D.: Exploring automatic word sense disambiguation with decision lists and the Web CoRR cs.CL/0010024 (2000)

    Google Scholar 

  9. Tang, X., Chen, X., Qu, W., Yu, S.: Semi-Supervised WSD in Selectional Preferences with Semantic Redundancy. In: COLING (Posters), pp. 1238–1246 (2010)

    Google Scholar 

  10. Brody, S.: Closing the Gap in WSD: Supervised Results with Unsupervised Methods. Doctor of Philosophy thesis, Institute for Communicating and Collaborative Systems. School of Informatics. University of Edinburgh (2009)

    Google Scholar 

  11. Yarowsky, D., Radu, F.: Evaluating sense disambiguation across diverse parameter spaces. Natural Language Engineering 9(4), 293–310 (2002)

    Article  Google Scholar 

  12. Tsatsaronis, G., Vazirgiannis, M., Androutsopoulos, I.: Word sense disambiguation with spreading activation networks generated from thesauri. In: Proc. of IJCAI, pp. 1725–1730 (2007)

    Google Scholar 

  13. Tran, A., Bowes, C., Brown, D., Chen, P., Choly, M., Ding, W.: TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain. In: Proc. of the 5th International Workshop on Semantic Evaluation, ACL 2010, Uppsala, Sweden, July 15-16, pp. 396–401 (2010)

    Google Scholar 

  14. Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proc. of the 18th International Joint Conference on Artificial Intelligence, IJCAI, Acapulco, Mexico, pp. 805–810 (2003)

    Google Scholar 

  15. Saedi, C., Shamsfard, M.: Translating Persian documents into English using knowledge based WSD. In: ICDIM 2009, pp. 229–234 (2009)

    Google Scholar 

  16. Faili, H.: An Experiment of Word Sense Disambiguation in a Machine Translation System. In: Proc. of 2008 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2008), pp. 28–35 (2008)

    Google Scholar 

  17. Xu, S., Yang, S., Lau, F.: Keyword Extraction and Headline Generation Using Novel Word Features. In: Proc. of the Twenty-Fourth AAAI Conference on Artificial Intelligence (2010)

    Google Scholar 

  18. Ercan, G., Cicekli, I.: Using lexical chains for keyword extraction. Information Processing and Management 43(6), 1705–1714 (2007)

    Article  Google Scholar 

  19. Hulth, A.: Automatic Keyword Extraction. VDM Verlag Dr. Mueller, E.K. Binding: Paperback (2008) ISBN: 363903855X, ISBN-13: 9783639038552

    Google Scholar 

  20. Barker, K., Cornacchia, N.: Using Noun Phrase Heads to Extract Document Keyphrases. In: Canadian Conference on AI 2000, pp. 40–52 (2000)

    Google Scholar 

  21. Turney, P.D.: Learning algorithms for keyphrase extraction. Information Retrieval 2(4), 303–336 (2000) (NRC #44105)

    Article  Google Scholar 

  22. Wartena, C., Brussee, R., Slakhorst, W.: Keyword Extraction Using Word Co-occurrence. In: Workshops on Database and Expert Systems Applications, Bilbao, Spain, August 30-September 03 (2010) ISBN: 978-0-7695-4174-7

    Google Scholar 

  23. Shamsfard, M., Jafari, H., Ilbeygi, M.: STeP-1: A Set of Fundamental Tools for Persian Text Processing. In: LREC (2010)

    Google Scholar 

  24. Shamsfard, M., Hesabi, A., Fadaei, H., Mansoory, N., Famian, A., Bagherbeigi, S., Fekri, E., Monshizadeh, M., Assi: Semi Automatic Development of FarsNet; The Persian WordNet. In: Assi: Semi Automatic Development of FarsNet; The Persian WordNet. 5th Global WordNet Conference (GWA 2010), Mumbai, India (2010)

    Google Scholar 

  25. Fragos, K., Maistros, I., Skourlas, C.: Word Sense Disambiguation using WordNet relations. In: Proc. of 1st Balkan Conference in Informatics, October 20-22 (2003)

    Google Scholar 

  26. AleAhmad, A., Amiri, H., Darrudi, E., Rahgozar, M., Oroumchian, F.: Hamshahri: A standard Persian text collection. Journal of Knowledge-Based Systems 22(5), 382–387 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Haghollahi, M., Shamsfard, M. (2011). A Semi-supervised Approach for Key-Synset Extraction to Be Used in Word Sense Disambiguation. In: Salem, M.V.M., Shaalan, K., Oroumchian, F., Shakery, A., Khelalfa, H. (eds) Information Retrieval Technology. AIRS 2011. Lecture Notes in Computer Science, vol 7097. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25631-8_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25631-8_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25630-1

  • Online ISBN: 978-3-642-25631-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics