Identification of Verbal Phraseological Units in Mexican News Stories
DOI:
https://doi.org/10.13053/cys-19-4-2328
Keywords:
Verbal phraseological units, supervised machine learning, lexicon
Abstract
Verbal Phraseological Units are phrases made up of two or more words in which at least one of the words is a verb that plays the role of the predicate. One of the characteristics of this type of expression is that its global meaning can rarely be deduced from the meaning of its components. The automatic recognition of this type of linguistic structure is a very important task, since such units are a standard way of expressing a concept or idea. In this paper we present the results obtained when different supervised machine learning methods are employed for determining whether or not a verbal phraseological unit is present in a given newspaper story. The experiments have been carried out using a supervised corpus of news stories (written in Mexican Spanish). Besides the results obtained in the aforementioned experiments, we provide access to a new lexicon having phrases as entries (instead of single words), in which each entry is associated with a real value (normalized between zero and one) indicating its probability of being a verbal phraseological unit.
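The phrase lexicon described in the abstract can be pictured as a mapping from a multiword entry to a score between zero and one. The sketch below is only an illustration of how such a lexicon might be queried; it is a plain lookup, not the supervised machine learning methods evaluated in the paper. The names `LEXICON`, `tokenize` and `find_candidates`, the threshold value, and every entry and score are hypothetical and are not taken from the published lexicon.

```python
import re

# Hypothetical lexicon: phrase -> score in [0, 1].
# Entries and scores are invented for illustration only;
# they do not come from the lexicon released with the paper.
LEXICON = {
    "tomar el pelo": 0.9,
    "echar de menos": 0.8,
    "tomar el autobús": 0.1,
}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def find_candidates(story, lexicon, threshold=0.5):
    """Return (phrase, score) pairs found in the story.

    A phrase counts as found when its tokens occur contiguously
    in the story and its score is at least `threshold`.
    Results are sorted by descending score.
    """
    tokens = tokenize(story)
    hits = []
    for phrase, score in lexicon.items():
        phrase_tokens = tokenize(phrase)
        n = len(phrase_tokens)
        found = any(
            tokens[i:i + n] == phrase_tokens
            for i in range(len(tokens) - n + 1)
        )
        if found and score >= threshold:
            hits.append((phrase, score))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


if __name__ == "__main__":
    story = "No me quieras tomar el pelo, voy a tomar el autobús."
    print(find_candidates(story, LEXICON))
```

Exact token matching like this misses conjugated forms (for example "tomó el pelo"), so a practical system would probably need lemmatization or a trained classifier on top of the lexicon.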
Published
2015-12-18
Issue
Section
Articles
License
Hereby I transfer exclusively to the Journal "Computación y Sistemas", published by the Computing Research Center (CIC-IPN), the Copyright of the aforementioned paper. I also accept that these rights will not be transferred to any other publication, in any other format, language or other existing means of developing.
I certify that the paper has not been previously disclosed or simultaneously submitted to any other publication, and that it does not contain material whose publication would violate the Copyright or other proprietary rights of any person, company or institution. I certify that I have the permission from the institution or company where I work or study to publish this work.
The representative author accepts the responsibility for the publication of this paper on behalf of each and every one of the authors.
This transfer is subject to the following conditions:
- The authors retain all ownership rights (such as patent rights) of this work, except for the publishing rights transferred to the CIC, through this document.
- Authors retain the right to publish the work in whole or in part in any book of which they are the authors or publishers. They can also make use of this work in conferences, courses, personal web pages, and so on.
- Authors may include the work as part of their thesis, for non-profit distribution only.