LexAN: Lexical Association Networks

Authors

  • Jorge Reyes-Magaña Universidad Autónoma de Yucatán
  • Gerardo Sierra Universidad Nacional Autónoma de México
  • Gemma Bel-Enguix Universidad Nacional Autónoma de México
  • Helena Gomez-Adorno Universidad Nacional Autónoma de México

DOI:

https://doi.org/10.13053/cys-27-4-4774

Keywords:

Network, co-occurrence, lexical access

Abstract

This paper presents Lexical Association Networks (LexAN), a mathematical model of a collection of words derived from a textual corpus. The associations between word tokens are represented by weighted edges in an undirected graph. Building a LexAN involves six stages: 1) lemmatization, 2) multi-word expressions, 3) stopword removal, 4) co-occurrence graph, 5) word co-occurrence norms, and 6) LexAN construction. We used a medical text corpus of 574,011 words to build our graphs. To assess the LexAN, we embedded the graphs in a tool that addresses the lexical access problem by functioning as a reverse dictionary. The results were favorable and promising.
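
As a rough illustration of the co-occurrence stage described in the abstract, the following minimal Python sketch builds a weighted, undirected graph from a few toy sentences. It is not the authors' implementation. The stopword list, the sentence-level co-occurrence window, the raw-count edge weights, and the function names `build_cooccurrence_graph` and `neighbors` are all assumptions made for this example. Lemmatization and multi-word expression handling are skipped; the input is assumed to be already normalized.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy stopword list (assumption, not taken from the paper).
STOPWORDS = {"the", "of", "a", "in", "is", "and"}


def build_cooccurrence_graph(sentences, stopwords=STOPWORDS):
    """Return a Counter mapping frozenset({u, v}) to a co-occurrence count.

    Two words are linked when they appear in the same sentence
    (an assumed window; the paper may use a different one).
    """
    edges = Counter()
    for sentence in sentences:
        tokens = [t for t in sentence.lower().split() if t not in stopwords]
        # Each unordered pair of distinct words in a sentence adds one count.
        for u, v in combinations(sorted(set(tokens)), 2):
            edges[frozenset((u, v))] += 1
    return edges


def neighbors(edges, word):
    """Return {neighbor: weight} for every edge that touches `word`."""
    result = {}
    for pair, weight in edges.items():
        if word in pair:
            (other,) = pair - {word}
            result[other] = weight
    return result


sentences = [
    "insulin regulates glucose in the blood",
    "the pancreas produces insulin",
    "glucose in the blood rises",
]
graph = build_cooccurrence_graph(sentences)
print(neighbors(graph, "glucose"))
```

Storing each edge as a `frozenset` keeps the graph undirected: the pair (glucose, blood) and the pair (blood, glucose) share one key and one weight.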

Author Biographies

Gerardo Sierra, Universidad Nacional Autónoma de México

Instituto de Ingeniería

Gemma Bel-Enguix, Universidad Nacional Autónoma de México

Instituto de Ingeniería

Helena Gomez-Adorno, Universidad Nacional Autónoma de México

Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas

Published

2023-12-17

Section

Articles