Using Earth Mover's Distance and Word Embeddings for Recognizing Textual Entailment in Arabic

Authors

  • Tarik Boudaa University of Ibn-Tofail, Faculty of Sciences, Laboratory of Informatics Systems and Optimization
  • Mohamed El Marouani University of Ibn-Tofail, Faculty of Sciences, Laboratory of Informatics Systems and Optimization
  • Nourddine Enneya University of Ibn-Tofail, Faculty of Sciences, Laboratory of Informatics Systems and Optimization

DOI:

https://doi.org/10.13053/cys-24-4-3389

Keywords:

Recognizing Textual Entailment (RTE), Natural Language Inference (NLI), Arabic NLP, earth mover's distance, machine learning

Abstract

Recognizing Textual Entailment (RTE) is a task of Natural Language Processing (NLP) in which two texts denoted TEXT (T) and HYPOTHESIS (H) are processed by a system to determine whether the meaning of H is inferred (entailed) from T or not. This task is useful for several NLP applications and it has attracted a lot of attention in research. Most of studies focused on English as a target language. In this paper, we give an overview of the main studies on Textual Entailment for English and Arabic and we present a new approach to deal with this task for Arabic using a measure of similarity based on Earth Mover's Distance, and machine learning. We experimented this approach using state of the art Arabic NLP tools and we achieved encouraging results. Although we have applied this approach only to Arabic, its application to other languages is still possible.

Downloads

Published

2020-12-02

Issue

Section

Articles