Graph Mining under Linguistic Constraints for Exploring Large Texts

Authors

  • Solen Quiniou LINA, LUNAM Université de Nantes, Nantes, France
  • Peggy Cellier IRISA, INSA de Rennes, Rennes, France
  • Thierry Charnois GREYC, Université de Caen Basse-Normandie, Caen, France; MoDyCO, Université Paris-Ouest Nanterre La Défense, Paris, France
  • Dominique Legallois CRISCO, Université de Caen Basse-Normandie, Caen, France

DOI:

https://doi.org/10.13053/cys-17-2-1529

Keywords:

Text coherence, graph representation, graph mining, Hoey’s linguistic model.

Abstract

In this paper, we propose an approach to explore large texts by highlighting coherent sub-parts. The exploration method relies on a graph representation of the text according to Hoey’s linguistic model, which allows the selection and the binding of adjacent and non-adjacent sentences. The main contribution of our work is a method based on both Hoey’s linguistic model and a special graph mining technique, called CoHoP mining, to extract coherent sub-parts of the graph representation of the text. We have conducted experiments on several English texts that show the value of the proposed approach.

Published

2013-06-29