Single-Document Keyphrase Extraction for Multi-Document Keyphrase Extraction

Authors

  • Gábor Berend, University of Szeged, Department of Informatics
  • Richárd Farkas, University of Szeged, Department of Informatics

DOI:

https://doi.org/10.13053/cys-17-2-1522

Keywords:

Multi-document keyphrase extraction, knowledge management, information retrieval.

Abstract

Here, we address the task of assigning relevant terms to thematically and semantically related sub-corpora and achieve superior results compared to the baseline performance. Our results suggest that more reliable sets of keyphrases can be assigned to the semantically and thematically related subsets of some corpora if the automatically determined sets of keyphrases for the individual documents of an entire corpus are identified first. The sets of keyphrases assigned by our proposed method for the workshops present in the ACL Anthology Corpus over a 6-year period were considered better in more than 60% of the test cases compared to our baseline system when evaluated against an aggregation of different human judgements.

Published

2013-06-29