Unsupervised Keyphrase Extraction: Ranking Step and Single-Word Phrase Problem

Authors

  • Svetlana Popova TU Dublin
  • Vera Danilova Uppsala University
  • Mikhail Alexandrov Russian Academy of National Economy and Public Administration
  • John Cardiff TU Dublin

DOI:

https://doi.org/10.13053/cys-28-3-5197

Keywords:

Keyphrase extraction, one-word phrase problem, keyphrase length, natural language processing

Abstract

Keyphrases provide a compact representation of a document‘s content and can be efficiently used to enhance Web search results and improve natural language processing tasks. This paper extends the state-of-the-art in unsupervised keyphrase extraction from scientific paper abstracts. It aims to demonstrate the existence of a dataset-dependent single-word phrase problem explicitly. We also aim to investigate how different unsupervised algorithms handle the task of ranking both single-word and multi-word phrases and to observe the effect the single-word phrase problemhas on phrase ranking. This paper helps analyze the reasons allowing algorithms to perform better or worse in comparison to each other and shows how the gained insights can enhance the quality of the existing algorithms.

Downloads

Published

2024-09-23

Issue

Section

Articles