The role of statistical and semantic features in single-document extractive summarization

Tatiana Vodolazova; Elena Lloret; Rafael Muñoz; Manuel Palomar

doi:10.5430/air.v2n3p35

The role of statistical and semantic features in single-document extractive summarization

Tatiana Vodolazova, Elena Lloret, Rafael Muñoz, Manuel Palomar

Abstract

This paper reports on the further results of the ongoing research analyzing the impact of a range of commonly used statistical and semantic features in the context of extractive text summarization. The features experimented with include word frequency, inverse sentence and term frequencies, stopwords filtering, word senses, resolved anaphora and textual entailment. The obtained results demonstrate the relative importance of each feature and the limitations of the tools available. It has been shown that the inverse sentence frequency combined with the term frequency yields almost the same results as the latter combined with stopwords filtering that in its turn proved to be a highly competitive baseline. To improve the suboptimal results of anaphora resolution, the system was extended with the second anaphora resolution module. The present paper also describes the first attempts of the internal document data representation.

Full Text:

PDF

DOI: https://doi.org/10.5430/air.v2n3p35

Refbacks

There are currently no refbacks.

Artificial Intelligence Research

ISSN 1927-6974 (Print) ISSN 1927-6982 (Online)

Copyright © Sciedu Press
To make sure that you can receive messages from us, please add the 'Sciedupress.com' domain to your e-mail 'safe list'. If you do not receive e-mail in your 'inbox', check your 'bulk mail' or 'junk mail' folders.

Artificial Intelligence Research
International Peer-reviewed and Open Access Journal for the Artificial Intelligence Specialists

Honorary Editor-in-chief

The role of statistical and semantic features in single-document extractive summarization

Abstract

Full Text:

Refbacks

Username
Password
Remember me

Artificial Intelligence Research International Peer-reviewed and Open Access Journal for the Artificial Intelligence Specialists

Honorary Editor-in-chief

The role of statistical and semantic features in single-document extractive summarization

Abstract

Full Text:

Refbacks

Artificial Intelligence Research
International Peer-reviewed and Open Access Journal for the Artificial Intelligence Specialists