Cover Image for System.Linq.Enumerable+EnumerablePartition`1[System.Char]

Exploiting Semantic Term Relations in Text Summarization

OAI: oai:igi-global.com:289607 • DOI: 10.4018/IJIRR.289607

Published by: IGI Global Scientific Publishing

Abstract

The traditional frequency based approach to creating multi-document extractive summary ranks sentences based on scores computed by summing up TF*IDF weights of words contained in the sentences. In this approach, TF or term frequency is calculated based on how frequently a term (word) occurs in the input and TF calculated in this way does not take into account the semantic relations among terms. In this paper, we propose methods that exploits semantic term relations for improving sentence ranking and redundancy removal steps of a summarization system. Our proposed summarization system has been tested on DUC 2003 and DUC 2004 benchmark multi-document summarization datasets. The experimental results reveal that performance of our multi-document text summarizer is significantly improved when the distributional term similarity measure is used for finding semantic term relations. Our multi-document text summarizer also outperforms some well known summarization baselines to which it is compared.

Information Retrieval Library and Information Science Information Retrieval