Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms

Authors

  • M. S. Bewoor Department of Computer Engineering, Bharati Vidyapeeth Deemed University College of Engineering, Pune, India
  • S. H. Patil Department of Computer Engineering, Bharati Vidyapeeth Deemed University College of Engineering, Pune, India
Volume: 8 | Issue: 1 | Pages: 2562-2567 | February 2018 | https://doi.org/10.48084/etasr.1775

Abstract

The availability of various digital sources has created a demand for text mining mechanisms. Effective summary generation mechanisms are needed in order to utilize relevant information from often overwhelming digital data sources. In this view, this paper conducts a survey of various single as well as multi-document text summarization techniques. It also provides analysis of treating a query sentence as a common one, segmented from documents for text summarization. Experimental results show the degree of effectiveness in text summarization over different clustering algorithms.

Keywords:

Text mining, text summarization, clustering

Downloads

Download data is not yet available.

References

A. Kaushik, S. Naithani, “A Comprehensive Study of Text Mining Approach”, International Journal of Computer Science and Network Security, Vol. 16, No. 2, pp. 69–76, 2016

R. Varadarajan, V. Hristidis, “A system for query-specific document summarization”, 15th ACM international conference on Information and knowledge management, pp. 622-631, 2006 DOI: https://doi.org/10.1145/1183614.1183703

J. Goldstein, V. Mittal, J. Carbonell, M. Kantrowitz, “Multi-Document Summarization By Sentence Extraction”, NAACL-ANLPWorkshop on Automatic summarization, Vol. 4, pp. 40–48, 2000 DOI: https://doi.org/10.3115/1567564.1567569

Y. J. Kumar, N. Salim, “Automatic multi document summarization approaches”, Journal of Computer Science, Vol. 8, No. 1, pp. 133–140, 2012 DOI: https://doi.org/10.3844/jcssp.2012.133.140

S. Gholamrezazadeh, M. A. Salehi, B. Gholamzadeh, “A comprehensive survey on text summarization systems”, 2nd International Conference on Computer Science and its Applications, pp. 1-6, 2009 DOI: https://doi.org/10.1109/CSA.2009.5404226

M. Steinbach, G. Karypis, V. Kumar, “A Comparison of Document Clustering Techniques”, KDD workshop on text mining, Vol. 400, No. 1, pp. 525-526, 2000

D. Vidyadharan, A. CR “A Query Based Summerizer Based on the Context ” International Journal of Science and Research, Vol. 4, No. 5, pp. 3018-3020, 2015

T. K. Fan, C. H. Chang, “Exploring Evolutionary Technical Trends from Academic Research Papers”, Eighth IAPR International Workshop on Document Analysis Systems, pp. 574-581, 2008 DOI: https://doi.org/10.1109/DAS.2008.25

D. Y. Sakhare, R. Kumar, “Syntactic and Sentence Feature Based Hybrid Approach for Text Summarization”, Internation Information Technology and Computer Science, Vol. 2014, No. 3, pp. 38–46, 2014 DOI: https://doi.org/10.5815/ijitcs.2014.03.05

M. N. Ingole, M. S. Bewoor, S. H. Patil, “Text Summarization using Expectation Maximization Clustering Algorithm”, International Journal of Engineering Research and Applications, Vol. 2, No. 4, pp. 168–171, 2012

V. J. Roma, M. S. Bewoor, S. H. Patil, “Automation Tool for Evaluation of NLP based Text Summary Generated through Summarization and Clustering Techniques by Quantitative and Qualitative Metrics”, International Journal of Computer Engineering and Technology, Vol. 4, No. 3, pp. 77–85, 2013

M. K. Gawali, M. S. Bewoor, S. H. Patil, “Review : Performance Evaluator of Optimized Text Summary Algorithm”, International Journal of Computer Science and Technology Vol. 4, No. 1, pp. 295–296, 2013

V. J. Roma, M. S. Bewoor, S. H. Patil, “Evaluator and Comparator : Document Summary Generation based on Quantitative and Qualitative Metrics for International Journal of Scientific & Engineering Research”, International Journal of Scientific & Engineering Research, Vol. 4, No. 5, pp. 1111–1115, 2013

A. Nenkova, “Automatic Text Summarization of Newswire: Lessons Learned from the Document Understanding Conference”, Association for the Advancement of Artificial Intelligence, Vol. 5, pp. 1436-1441, 2005

M. J. A. Eugster, Benchmark Experiments—A Tool for Analyzing Statistical Learning Algorithms, PhD Thesis, Ludwig-Maximilians-Universitat, 2011.

M. Hassel, Evaluation of Automatic Text Summarization, Licentiate Thesis, 2004

Downloads

How to Cite

[1]
M. S. Bewoor and S. H. Patil, “Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms”, Eng. Technol. Appl. Sci. Res., vol. 8, no. 1, pp. 2562–2567, Feb. 2018.

Metrics

Abstract Views: 691
PDF Downloads: 386

Metrics Information