Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms
Abstract
The availability of various digital sources has created a demand for text mining mechanisms. Effective summary generation mechanisms are needed in order to utilize relevant information from often overwhelming digital data sources. In this view, this paper conducts a survey of various single as well as multi-document text summarization techniques. It also provides analysis of treating a query sentence as a common one, segmented from documents for text summarization. Experimental results show the degree of effectiveness in text summarization over different clustering algorithms.
Keywords:
Text mining, text summarization, clusteringDownloads
References
A. Kaushik, S. Naithani, “A Comprehensive Study of Text Mining Approach”, International Journal of Computer Science and Network Security, Vol. 16, No. 2, pp. 69–76, 2016
R. Varadarajan, V. Hristidis, “A system for query-specific document summarization”, 15th ACM international conference on Information and knowledge management, pp. 622-631, 2006 DOI: https://doi.org/10.1145/1183614.1183703
J. Goldstein, V. Mittal, J. Carbonell, M. Kantrowitz, “Multi-Document Summarization By Sentence Extraction”, NAACL-ANLPWorkshop on Automatic summarization, Vol. 4, pp. 40–48, 2000 DOI: https://doi.org/10.3115/1567564.1567569
Y. J. Kumar, N. Salim, “Automatic multi document summarization approaches”, Journal of Computer Science, Vol. 8, No. 1, pp. 133–140, 2012 DOI: https://doi.org/10.3844/jcssp.2012.133.140
S. Gholamrezazadeh, M. A. Salehi, B. Gholamzadeh, “A comprehensive survey on text summarization systems”, 2nd International Conference on Computer Science and its Applications, pp. 1-6, 2009 DOI: https://doi.org/10.1109/CSA.2009.5404226
M. Steinbach, G. Karypis, V. Kumar, “A Comparison of Document Clustering Techniques”, KDD workshop on text mining, Vol. 400, No. 1, pp. 525-526, 2000
D. Vidyadharan, A. CR “A Query Based Summerizer Based on the Context ” International Journal of Science and Research, Vol. 4, No. 5, pp. 3018-3020, 2015
T. K. Fan, C. H. Chang, “Exploring Evolutionary Technical Trends from Academic Research Papers”, Eighth IAPR International Workshop on Document Analysis Systems, pp. 574-581, 2008 DOI: https://doi.org/10.1109/DAS.2008.25
D. Y. Sakhare, R. Kumar, “Syntactic and Sentence Feature Based Hybrid Approach for Text Summarization”, Internation Information Technology and Computer Science, Vol. 2014, No. 3, pp. 38–46, 2014 DOI: https://doi.org/10.5815/ijitcs.2014.03.05
M. N. Ingole, M. S. Bewoor, S. H. Patil, “Text Summarization using Expectation Maximization Clustering Algorithm”, International Journal of Engineering Research and Applications, Vol. 2, No. 4, pp. 168–171, 2012
V. J. Roma, M. S. Bewoor, S. H. Patil, “Automation Tool for Evaluation of NLP based Text Summary Generated through Summarization and Clustering Techniques by Quantitative and Qualitative Metrics”, International Journal of Computer Engineering and Technology, Vol. 4, No. 3, pp. 77–85, 2013
M. K. Gawali, M. S. Bewoor, S. H. Patil, “Review : Performance Evaluator of Optimized Text Summary Algorithm”, International Journal of Computer Science and Technology Vol. 4, No. 1, pp. 295–296, 2013
V. J. Roma, M. S. Bewoor, S. H. Patil, “Evaluator and Comparator : Document Summary Generation based on Quantitative and Qualitative Metrics for International Journal of Scientific & Engineering Research”, International Journal of Scientific & Engineering Research, Vol. 4, No. 5, pp. 1111–1115, 2013
A. Nenkova, “Automatic Text Summarization of Newswire: Lessons Learned from the Document Understanding Conference”, Association for the Advancement of Artificial Intelligence, Vol. 5, pp. 1436-1441, 2005
M. J. A. Eugster, Benchmark Experiments—A Tool for Analyzing Statistical Learning Algorithms, PhD Thesis, Ludwig-Maximilians-Universitat, 2011.
M. Hassel, Evaluation of Automatic Text Summarization, Licentiate Thesis, 2004
Downloads
How to Cite
License
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.