Sentiment Classification based on Machine Learning Approaches in Amazon Product Reviews
Received: 15 March 2023 | Revised: 14 April 2023 | Accepted: 23 April 2023 | Online: 7 May 2023
Corresponding author: Mohammad Abu Kausar
Abstract
Online retailers and merchants increasingly request feedback from their clients on the products they purchase. This has led to a significant increase in the number of product reviews posted online, as more people are making purchases online. The opinions expressed in these customer reviews have a significant impact on other customers' purchase decisions, as they are influenced by other customers' recommendations or complaints. This study used Amazon, a well-known and widely used e-commerce platform, to examine sentiment categorization using several machine learning techniques while analyzing an Amazon Reviews dataset. At first, the reviews were transformed into vector representations using the Bag-of-Words approach. Word cloud was used to illustrate the text data in terms of the frequency they appear in the review. Subsequently, the machine learning methods decision trees and logistic regression were used. The two models used in this study achieved high levels of accuracy in analyzing the dataset. Specifically, the Decision Tree model outperformed the Logistic Regression one, achieving an impressive accuracy of 99% compared to the 94% of the latter.
Keywords:
Amazon customer reviews, sentiment analysis, dataset, feature extraction, text classification, machine learningDownloads
References
X. Fang and J. Zhan, "Sentiment analysis using product review data," Journal of Big Data, vol. 2, no. 1, Jun. 2015, Art. no. 5. DOI: https://doi.org/10.1186/s40537-015-0015-2
J. McAuley, "Amazon product data," Recommender Systems and Personalization Datasets. https://cseweb.ucsd.edu/~jmcauley/datasets.html#amazon_reviews.
M. A. Kausar, A. Soosaimanicka, and M. Nasar, "Public Sentiment Analysis on Twitter Data during COVID-19 Outbreak," International Journal of Advanced Computer Science and Applications, vol. 12, no. 2, 2021. DOI: https://doi.org/10.14569/IJACSA.2021.0120252
M. Mahyoob, J. Algaraady, M. Alrahiali, and A. Alblwi, "Sentiment Analysis of Public Tweets Towards the Emergence of SARS-CoV-2 Omicron Variant: A Social Media Analytics Framework," Engineering, Technology & Applied Science Research, vol. 12, no. 3, pp. 8525–8531, Jun. 2022. DOI: https://doi.org/10.48084/etasr.4865
N. Nandal, R. Tanwar, and J. Pruthi, "Machine learning based aspect level sentiment analysis for Amazon products," Spatial Information Research, vol. 28, no. 5, pp. 601–607, Oct. 2020. DOI: https://doi.org/10.1007/s41324-020-00320-2
V. Schoenmueller, O. Netzer, and F. Stahl, "The Polarity of Online Reviews: Prevalence, Drivers and Implications," Journal of Marketing Research, vol. 57, no. 5, pp. 853–877, Oct. 2020. DOI: https://doi.org/10.1177/0022243720941832
Md. A. Kausar, V. S. Dhaka, and S. K. Singh, "An Effective Parallel Web Crawler based on Mobile Agent and Incremental Crawling," Journal of Industrial and Intelligent Information, vol. 1, no. 2, pp. 86–90, Jun. 2013. DOI: https://doi.org/10.12720/jiii.1.1.86-90
I. Karamitsos, S. Albarhami, and C. Apostolopoulos, "Tweet Sentiment Analysis (TSA) for Cloud Providers Using Classification Algorithms and Latent Semantic Analysis," Journal of Data Analysis and Information Processing, vol. 7, no. 4, Nov. 2019, Art. no. 69212. DOI: https://doi.org/10.4236/jdaip.2019.74016
U. P. Gurav and S. Kotrappa, "Sentiment Aware Stock Price Forecasting using an SA-RNN-LBL Learning Model," Engineering, Technology & Applied Science Research, vol. 10, no. 5, pp. 6356–6361, Oct. 2020. DOI: https://doi.org/10.48084/etasr.3805
A. Rahman and M. N. A. Khan, "A Classification Based Model to Assess Customer Behavior in Banking Sector," Engineering, Technology & Applied Science Research, vol. 8, no. 3, pp. 2949–2953, Jun. 2018. DOI: https://doi.org/10.48084/etasr.1917
V. K. Jain, S. Kumar, and P. Mahanti, "Sentiment Recognition in Customer Reviews Using Deep Learning," International Journal of Enterprise Information Systems (IJEIS), vol. 14, no. 2, pp. 77–86, Apr. 2018. DOI: https://doi.org/10.4018/IJEIS.2018040105
J. Lim, M. Park, S. Anitsal, M. M. Anitsal, and I. Anitsal, "Retail Customer Sentiment Analysis: Customers’ Reviews of Top Ten U.S. Retailers’ Performance," Global Journal of Managment and Marketing, vol. 3, no. 1, pp. 124–150, 2019.
R. S. Jagdale, V. S. Shirsat, and S. N. Deshmukh, "Sentiment Analysis on Product Reviews Using Machine Learning Techniques," in Cognitive Informatics and Soft Computing, Singapore, 2019, pp. 639–647. DOI: https://doi.org/10.1007/978-981-13-0617-4_61
V. Vyas and V. Uma, "Approaches to Sentiment Analysis on Product Reviews," in Sentiment Analysis and Knowledge Discovery in Contemporary Business, IGI Global, 2019, pp. 15–30. DOI: https://doi.org/10.4018/978-1-5225-4999-4.ch002
Md. A. Kausar, V. S. Dhaka, and S. K. Singh, "Web Crawler Based on Mobile Agent and Java Aglets," International Journal of Information Technology and Computer Science, vol. 5, no. 10, pp. 85–91, Sep. 2013. DOI: https://doi.org/10.5815/ijitcs.2013.10.09
S. Govindaraj and K. Gopalakrishnan, "Intensified Sentiment Analysis of Customer Product Reviews Using Acoustic and Textual Features," ETRI Journal, vol. 38, no. 3, pp. 494–501, 2016. DOI: https://doi.org/10.4218/etrij.16.0115.0684
M. Ghasemaghaei, S. P. Eslami, K. Deal, and K. Hassanein, "Reviews’ length and sentiment as correlates of online reviews’ ratings," Internet Research, vol. 28, no. 3, pp. 544–563, Jan. 2018. DOI: https://doi.org/10.1108/IntR-12-2016-0394
P. Sasikala and L. Mary Immaculate Sheela, "Sentiment analysis of online product reviews using DLMNN and future prediction of online product using IANFIS," Journal of Big Data, vol. 7, no. 1, May 2020, Art. no. 33. DOI: https://doi.org/10.1186/s40537-020-00308-7
S. K. Sharma, S. Chakraborti, and T. Jha, "Analysis of book sales prediction at Amazon marketplace in India: a machine learning approach," Information Systems and e-Business Management, vol. 17, no. 2, pp. 261–284, Dec. 2019. DOI: https://doi.org/10.1007/s10257-019-00438-3
A. Y. L. Chong, B. Li, E. W. T. Ngai, E. Ch’ng, and F. Lee, "Predicting online product sales via online reviews, sentiments, and promotion strategies: A big data architecture and neural network approach," International Journal of Operations & Production Management, vol. 36, no. 4, pp. 358–383, Jan. 2016. DOI: https://doi.org/10.1108/IJOPM-03-2015-0151
J. Du, J. Rong, S. Michalska, H. Wang, and Y. Zhang, "Feature selection for helpfulness prediction of online product reviews: An empirical study," PLOS ONE, vol. 14, no. 12, 2019, Art. no. e0226902. DOI: https://doi.org/10.1371/journal.pone.0226902
Meenakshi, A. Banerjee, N. Intwala, and V. Sawant, "Sentiment Analysis of Amazon Mobile Reviews," in ICT Systems and Sustainability, Singapore, 2020, pp. 43–52. DOI: https://doi.org/10.1007/978-981-15-0936-0_4
K. Q. Anh, Y. Nagai, and L. M. Nguyen, "Extracting Customer Reviews from Online Shopping and Its Perspective on Product Design," Vietnam Journal of Computer Science, vol. 06, no. 01, pp. 43–56, Feb. 2019. DOI: https://doi.org/10.1142/S2196888819500088
F. Pedregosa et al., "Scikit-learn: Machine Learning in Python," The Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
A. Tripathy, A. Agrawal, and S. K. Rath, "Classification of sentiment reviews using n-gram machine learning approach," Expert Systems with Applications, vol. 57, pp. 117–126, Sep. 2016. DOI: https://doi.org/10.1016/j.eswa.2016.03.028
Downloads
How to Cite
License
Copyright (c) 2023 Mohammad Abu Kausar, Sallam Osman Fageeri, Arockiasamy Soosaimanickam
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.