An Enhanced Random Forest (ERF)-based Machine Learning Framework for Resampling, Prediction, and Classification of Mobile Applications using Textual Features
Received: 1 October 2024 | Revised: 1 November 2024 and 14 November 2024 | Accepted: 5 December 2024 | Online: 13 December 2024
Corresponding author: Aitizaz Ali
Abstract
The amount of mobile applications is increasing rapidly, and it is difficult for software developers to identify the numerous key factors that affect their rating and performance. This study presents a machine-learning framework to improve decisions in adding new features to mobile applications and enhancing overall performance. A dataset of app attributes from the Apple AppStore was used, exploiting NLP techniques to preprocess the textual information and develop an Enhanced Random Forest (ERF) framework to assess and forecast ratings for multifunctional apps and investigate the connections between features and user ratings. The ERF model was compared with other renowned ML methods including Decision Trees (DT), Naive Bayes (NB), CNN, and ANN. The experimental results showed that the proposed model predicts app ratings more effectively compared to other complex models. The proposed model achieved precision, recall, and F1-score of 92.76%, 99.33%, and 95.93%, respectively.
Keywords:
machine learning, reliability, mobile applications, sustainable learning, predicting mobile app ratings, user ratings, XGBoost, random forest, NLP high-dimensional datasets, convolutional neural network (CNN), NSL-KDD, UNSW-NB15, mean square errorDownloads
References
P. M. Dhulavvagol and S. G. Totad, "Performance Enhancement of Distributed Processing Systems Using Novel Hybrid Shard Selection Algorithm," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13720–13725, Apr. 2024.
J. Song, J. Kim, D. R. Jones, J. Baker, and W. W. Chin, "Application discoverability and user satisfaction in mobile application stores: An environmental psychology perspective," Decision Support Systems, vol. 59, pp. 37–51, Mar. 2014.
C. Z. Liu, Y. A. Au, and H. S. Choi, "Effects of Freemium Strategy in the Mobile App Market: An Empirical Study of Google Play," Journal of Management Information Systems, vol. 31, no. 3, pp. 326–354, Jul. 2014.
A. Ghose and S. P. Han, "An Empirical Analysis of User Content Generation and Usage Behavior on the Mobile Internet," Management Science, vol. 57, no. 9, pp. 1671–1691, Sep. 2011.
D. W. Hosmer, S. Lemeshow, and R. X. Sturdivant, Applied Logistic Regression. John Wiley & Sons, 2013.
L. Breiman, "Random Forests," Machine Learning, vol. 45, no. 1, pp. 5–32, Oct. 2001.
T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, Aug. 2016, pp. 785–794.
V. Balakrishnan, Z. Shi, C. L. Law, R. Lim, L. L. Teh, and Y. Fan, "A deep learning approach in predicting products’ sentiment ratings: a comparative analysis," The Journal of Supercomputing, vol. 78, no. 5, pp. 7206–7226, Apr. 2022.
M. R. Dehkordi, H. Seifzadeh, G. Beydoun, and M. H. Nadimi-Shahraki, "Success prediction of android applications in a novel repository using neural networks," Complex & Intelligent Systems, vol. 6, no. 3, pp. 573–590, Oct. 2020.
X. Wu and Y. Zhu, "A Hybrid Approach Based on Collaborative Filtering to Recommending Mobile Apps," in 2016 IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS), Wuhan, China, Dec. 2016, pp. 8–15.
B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1–2, pp. 1–135, Jul. 2008.
M. C. Chiu, J. H. Huang, S. Gupta, and G. Akman, "Developing a personalized recommendation system in a smart product service system based on unsupervised learning model," Computers in Industry, vol. 128, Jun. 2021, Art. no. 103421.
N. Jindal and B. Liu, "Opinion spam and analysis," in Proceedings of the international conference on Web search and web data mining - WSDM ’08, Palo Alto, CA, USA, 2008, pp. 219-230.
M. T. Ribeiro, S. Singh, and C. Guestrin, "‘Why Should I Trust You?’: Explaining the Predictions of Any Classifier," in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, Aug. 2016, pp. 1135–1144.
S. M. Mudambi and D. Schuff, "Research Note: What Makes a Helpful Online Review? A Study of Customer Reviews on Amazon.com," MIS Quarterly, vol. 34, no. 1, pp. 185–200, 2010.
G. Prakash, "Apple AppStore Apps." [Online]. Available: https://www.kaggle.com/datasets/gauthamp10/apple-appstore-apps.
Downloads
How to Cite
License
Copyright (c) 2024 Shahbaz Hussain, Nadeem Sarwar, Arshad Ali, Hamayun Khan, Irfanud Din, Abdullah M. Alqahtani, Mohamed Shabir, aitizaz.ali@apu.edu.my
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.