Automatic Bug Triaging Process: An Enhanced Machine Learning Approach through Large Language Models
Received: 28 August 2024 | Revised: 27 September 2024 | Accepted: 5 October 2024 | Online: 2 December 2024
Corresponding author: Deepshikha Chhabra
Abstract
Bug resolution and maintenance are the most critical phases of the software development life cycle. The traditional bug triaging concept refers to the manual assignment of bugs to the appropriate developer after reading the bug details from the bug tracker and further resolving it. The advent of machine learning algorithms provides various solutions for automated bug triaging. Machine learning algorithms can be used to classify bugs and assign each to a developer. Reducing manual efforts optimizes bug-triaging by utilizing manpower in other software development processes. Furthermore, machine learning Large Language Models (LLMs) can be used to take advantage of their natural language processing features and capabilities. This study proposes a machine learning-based embed chain LLM approach for automatic bug triaging. This approach is used to automatically classify bug reports. Based on the results, the appropriate developer is recommended. In addition, the proposed approach is used to automatically predict the priority of bug reports. This paper also discusses the strengths and challenges of the proposed approach.
Keywords:
bug triaging, machine learning, large language model, embed chainDownloads
References
R. Wang, X. Ji, S. Xu, Y. Tian, S. Jiang, and R. Huang, "An empirical assessment of different word embedding and deep learning models for bug assignment," Journal of Systems and Software, vol. 210, Apr. 2024, Art. no. 111961.
G. Catolino, F. Palomba, A. Zaidman, and F. Ferrucci, "Not all bugs are the same: Understanding, characterizing, and classifying bug types," Journal of Systems and Software, vol. 152, pp. 165–181, Jun. 2019.
R. Chen, S.-K. Guo, X. Z. Wang, and T. L. Zhang, "Fusion of Multi-RSMOTE With Fuzzy Integral to Classify Bug Reports With an Imbalanced Distribution," IEEE Transactions on Fuzzy Systems, vol. 27, no. 12, pp. 2406–2420, Sep. 2019.
G. Parthasarathy, D. C. Tomar, and B. John, "Analysis of Bug Triage using Data Preprocessing (Reduction) Techniques," International Journal of Computer Applications, vol. 125, no. 9, pp. 8–15, 2015.
M. N. A. Khan, A. M. Mirza, R. A. Wagan, M. Shahid, and I. Saleem, "A Literature Review on Software Testing Techniques for Smartphone Applications," Engineering, Technology & Applied Science Research, vol. 10, no. 6, pp. 6578–6583, Dec. 2020.
S. Q. Xi, Y. Yao, X. S. Xiao, F. Xu, and J. Lv, "Bug Triaging Based on Tossing Sequence Modeling," Journal of Computer Science and Technology, vol. 34, no. 5, pp. 942–956, Sep. 2019.
T. Zhang, J. Chen, G. Yang, B. Lee, and X. Luo, "Towards more accurate severity prediction and fixer recommendation of software bugs," Journal of Systems and Software, vol. 117, pp. 166–184, Jul. 2016.
B. S. Neysiani and S. M. Babamir, "New labeled dataset of interconnected lexical typos for automatic correction in the bug reports," SN Applied Sciences, vol. 1, no. 11, Oct. 2019, Art. no. 1385.
D. Dreyton, A. A. Araújo, A. Dantas, R. Saraiva, and J. Souza, "A Multi-objective Approach to Prioritize and Recommend Bugs in Open Source Repositories," in Search Based Software Engineering, Raleigh, NC, USA, 2016, pp. 143–158.
E. Eiroa-Lledo, R. H. Ali, G. Pinto, J. Anderson, and E. Linstead, "Large-Scale Identification and Analysis of Factors Impacting Simple Bug Resolution Times in Open Source Software Repositories," Applied Sciences, vol. 13, no. 5, Jan. 2023, Art. no. 3150.
A. Kukkar et al., "ProRE: An ACO- based programmer recommendation model to precisely manage software bugs," Journal of King Saud University - Computer and Information Sciences, vol. 35, no. 1, pp. 483–498, Jan. 2023.
R. Chauhan, S. Sharma, and A. Goyal, "DENATURE: duplicate detection and type identification in open source bug repositories," International Journal of System Assurance Engineering and Management, vol. 14, no. 1, pp. 275–292, Mar. 2023.
R. R. Panda and N. K. Nagwani, "Topic modeling and intuitionistic fuzzy set-based approach for efficient software bug triaging," Knowledge and Information Systems, vol. 64, no. 11, pp. 3081–3111, Nov. 2022.
Y. Liu, X. Qi, J. Zhang, H. Li, X. Ge, and J. Ai, "Automatic Bug Triaging via Deep Reinforcement Learning," Applied Sciences, vol. 12, no. 7, Jan. 2022, Art. no. 3565.
T. W. W. Aung, Y. Wan, H. Huo, and Y. Sui, "Multi-triage: A multi-task learning framework for bug triage," Journal of Systems and Software, vol. 184, Feb. 2022, Art. no. 111133.
Y. Su et al., "Reducing Bug Triaging Confusion by Learning from Mistakes with a Bug Tossing Knowledge Graph," in 2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE), Melbourne, Australia, Nov. 2021, pp. 191–202.
H. Jahanshahi, K. Chhabra, M. Cevik, and A. Baþar, "DABT: A Dependency-aware Bug Triaging Method," in Evaluation and Assessment in Software Engineering, Trondheim Norway, Jun. 2021, pp. 221–230.
E. Tüzün, A. Çetin, and E. Doğan, "An Automated Bug Triaging Approach using Deep Learning: A Replication Study," European Journal of Science and Technology, no. 21, pp. 268–274, Jan. 2021.
A. D. Sawadogo, Q. Guimard, T. F. Bissyandé, A. K. Kaboré, J. Klein, and N. Moha, "Early Detection of Security-Relevant Bug Reports using Machine Learning: How Far Are We?" arXiv, Dec. 19, 2021.
H. Wu, Y. Ma, Z. Xiang, C. Yang, and K. He, "A spatial–temporal graph neural network framework for automated software bug triaging," Knowledge-Based Systems, vol. 241, Apr. 2022, Art. no. 108308.
A. Goyal and N. Sardana, "Feature Ranking and Aggregation for Bug Triaging in Open-Source Issue Tracking Systems," in 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, Jan. 2021, pp. 871–876.
C. Gupta and M. M. Freire, "A decentralized blockchain oriented framework for automated bug assignment," Information and Software Technology, vol. 134, Jun. 2021, Art. no. 106540.
Y. Kashiwa and M. Ohira, "A Release-Aware Bug Triaging Method Considering Developers’ Bug-Fixing Loads," IEICE Transactions on Information and Systems, vol. E103.D, no. 2, pp. 348–362, Feb. 2020.
C. A. Choquette-Choo, D. Sheldon, J. Proppe, J. Alphonso-Gibbs, and H. Gupta, "A Multi-label, Dual-Output Deep Neural Network for Automated Bug Triaging," in 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), Boca Raton, FL, USA, Dec. 2019, pp. 937–944.
A. Sarkar, P. C. Rigby, and B. Bartalos, "Improving Bug Triaging with High Confidence Predictions at Ericsson," in 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME), Cleveland, OH, USA, Sep. 2019, pp. 81–91.
S. Mani, A. Sankaran, and R. Aralikatte, "DeepTriage: Exploring the Effectiveness of Deep Learning for Bug Triaging," in Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, Kolkata, India, Jan. 2019, pp. 171–179.
X. Xia, D. Lo, Y. Ding, J. M. Al-Kofahi, T. N. Nguyen, and X. Wang, "Improving Automated Bug Triaging with Specialized Topic Model," IEEE Transactions on Software Engineering, vol. 43, no. 3, pp. 272–297, Mar. 2017.
S. F. A. Zaidi, H. Woo, and C.-G. Lee, "A Graph Convolution Network-Based Bug Triage System to Learn Heterogeneous Graph Representation of Bug Reports," IEEE Access, vol. 10, pp. 20677–20689, 2022.
C. Richter and H. Wehrheim, "Learning Realistic Mutations: Bug Creation for Neural Bug Detectors," in 2022 IEEE Conference on Software Testing, Verification and Validation (ICST), Valencia, Spain, Apr. 2022, pp. 162–173.
A. Gartziandia et al., "Machine learning-based test oracles for performance testing of cyber-physical systems: An industrial case study on elevators dispatching algorithms," Journal of Software: Evolution and Process, vol. 34, no. 11, 2022, Art. no. e2465.
S. T. Cynthia, B. Roy, and D. Mondal, "Feature Transformation for Improved Software Bug Detection Models," in 15th Innovations in Software Engineering Conference, Gandhinagar, India, Feb. 2022, pp. 1–10.
M. Fejzer, J. Narębski, P. Przymus, and K. Stencel, "Tracking Buggy Files: New Efficient Adaptive Bug Localization Algorithm," IEEE Transactions on Software Engineering, vol. 48, no. 7, pp. 2557–2569, Jul. 2022.
R. Sepahvand, R. Akbari, B. Jamasb, S. Hashemi, and O. Boushehrian, "Using word embedding and convolution neural network for bug triaging by considering design flaws," Science of Computer Programming, vol. 228, Jun. 2023, Art. no. 102945.
O. Picus and C. Serban, "Bugsby: a tool support for bug triage automation," in Proceedings of the 2nd ACM International Workshop on AI and Software Testing/Analysis, Jul. 2022, pp. 17–20.
J. Johnson, J. Mahmud, T. Wendland, K. Moran, J. Rubin, and M. Fazzini, "An Empirical Investigation into the Reproduction of Bug Reports for Android Apps," in 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Honolulu, HI, USA, Mar. 2022, pp. 321–322.
J. Jang and G. Yang, "A Bug Triage Technique Using Developer-Based Feature Selection and CNN-LSTM Algorithm," Applied Sciences, vol. 12, no. 18, Jan. 2022, Art. no. 9358.
M. M. Morovati, A. Nikanjam, F. Tambon, F. Khomh, and Z. M. (Jack) Jiang, "Bug characterization in machine learning-based systems," Empirical Software Engineering, vol. 29, no. 1, Dec. 2023, Art. no. 14.
S. Kumara, "Software bugs reports." [Online]. Available: https://www.kaggle.com/datasets/samanthakumara/software-bug-reports.
Downloads
How to Cite
License
Copyright (c) 2024 Deepshikha Chhabra, Raman Chadha
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.