Identify Severity Bug Report with Distribution Imbalance by CR-SMOTE and ELM-Reference-Cited by-同舟云学术

Identify Severity Bug Report with Distribution Imbalance by CR-SMOTE and ELM

Published:2019-02 Issue:02 Volume:29 Page:139-175
ISSN:0218-1940
Container-title:International Journal of Software Engineering and Knowledge Engineering
language:en
Short-container-title:Int. J. Soft. Eng. Knowl. Eng.

Author:

Guo Shikai¹,Chen Rong¹,Li Hui¹,Zhang Tianlun¹,Liu Yaqing¹²

Affiliation:

1. College of Information Science and Technology, Dalian Maritime University, Dalian 116026, P. R. China

2. Key Laboratory of Symbolic Computation and Knowledge, Engineering of Ministry of Education, Jilin University, Changchun 130012, P. R. China

Abstract

Manually inspecting bugs to determine their severity is often an enormous but essential software development task, especially when many participants generate a large number of bug reports in a crowdsourced software testing context. Therefore, boosting the capabilities of methods of predicting bug report severity is critically important for determining the priority of fixing bugs. However, typical classification techniques may be adversely affected when the severity distribution of the bug reports is imbalanced, leading to performance degradation in a crowdsourcing environment. In this study, we propose an enhanced oversampling approach called CR-SMOTE to enhance the classification of bug reports with a realistically imbalanced severity distribution. The main idea is to interpolate new instances into the minority category that are near the center of existing samples in that category. Then, we use an extreme learning machine (ELM) — a feedforward neural network with a single layer of hidden nodes — to predict the bug severity. Several experiments were conducted on three datasets from real bug repositories, and the results statistically indicate that the presented approach is robust against real data imbalance when predicting the severity of bug reports. The average accuracies achieved by the ELM in predicting the severity of Eclipse, Mozilla, and GNOME bug reports were 0.780, 0.871, and 0.861, which are higher than those of classifiers by 4.36%, 6.73%, and 2.71%, respectively.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Networks and Communications,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218194019500074

Reference23 articles.

1. Learning from Imbalanced Data

2. Mining with rarity

3. SMOTE: Synthetic Minority Over-sampling Technique

4. An experimental comparison of classification algorithms for imbalanced credit scoring data sets

Cited by 117 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Predicting Bug Severity Using Machine Learning and Ensemble Learning Techniques;2023 14th International Conference on Information and Communication Systems (ICICS);2023-11-21

2. A Deep Learning Approach for Varicocele Detection from Ultrasound Images;2023 14th International Conference on Information and Communication Systems (ICICS);2023-11-21

3. Automatically Tagging the “AAA” Pattern in Unit Test Cases Using Machine Learning Models;IEEE Transactions on Software Engineering;2023-05-01

4. Predicting product advertisement links using hybrid learning within social networks;The Journal of Supercomputing;2023-04-13

5. Prioritizing tasks in software development: A systematic literature review;PLOS ONE;2023-04-06