Interpreting Deep Learning-based Vulnerability Detector Predictions Based on Heuristic Searching-Reference-Cited by-同舟云学术

Interpreting Deep Learning-based Vulnerability Detector Predictions Based on Heuristic Searching

Published:2021-03 Issue:2 Volume:30 Page:1-31
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Zou Deqing¹,Zhu Yawei¹,Xu Shouhuai²,Li Zhen³,Jin Hai¹,Ye Hengkai¹

Affiliation:

1. Huazhong University of Science and Technology, P.R. China

2. University of Texas at San Antonio, USA

3. Hebei University, P.R. China

Abstract

Detecting software vulnerabilities is an important problem and a recent development in tackling the problem is the use of deep learning models to detect software vulnerabilities. While effective, it is hard to explain why a deep learning model predicts a piece of code as vulnerable or not because of the black-box nature of deep learning models. Indeed, the interpretability of deep learning models is a daunting open problem. In this article, we make a significant step toward tackling the interpretability of deep learning model in vulnerability detection. Specifically, we introduce a high-fidelity explanation framework, which aims to identify a small number of tokens that make significant contributions to a detector’s prediction with respect to an example. Systematic experiments show that the framework indeed has a higher fidelity than existing methods, especially when features are not independent of each other (which often occurs in the real world). In particular, the framework can produce some vulnerability rules that can be understood by domain experts for accepting a detector’s outputs (i.e., true positives) or rejecting a detector’s outputs (i.e., false-positives and false-negatives). We also discuss limitations of the present study, which indicate interesting open problems for future research.

Funder

Natural Science Foundation of Hebei Province

Shenzhen Fundamental Research Program

National Natural Science Foundation of China

National Key Research and Development Plan of China

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3429444

Reference54 articles.

1. Checkmarx. 2020. Checkmarx—Application Security Testing and Static Code Analysis. Checkmarx Israel. Retrieved from https://www.checkmarx.com/. Checkmarx. 2020. Checkmarx—Application Security Testing and Static Code Analysis. Checkmarx Israel. Retrieved from https://www.checkmarx.com/.

2. Reza Abbasi-Asl and Bin Yu. 2017. Interpreting convolutional neural networks through compression. CoRR abs/1711.02329. Reza Abbasi-Asl and Bin Yu. 2017. Interpreting convolutional neural networks through compression. CoRR abs/1711.02329.

3. American Information Technology Laboratory 2020. National Vulnerability Database. American Information Technology Laboratory. Retrieved from https://nvd.nist.gov/. American Information Technology Laboratory 2020. National Vulnerability Database. American Information Technology Laboratory. Retrieved from https://nvd.nist.gov/.

4. American Information Technology Laboratory 2020. Software Assurance Reference Dataset. American Information Technology Laboratory. Retrieved from https://samate.nist.gov/SRD/. American Information Technology Laboratory 2020. Software Assurance Reference Dataset. American Information Technology Laboratory. Retrieved from https://samate.nist.gov/SRD/.

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Interpretable Vulnerability Detection Framework Based on Multi-task Learning;Communications in Computer and Information Science;2023-11-30

2. Broken Promises: Measuring Confounding Effects in Learning-based Vulnerability Discovery;Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security;2023-11-26

3. Good-looking but Lacking Faithfulness: Understanding Local Explanation Methods through Trend-based Testing;Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security;2023-11-15

4. An Empirical Study on Model-Agnostic Techniques for Source Code-Based Defect Prediction;International Journal of Software Engineering and Knowledge Engineering;2023-11-04

5. SlicedLocator: Code vulnerability locator based on sliced dependence graph;Computers & Security;2023-11