Inverse Abstraction of Neural Networks Using Symbolic Interpolation-Reference-Cited by-同舟云学术

Inverse Abstraction of Neural Networks Using Symbolic Interpolation

Published:2019-07-17 Issue: Volume:33 Page:3437-3444
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Dathathri Sumanth,Gao Sicun,Murray Richard M.

Abstract

Neural networks in real-world applications have to satisfy critical properties such as safety and reliability. The analysis of such properties typically requires extracting information through computing pre-images of the network transformations, but it is well-known that explicit computation of pre-images is intractable. We introduce new methods for computing compact symbolic abstractions of pre-images by computing their overapproximations and underapproximations through all layers. The abstraction of pre-images enables formal analysis and knowledge extraction without affecting standard learning algorithms. We use inverse abstractions to automatically extract simple control laws and compact representations for pre-images corresponding to unsafe outputs. We illustrate that the extracted abstractions are interpretable and can be used for analyzing complex properties.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Certified Quantization Strategy Synthesis for Neural Networks;Lecture Notes in Computer Science;2024-09-11

2. Provable Preimage Under-Approximation for Neural Networks;Lecture Notes in Computer Science;2024