Affiliation:
1. Yangzhou University, Hefei University of Technology, China
2. Hefei University of Technology, China
3. Mininglamp Academy of Sciences, Minininglamp and Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei, China
Abstract
Deep learning seeks to achieve excellent performance for representation learning in image datasets. However, supervised deep learning models such as convolutional neural networks require a large number of labeled image data, which is intractable in applications, while unsupervised deep learning models like stacked denoising auto-encoder cannot employ label information. Meanwhile, the redundancy of image data incurs performance degradation on representation learning for aforementioned models. To address these problems, we propose a semi-supervised deep learning framework called stacked convolutional sparse auto-encoder, which can learn robust and sparse representations from image data with fewer labeled data records. More specifically, the framework is constructed by stacking layers. In each layer, higher layer feature representations are generated by features of lower layers in a convolutional way with kernels learned by a sparse auto-encoder. Meanwhile, to solve the data redundance problem, the algorithm of Reconstruction Independent Component Analysis is designed to train on patches for sphering the input data. The label information is encoded using a Softmax Regression model for semi-supervised learning. With this framework, higher level representations are learned by layers mapping from image data. It can boost the performance of the base subsequent classifiers such as support vector machines. Extensive experiments demonstrate the superior classification performance of our framework compared to several state-of-the-art representation learning methods.
Funder
Natural Science Foundation of China
National Key Research and Development Program of China
Ministry of Education
Publisher
Association for Computing Machinery (ACM)
Reference56 articles.
1. Stimulus Specific Responses from Beyond the Classical Receptive Field: Neurophysiological Mechanisms for Local-Global Comparisons in Visual Neurons
2. Learning Deep Architectures for AI
3. Yoshua Bengio Yann LeCun and Donnie Henderson. 1993. Globally trained handwritten word recognizer using spatial representation convolutional neural networks and hidden Markov models. In Advances in Neural Information Processing Systems. 937--944. Yoshua Bengio Yann LeCun and Donnie Henderson. 1993. Globally trained handwritten word recognizer using spatial representation convolutional neural networks and hidden Markov models. In Advances in Neural Information Processing Systems. 937--944.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献