近些年来,图像分类在模式识别领域占据着重要的地位。迅速增长的图像数据对于图像信息的分析与处理提出了新的要求。卷积神经网络应运而生,以其强大的图像识别分类能力被广泛的应用于各种图像分类系统,并取得了十分显著的效果。本文首先,回顾了图像分类技术的发展历程,介绍了图样空间与特征空间的分类方法以及图像特征提取方法。其次,介绍了卷积神经网络在图像分类问题上的研究情况。最后,总结卷积神经网络在图像分类问题上存在的问题,以及未来的发展方向。 In recent years, image classification occupies an important position in pattern recognition. The rapid growth of image data for image information analysis and processing put forward new re-quirements. Convolution neural network came into being, with its powerful image recognition classification ability is widely used in a variety of image classification system, and achieved very significant results. Firstly, this paper reviews the development of image classification technology, and introduces the classification methods of feature space and feature space and the image fea-ture extraction methods. Secondly, the research of convolution neural network on image classi-fication is introduced. Finally, the problems existing in the image classification of convolution neural networks are summarized, and the future development direction is summarized.
杨泽明,刘军,薛程,于子红. 卷积神经网络在图像分类上的应用综述Summary of Application of Convolution Neural Network on Image Classification[J]. 人工智能与机器人研究, 2018, 07(01): 17-24. http://dx.doi.org/10.12677/AIRR.2018.71002
参考文献 (References)References许可. 卷积神经网络在图像识别上的应用的研究[D]: [硕士学位论文]. 杭州: 浙江大学, 2012.席晓聪. 图像分类方法研究[D]: [硕士学位论文]. 济南: 山东大学, 2013.Swain, M.J. and Ballard, D.H. (1991) Color Indexing. International Journal of Computer Vision, 7, 11-32.
<br>https://doi.org/10.1007/BF00130487Haralick, R.M. (1973) Texture Features for Image Classification. IEEE Trans-actions on Systems Man & Cybernetics, 3, 610-621.章志勇, 潘志庚, 张明敏, 等. 基于多尺度通用傅里叶描述子的灰度图像检索[J]. 中国图象图形学报, 2005, 10(5): 611-615.刘三军. 基于图像特征空间学习的图像分类方法研究[D]: [硕士学位论文]. 西安: 西安电子科技大学, 2014.Kirby, M. and Sirovich, L. (1990) Application of the Karhunen-Loeve Pro-cedure for the Characterization of Human Faces. IEEE Computer Society.Swets, D.L. and Weng, J. (1996) Using Discri-minant Eigenfeatures for Image Retrieval. IEEE Computer Society.Friedman, J.H. and Tukey, J.W. (2006) A Projection Pursuit Algorithm for Exploratory Data Analysis. IEEE Transactions on Computers, C-23, 881-890. <br>https://doi.org/10.1109/T-C.1974.224051Chiang, S.S., Chang, C.I. and Ginsberg, I.W. (2001) Unsupervised Target Detection in Hyperspectral Images Using Projection Pursuit. IEEE Transactions on Geoscience & Remote Sensing, 39, 1380-1391.
<br>https://doi.org/10.1109/36.934071Burges, C.J.C. (1998) A Tutorial on Support Vector Machines for Pattern Recog-nition. Kluwer Academic Publishers, Dordrecht.Zhang, D., Chen, S. and Zhou, Z.H. (2006) Recognizing Face or Object from a Single Image: Linear vs. Kernel Methods on 2D Patterns. Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshops, SSPR 2006 and SPR 2006, Hong Kong, 17-19 August 2006, 889-897.Mika, S., Smola, A. and Scholz, M. (1999) Kernel PCA and De-Noising in Feature Spaces. In: Conference on Advances in Neural Information Processing Systems II, MIT Press, Cambridge, 536-542.LeCun, Y.L., Bottou, L., Bengio, Y., et al. (1998) Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE, 86, 2278-2324. <br>https://doi.org/10.1109/5.726791Hubel, D.H. and Wiesel, T.N. (1962) Receptive Fields, Binocular Interaction and Functional Architecture in the Cat’s Visual Cortex. Journal of Physiology, 160, 106-154. <br> https://doi.org/10.1113/jphysiol.1962.sp006837Fukushima, K. (1980) Neo-cognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position. Biological Cybernetics, 36, 193-202. <br>https://doi.org/10.1007/BF00344251Hinton, G.E. and Salakhutdinov, R.R. (2006) Reducing the Dimensionality of Data with Neural Networks. Science, 313, 504-507. <br>https://doi.org/10.1126/science.1127647Deng, J., Dong, W., Socher, R., et al. (2009) ImageNet: A Large-Scale Hier-archical Image Database. Computer Vision and Pattern Recognition, Miami, 20-25 June 2009, 248-255.Ahonen, T., Hadid, A. and Pietikainen, M. (2006) Face Description with Local Binary Patterns: Application to Face Recognition. IEEE Transactions on Pattern Analysis & Machine Intelligence, 28, 2037-2041.
<br>https://doi.org/10.1109/TPAMI.2006.244Russakovsky, O., Deng, J., Su, H., et al. (2015) ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision, 115, 211-252. <br>https://doi.org/10.1007/s11263-015-0816-yKrizhevsky, A., Sutskever, I. and Hinton, G.E. (2017) ImageNet Classi-fication with Deep Convolutional Neural Networks. Communications of the ACM, 60, 84-90.Zeiler, M.D. and Fergus, R. (2013) Visualizing and Understanding Convolutional Networks. Computer Vision, Zurich, 6-12 September 2014, 818-833.Szegedy, C., Liu, W., Jia, Y., et al. (2015) Going Deeper with Convolutions. Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 1-9.Lin, M., Chen, Q. and Yan, S. (2013) Network in Network. International Con-ference on Learning Representations, Scottsdale, 2-4 May 2013, 10.He, K., Zhang, X., Ren, S., et al. (2015) Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis & Machine Intelligence, 37, 1904-1916.
<br>https://doi.org/10.1109/TPAMI.2015.2389824He, K., Zhang, X., Ren, S., et al. (2015) Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. IEEE International Conference on Computer Vision, Santiago, 7-13 December 2015, 1026-1034.Ioffe, S. and Szegedy, C. (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. International Conference on Machine Learning, Lille, 6-11 July 2015, 448-456.Jaderberg, M., Simonyan, K., Zisserman, A., et al. (2015) Spatial Transformer Networks. Neural Information Processing Systems, Montreal, 7-12 December 2015, 2017-2025.He, K., Zhang, X., Ren, S., et al. (2015) Deep Residual Learning for Image Recognition. Computer Vision and Pattern Recognition, Las Vegas, 26 June-1 July 2016, 770-778.Nguyen, A., Yosinski, J. and Clune, J. (2014) Deep Neural Networks Are Easily Fooled: High Confidence Pre-dictions for Unrecognizable Images. Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 427-436.