Received: 20-07-2020
Accepted: 02-09-2020
DOI:
Views
Downloads
How to Cite:
Content-based Image Retrieval with Convolutional Neural Networks and Binary Hashing Method
Keywords
Content-based image retrieval, CBIR, convolutional neural networks, CNN, binary hashing
Abstract
Content-based image retrieval has received great attention in recent years because this method overcomes the disadvantages of the text-based image retrieval that is not affected by the lack of or wrong of the text attached to the image. In addition, deep learning methods such as convolutional neural networks have demonstrated their ability to process large-sized data, especially computer vision and image processing. The aims of this study was develop a content-based image retrieval program and method to reduce image query time using the convolutional neural network (CNN). Also, we combined CNN with a binary hashing method to improve image retrieval time. The experimental results on CIFAR-10 and MNIST data sets showed that combining CNN with the binary hashing method for content-based image retrieval achieved an accuracy of approximately 89% on CIFAR-10,98% on MNIST and significantly improved retrieval time.
References
Babenko A., Slesarev A., Chigorin A. & Lempitsky V. (2014). Neural codes for image retrieval. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8689 LNCS(PART 1). pp.584-599.
Cheng Z., Yang Q. & Sheng B. (2015). Deep Colorization. Proceedings of the IEEE International Conference on Computer Vision. pp.415-423.
Ciressan D., Meier U.& Schmidhuber J. (2012). Multi-column Deep Neural Networks for Image Classification. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
Jia Y., Shelhamer E., Donahue J., Karayev S., Long J., Girshick R., Guadarame S. & Darrell T. (2014). Caffe: Convolutional Architecture for Fast Feature Embedding. UC Berkeley EECS, Berkeley, CA 94702.
Krizhevsky A., Nair V. & Hinton G. (2009). CIFAR-10 and CIFAR-100 dataset.Retrieved from https://www.cs.toronto.edu/~kriz/cifar.htmlon May 12, 2018.
Krizhevsky A., Sutskever I. & E. Hinton G. (2012). ImageNet Classification with Deep Convolutional Neural Networks. In Proc. NIPS.
Lecun Y., Cortes C. &Burges C. (1998). MNIST handwritten digit database. Retrieved from http://yann.lecun.com/exdb/mnist/on May 12, 2018.
Lecun Y., Bottou L., Bengio Y. & Haffner P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE.
Li H., Lin Z., Shen X., Brandt J. & Hua G. (2015). A Convolutional Neural Network Cascade for Face Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5325-5334.
Luo P., Tian, Y., Wang X. & Tang X. (2014). Switchable Deep Network for Pedestrian Detection Ping. Computer Vision Fundation.
Lê Minh Phúc & Trần Công Án (2017). Tìm kiếm ảnh theo nội dung vàngữ nghĩa. Tạp chí Khoahọc,Trường Đại học Cần Thơ. Số chuyên đề: Công nghệthông tin. tr.58-64.
Photoindustrie-Verband e.V. (2016). Photo and imaging market: Trend report. p.63.
Văn Thế Thành (2017). Tìm kiếm ảnh dựa trên đồ thị chữ ký nhị phân. Luận án Tiến sĩ khoa học máy tính. Đại học Huế.
Văn Thế Thành & Lê Mạnh Thạnh (2016). Một số cải tiến cho hệ truy vấn ảnh dựa trên cây S-Tree. Kỷ yếu hội nghị khoa học quốc gia lần thứIX - Nghiên cứu cơ bản vàứng dụng công nghệ thông tin (FAIR’9).