[1] WU L, HOI S C H, YU N. Semantics-preserving bag-of-words models and applications[J]. IEEE Transactions on Image Processing, 2010, 19(7): 1908-1920. [2] YANG J, JIANG Y G, HAUPTMANN A G, et al. Evaluating bag-of-visual-words representations in scene classification[C]//Proceedings of the 2007 International Workshop on Workshop on Multimedia Information Retrieval. New York: ACM, 2007: 197-206. [3] LOWE D G. Object recognition from local scale-invariant features[C]//Proceedings of the 1999 IEEE International Conference on Computer Vision. Piscataway, NJ: IEEE, 1999: 1150-1157. [4] BAY H, ESS A, TUYTELAARS T, et al. Speeded-Up Robust Features (SURF)[J]. Computer Vision and Image Understanding, 2008, 110(3): 346-359. [5] SCHMIDHUBER J. Deep learning in neural networks: an overview[J]. Neural Networks, 2015, 61: 85-117. [6] WAN J, WANG D, HOI S C H, et al. Deep learning for content-based image retrieval: a comprehensive study[C]//Proceedings of the 2014 ACM International Conference on Multimedia. New York: ACM, 2014: 157-166. [7] WU P, HOI S C H, XIA H, et al. Online multimodal deep similarity learning with application to image retrieval[C]//Proceedings of the 21st ACM International Conference on Multimedia. New York: ACM, 2013: 153-162. [8] KRIZHEYSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[C]//Proceedings of the 26th Annual Conference on Neural Information Processing Systems. Lake Tahoe, Nevada: [s.n.], 2012: 1097-1105. [9] VEDALDI A, LENC K. MatConvNet — convolutional neural networks for MATLAB [EB/OL]. [2015-06-21]. http://arxiv.org/pdf/1412.4564.pdf. [10] CHATFIELD K, SIMONYAN K, VEDALDI A, et al. Return of the devil in the details: delving deep into convolutional nets [EB/OL]. [2014-11-05]. http://arxiv.org/pdf/1405.3531.pdf [11] BABENKO A, SLESAREV A, CHIGORIN A, et al. Neural codes for image retrieval[C]//ECCV 2014: Proceedings of the 13th European Conference on Computer Vision. Berlin: Springer, 2014: 584-599. [12] DONAHUE J, JIA Y, VINYALS O, et al. DeCAF: a deep convolutional activation feature for generic visual recognition [EB/OL]. [2015-05-10]. http://arxiv.org/abs/1310.1531v1. [13] ZHOU D, WESTON J, GRETTON A, et al. Ranking on data manifolds[C]//NIPS 2003: Proceedings of the 2003 Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2004, 16: 169-176. [14] XU B, BU J, CHEN C, et al. Efficient manifold ranking for image retrieval[C]//Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 2011: 525-534. [15] OLIVA A, TORRALBA A. Modeling the shape of the scene: a holistic representation of the spatial envelope[J]. International Journal of Computer Vision, 2001, 42(3): 145-175. [16] CHECHIK G, SHARMA V, SHALIT U, et al. Large scale online learning of image similarity through ranking[J]. Journal of Machine Learning Research, 2009, 11(2): 1109-1135. |