[1] WOLF W H. Key frame selection by motion analysis[C]//Proceedings of the 1996 IEEE Conference on Acoustics, Speech, and Signal Processing. Washington, DC:IEEE Computer Society, 1996:1228-1231. [2] ZHANG H, WU J, ZHONG D, et al. An integrated system for content-based video retrieval and browsing[J]. Pattern Recognition, 1997, 30(4):643-658. [3] LU Z, GRAUMAN K. Story-driven summarization for egocentric video[C]//Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2013:2714-2721. [4] YAO T, MEI T, NGO C, et al. Annotation for free:video tagging by mining user search behavior[C]//Proceedings of the 21st ACM International Conference on Multimedia. New York:ACM, 2013:977-986. [5] EL SAYAD I, MARTINET J, URRUTY T, et al.A semantically significant visual representation for social image retrieval[C]//Proceedings of the 2011 IEEE International Conference on Multimedia and Expo. Washington, DC:IEEE Computer Society, 2011:1-6. [6] 王晗,吴心筱,贾云得. 使用异构互联网图像组的视频标注[J]. 计算机学报,2013,36(10):2062-2069.(WANG H, WU X X, JIA Y D. Video annotation by using heterogeneous multiple image groups on the Web[J].Chinese Journal of Computers, 2013,36(10):2062-2069.) [7] 王晗. 基于迁移学习的视频标注方法[D]. 北京:北京理工大学, 2014.(WANG H. Video annotation based on transfer learning[D]. Beijing:Beijing Institute of Technology, 2014.) [8] WANG H, WU X. Finding event videos via image search engine[C]//Proceedings of the 2015 IEEE International Conference on Data Mining Workshop. Washington, DC:IEEE Computer Society, 2015:1221-1228. [9] WANG H, WU X, JIA Y. Video Annotation via image groups from the Web[J]. IEEE Transactions on Multimedia, 2014, 16(5):1282-1291. [10] WANG H, SONG H, WU X, et al. Video annotation by incremental learning from grouped heterogeneous sources[C]//Proceedings of the 12th Asian Conference on Computer Vision. Berlin:Springer, 2014:493-507. [11] 余春艳,翁子林.音频情感感知与视频精彩片段提取[J].计算机辅助设计与图形学学报, 2015, 27(10):1890-1899.(YU C Y, WENG Z L. Audio emotion perception and video highlight extraction[J].Journal of Computer Aided Design and Computer Graphics,2015,27(10):1890-1899.) [12] ZHANG K, CHAO W, SHA F, et al. Summary transfer:exemplar-based subset selection for video summarization[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2016:1059-1067. [13] YAO T, MEI T, RUI Y. Highlight detection with pairwise deep ranking for first-person video summarization[C]//Proceedings 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2016:982-990. [14] LOWE D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(2):91-110. [15] HOIEM D,EFROS A, HEBERT M. Recovering surface layout from an image[J]. International Journal of Computer Vision, 2007,75(1):151-172. [16] OLIVA A, TORRALBA A. Modeling the shape of the scene:a holistic representation of the spatial envelope[J]. International Journal of Computer Vision, 2001, 42(3):145-175. [17] SWAIN M J, BALLARD D H. Indexing via color histograms[C]//Proceedings of the 3rd International Conference on Computer Vision. Piscataway, NJ:IEEE, 1990:390-393. [18] MEI T, TANG L, TANG J, et al. Near-lossless semantic video summarization and its applications to video analysis[J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2013, 9(3):Article No. 16. [19] PLATT J C, CRISTIANINI N, SHAWE-TAYLOR J. Large margin DAGs for multiclass classification[J]. Advances in Neural Information Processing Systems, 2000, 12(3):547-553. [20] FERNANDO B, HABRARD A, SEBBAN M, et al. Unsupervised visual domain adaptation using subspace alignment[C]//Proceedings of the 2013 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2013:2960-2967. [21] GRAUMAN K. Geodesic flow kernel for unsupervised domain adaptation[C]//Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2012:2066-2073. [22] MENG J, WANG H, YUAN J, et al. From keyframes to key objects:video summarization by representative object proposal selection[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2016:1039-1048. |