Journal of Computer Applications ›› 2011, Vol. 31 ›› Issue (11): 2997-3000. DOI: 10.3724/SP.J.1087.2011.02997

• Graphics and Image Technology •

基于多尺寸特征融合的快速转码算法

吕卓逸1,2,贾克斌1,萧允治2   

  1. 北京工业大学 电子信息与控制工程学院,北京 100124
    2. 香港理工大学 电子及资讯工程学系, 香港
  • 收稿日期:2011-05-24 修回日期:2011-07-01 发布日期:2011-11-16 出版日期:2011-11-01
  • 通讯作者: 吕卓逸
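  • About the authors: LU Zhuo-yi (born 1983), female, from Beijing, Ph.D. candidate; main research interests: video coding and video transcoding;
    JIA Ke-bin (born 1962), male, from Anyang, Henan, professor and doctoral supervisor; main research interest: multimedia signal processing;
    XIAO Yun-zhi (born 1954), male, from Hong Kong, professor and doctoral supervisor; main research interests: image coding, video coding, and pattern recognition.
  • Supported by:
    National Natural Science Foundation of China; National Doctoral Program Foundation of China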

Fast video transcoding algorithm based on hybrid characteristic of multi-scale videos

LU Zhuo-yi1,2,JIA Ke-bin2,XIAO Yun-zhi1   

  1. Department of Electronic and Information Engineering, Hong Kong Polytechnic University, Hong Kong
    2. School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China
  • Received:2011-05-24 Revised:2011-07-01 Online:2011-11-16 Published:2011-11-01
  • Contact: LU Zhuo-yi

摘要: 为了降低H.264转码器的运算复杂度,满足视频转码实时应用的要求,提出一种结合多尺寸视频特征的快速视频转码帧内预测算法。首先利用二维直方图提取小尺寸视频中宏块的空间特性,结合双阈值的方法,选择Intra 16×16或Intra 4×4模式。然后从输入的视频码流中提取编码信息组成多维特征向量训练支持向量机(SVM)分类器模型,通过SVM建立大尺寸视频编码信息与小尺寸视频宏块编码模式之间的联系,进而对Intra 4×4中的9种模式进行细分。此算法减少了预测模式数量,实现率失真优化算法的提前终止,在高效转码的同时保证了转码后视频的高质量。

关键词: 降尺寸视频转码, 帧内预测, 二维直方图, 支持向量机, 率失真优化

Abstract: A fast intra mode decision scheme for down-sizing video transcoding in H.264, based on hybrid characteristics of multi-scale videos, is presented to reduce the computational complexity of the H.264 transcoder and meet the requirements of real-time transcoding applications. To avoid the high cost of conventional intra prediction in the re-encoder, the scheme first uses a 2D histogram to extract the spatial characteristics of macroblocks in the low-resolution video and, with a dual-threshold test, selects between Intra 16×16 and Intra 4×4. Coding information extracted from the input high-resolution bit-stream is then assembled into multi-dimensional feature vectors to train a Support Vector Machine (SVM) classifier, which captures the correlation between that coding information and the coding modes of macroblocks in the down-sized video frames. With the SVM classifier, improbable modes among the nine Intra 4×4 modes are eliminated and Rate Distortion Optimization (RDO) is carried out on only a small number of candidate modes, so the RDO search terminates early. Considerable computation time is thereby saved while the quality of the transcoded pictures remains nearly the same.

Key words: down-sizing video transcoding, intra prediction, 2D-histogram, Support Vector Machine (SVM), Rate Distortion Optimization (RDO)
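
The two-stage decision outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the 2D histogram, the threshold values, or the SVM features, so the sketch assumes a co-occurrence histogram of horizontally adjacent pixels as the smoothness measure, uses hypothetical thresholds `T_LOW` and `T_HIGH`, and replaces the SVM with an arbitrary list of candidate modes. All function names are hypothetical.

```python
from typing import Callable, List, Sequence

# Hypothetical thresholds; the abstract gives no values.
T_LOW, T_HIGH = 0.5, 0.9


def diagonal_mass(block: Sequence[Sequence[int]], bins: int = 16, band: int = 1) -> float:
    """Assumed smoothness measure: share of horizontally adjacent 8-bit pixel
    pairs that land within `band` bins of the diagonal of a 2D histogram."""
    hist = [[0] * bins for _ in range(bins)]
    step = 256 // bins
    total = 0
    for row in block:
        for a, b in zip(row, row[1:]):
            hist[a // step][b // step] += 1
            total += 1
    near = sum(hist[i][j]
               for i in range(bins) for j in range(bins)
               if abs(i - j) <= band)
    return near / total if total else 1.0


def choose_block_types(block: Sequence[Sequence[int]]) -> List[str]:
    """Dual-threshold test: smooth blocks go to Intra 16x16, detailed blocks
    to Intra 4x4, and ambiguous blocks keep both types for a full check."""
    score = diagonal_mass(block)
    if score >= T_HIGH:
        return ["I16x16"]
    if score <= T_LOW:
        return ["I4x4"]
    return ["I16x16", "I4x4"]


def best_intra4x4_mode(candidates: Sequence[int],
                       rd_cost: Callable[[int], float]) -> int:
    """Evaluate the rate-distortion cost only for the candidate modes kept by
    a classifier (an SVM in the paper; here any list of mode indices 0..8)."""
    return min(candidates, key=rd_cost)
```

For example, a flat 16×16 block yields `["I16x16"]`, while a block alternating between 0 and 255 yields `["I4x4"]`; only blocks whose score falls between the two thresholds need both block types evaluated.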
