计算机应用 ›› 2014, Vol. 34 ›› Issue (5): 1467-1472.DOI: 10.11772/j.issn.1001-9081.2014.05.1467

• 虚拟现实与数字媒体 • 上一篇    下一篇

基于可变码长的音视频同步编码改进算法

曾碧,林健浩,肖红,何元烈   

  1. 广东工业大学 计算机学院,广州 510006
  • 收稿日期:2013-11-06 修回日期:2013-12-25 出版日期:2014-05-01 发布日期:2014-05-30
  • 通讯作者: 林健浩
  • 作者简介:曾碧(1963-),女,广东梅州人,教授,博士研究生,CCF高级会员,主要研究方向:嵌入式系统与智能技术、信息物理融合系统;林健浩(1988-),男,广东潮州人,硕士研究生,主要研究方向:嵌入式系统与智能技术;肖红(1972-),女,湖北咸宁人,博士研究生,主要研究方向:移动计算、物联网;何元烈(1976-),男,广东江门人,博士研究生,主要研究方向:计算机视觉、机器人、嵌入式系统。
  • 基金资助:

    广州市科技计划项目

Improved algorithm of audio-video synchronization coding based on variable code length

ZENG Bi,LIN Jianhao,XIAO Hong,HE Yuanlie   

  1. Faculty of Computer, Guangdong University of Technology, Guangzhou Guangdong 510006, China
  • Received:2013-11-06 Revised:2013-12-25 Online:2014-05-01 Published:2014-05-30
  • Contact: LIN Jianhao

摘要:

针对音视频同步的问题提出一种基于H.264帧间预测的音视频同步编码的改进算法。该算法引入可变码长的概念,将音频编码数据分为若干码组,每个码组为2或3比特的待嵌入数据。在H.264的帧间预测环节,可变尺寸块与码组之间根据公式确定映射关系。根据待嵌入数据来动态决定宏块分割模式的编码方式,以及根据映射关系提取数据的解码方法,使用4×4宏块分割模式表示嵌入数据的结束。实验结果表明,该算法使视频采集样本的峰值信噪比(PSNR)下降了0.031dB,码率变化率为5.16%,产生1.97%的嵌入开销,但是所嵌入的音频编码数据可以正确完整地提取。因此该算法能够在增加数据嵌入容量、保持视频质量、保证数据正确性和完整性的基础上实现音视频同步编码。

Abstract:

To solve the synchronization problem of audio and video, an improved algorithm of audio-video synchronization coding based on H.264 inter-frame prediction was proposed. The algorithm introduced the concept of variable code length. The audio encoding data was divided into several code groups, and each code group had 2 or 3 bits of embedded data. In the stage of H.264 inter-frame prediction, the mappings between various variable size blocks and the data of code groups were based on formula. The coding method was dynamically determined for the macro block modes coding according to embedded data, and a proposed decoding method could extract the corresponding data according to the mapping relationship. Finally, the 4×4 macro block mode was used to indicate the end of the audio data.The experimental results show that the proposed algorithm enables the Peak Signal-to-Noise Ratio (PSNR) of video samples to reduce by 0.031dB, the bit rate to increase by 5.16% and the overhead to increase by 1.97%, but the embedded audio data can be correctly and completely extracted. Therefore,the algorithm can implement the synchronization of audio and video coding while increasing the data embedding capacity, maintaining the quality of video, ensuring the correctness and completeness of the data.

中图分类号: