Journal of Computer Applications, 2011, Vol. 31, Issue (01): 11-15.

• Outstanding Papers of the 8th China Computer Graphics Conference •

Image composition and display based on network processing units in a parallel rendering system

李瑞瑞1,张一天2,秦桂明2,秦开怀1   

  1. Tsinghua University
  2.
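  • Received: 2010-07-12 Revised: 2010-08-26 Online: 2011-01-12 Published: 2011-01-01
  • Corresponding author: 李瑞瑞
  • Supported by:
    National Natural Science Foundation of China; National High Technology Research and Development Program of China; National High Technology Research and Development Program of China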

NPU-based image composition and display in parallel rendering system


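Abstract: As the complexity of rendering tasks and the scale of rendering data keep growing, distributed parallel rendering on PC clusters has become a common solution. The sort-last distributed parallel rendering approach offers good scalability and load balance, but the image composition bottleneck keeps its rendering speed below real-time requirements. This paper proposes a method for fast hardware image composition using a network processing unit (NPU) and develops a sort-last parallel rendering system, NPUPR. Experiments show that, for four rendering nodes, NPU-based hardware image composition improves rendering speed fourfold over the direct send composition algorithm. The paper also gives a scheme for extending the system to more rendering nodes by adding network processing units; analysis shows that the system's image composition performance does not degrade noticeably as the number of nodes grows.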

Key words: parallel rendering, image composition, distributed cluster, hardware acceleration, network processing unit

Abstract: For real-time rendering of massive data sets, parallel rendering on PC clusters is a common solution. In sort-last cluster rendering systems, compositing the partial images rendered in parallel by the cluster nodes is a well-known bottleneck. This paper presents an image composition method based on a network processing unit (NPU) and a sort-last distributed rendering system called NPUPR. Experimental results show that, with four rendering nodes, the NPU-based scheme achieves a frame rate three times faster than the direct send scheme. The paper also presents a scheme for extending the system from four rendering nodes to more. Analytically, the system is fully scalable with negligible penalty in frame rate.

Key words: parallel rendering, image compositing, PC clusters, hardware-assisted, NPU
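
For readers unfamiliar with the software baseline named in the abstract, the following is a minimal Python sketch of sort-last depth compositing and of the direct send scheme. It is an illustration only and is not the NPU-based method of the paper, whose details the abstract does not give. The names `composite_pair`, `composite_all` and `direct_send` are hypothetical, images are modeled as flat pixel lists, and opaque geometry with a per-pixel depth comparison is assumed.

```python
from functools import reduce
from typing import List, Tuple

Color = Tuple[int, int, int]
Layer = Tuple[List[Color], List[float]]  # (colors, depths), one entry per pixel


def composite_pair(a: Layer, b: Layer) -> Layer:
    """Merge two partial images, keeping the fragment nearer to the viewer."""
    colors: List[Color] = []
    depths: List[float] = []
    for ca, da, cb, db in zip(a[0], a[1], b[0], b[1]):
        if da <= db:  # smaller depth means closer; ties keep the first layer
            colors.append(ca)
            depths.append(da)
        else:
            colors.append(cb)
            depths.append(db)
    return colors, depths


def composite_all(layers: List[Layer]) -> Layer:
    """Reference result: fold every node's full-size image into one."""
    return reduce(composite_pair, layers)


def direct_send(layers: List[Layer]) -> Layer:
    """Direct send: node i owns tile i and composites that tile from all nodes."""
    n = len(layers)
    width = len(layers[0][0])
    out_colors: List[Color] = []
    out_depths: List[float] = []
    for i in range(n):
        lo, hi = i * width // n, (i + 1) * width // n
        # Assumption: in a real cluster these slices would arrive over the network.
        tiles = [(c[lo:hi], d[lo:hi]) for c, d in layers]
        tile_colors, tile_depths = composite_all(tiles)
        out_colors.extend(tile_colors)
        out_depths.extend(tile_depths)
    return out_colors, out_depths
```

In this sketch every node sends each other node the tile that node owns, so the number of messages grows roughly with the square of the node count. That communication cost is one way to picture the composition bottleneck the abstract refers to.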