计算机应用 ›› 2011, Vol. 31 ›› Issue (04): 935-937.DOI: 10.3724/SP.J.1087.2011.00935

• 计算机软件技术 • 上一篇    下一篇

基于数据流图的异构VLIW DSP分簇方法

邱鹏飞1,2,洪一3,耿锐3,徐云1,2   

  1. 1. 安徽省高性能计算重点实验室,合肥 230027
    2. 中国科学技术大学 计算机科学与技术学院,合肥 230027
    3. 中国电子科技集团 第38研究所,合肥 230088
  • 收稿日期:2010-10-08 修回日期:2010-11-17 发布日期:2011-04-08 出版日期:2011-04-01
  • 通讯作者: 邱鹏飞
  • 作者简介:邱鹏飞(1987-),男,安徽庐江人,硕士研究生,主要研究方向:编译器技术、高性能计算;
    洪一(1943-),男,安徽铜陵人,教授,主要研究方向:雷达信号处理;
    耿锐(1975-),男,安徽淮南人,工程师,硕士,主要研究方向:计算机体系结构;
    徐云(1960-),男,安徽合肥人,副教授,博士,主要研究方向:网格计算、并行算法、生物信息学。

Operation partitioning for heterogeneous VLIW DSP based on dataflow graph

Peng-fei QIU1,2,Yi HONG3,Rui GENG3,Yun XU1,2   

  1. 1. Key Laboratory of High Performance Computing of Anhui Province, Hefei Anhui 230027, China
    2. School of Computer Science and Technology, University of Science and Technology of China, Hefei Anhui 230027,China
    3. The 38 Research Institute, China Electronics Technology Group Corporation, Hefei Anhui 230088,China
  • Received:2010-10-08 Revised:2010-11-17 Online:2011-04-08 Published:2011-04-01
  • Contact: Peng-fei QIU

摘要: 超长指令字数字信号处理器(VLIW DSP)的指令级并行性(ILP)主要通过指令分簇和软件流水来实现。在以前的研究中,指令分簇主要只考虑指令级并行性和减少簇间转移指令,对异构体系结构和某些寄存器只能分在指定簇上的情况考虑较少。提出一种基于数据流图(DFG)的异构体系结构上的分簇方法,利用指令的相关性将DFG划分为与簇数目相同个数的子图,再根据特殊寄存器对簇的要求采用启发式算法对子图进行调整,实验结果表明这种分簇方法使得负载更均衡,加速比相对于传统方法可以提高8%左右。

关键词: 超长指令字, 指令级并行性, 异构, 分簇, 启发式算法

Abstract: The Instruction Level Parallelism (ILP) of VLIW DSP processor is acquired through operation partitioning and software pipeline. In the previous research of operation partitioning, people always focus on reducing move operations between clusters, but rarely consider the effect of heterogeneous architecture and some registers that should be placed on reserved cluster. A method based on DataFlow Graph (DFG) for heterogeneous architecture was described to solve this problem. First, the DFG was partitioned into several sub-graphs according to the relations between operations, then the sub-graphs were refined with a heuristic method to meet the requirements of special registers. The experimental results show that this method can make the load of cluster more balanced, and achieve an average of 8% improvement over traditional method.

Key words: Very Long Instruction Word(VLIW), ILP (Instruction Level Parallelism), heterogeneous, operation partitioning, heuristic method

中图分类号: