计算机应用 ›› 2019, Vol. 39 ›› Issue (9): 2558-2567.DOI: 10.11772/j.issn.1001-9081.2019030450

• 人工智能 • 上一篇    下一篇

医学影像人工智能辅助诊断的样本增广方法

魏小娜1, 李英豪2, 王振宇1, 李皓尊2, 汪红志1,2   

  1. 1. 上海市磁共振重点实验室(华东师范大学), 上海 200062;
    2. 华东师范大学 物理与材料科学学院, 上海 200062
  • 收稿日期:2019-03-19 修回日期:2019-05-15 出版日期:2019-09-10 发布日期:2019-05-28
  • 通讯作者: 汪红志
  • 作者简介:魏小娜(1986-),女,安徽宿州人,硕士研究生,主要研究方向:深度学习、医学图像处理;李英豪(1999-),男,重庆人,主要研究方向:机器学习、图像重建;王振宇(1996-),男,江西抚州人,硕士研究生,主要研究方向:深度学习、医学图像处理;李皓尊(1996-),男,上海人,主要研究方向:机器学习、图像重建;汪红志(1975-),男,湖北黄冈人,副教授,博士,主要研究方向:磁共振(成像)技术、医学影像分析。
  • 基金资助:

    上海浦江人才计划项目(17PJ1432500)。

Methods of training data augmentation for medical image artificial intelligence aided diagnosis

WEI Xiaona<sup>1</sup>, LI Yinghao<sup>2</sup>, WANG Zhenyu<sup>1</sup>, LI Haozun<sup>2</sup>, WANG Hongzhi<sup>1,2</sup>   

  1. 1. Shanghai Key Laboratory of Magnetic Resonance(East China Normal University), Shanghai 200062, China;
    2. School of Physics and Material Science, East China Normal University, Shanghai 200062, China
  • Received:2019-03-19 Revised:2019-05-15 Online:2019-09-10 Published:2019-05-28
  • Supported by:

    This work is partially supported by the Shanghai Pujiang Talent Plan (17PJ1432500).

摘要:

针对不同领域人工智能(AI)应用研究所面临的采用常规手段获取大量样本时耗时耗力耗财的问题,许多AI研究领域提出了各种各样的样本增广方法。首先,对样本增广的研究背景与意义进行介绍;其次,归纳了几种公知领域(包括自然图像识别、字符识别、语义分析)的样本增广方法,并在此基础上详细论述了医学影像辅助诊断方面的样本获取或增广方法,包括X光片、计算机断层成像(CT)图像、磁共振成像(MRI)图像的样本增广方法;最后,对AI应用领域数据增广方法存在的关键问题进行总结,并对未来的发展趋势进行展望。经归纳总结可知,获取足够数量且具有广泛代表性的训练样本是所有领域AI研发的关键环节。无论是公知领域还是专业领域都进行样本增广,且不同领域甚至同一领域的不同研究方向,其样本获取或增广方法均不相同。此外,样本增广并不是简单地增加样本数量,而是尽可能再现小样本量无法完全覆盖的真实样本存在,进而提高样本多样性,增强AI系统性能。

关键词: 人工智能, 医学影像, 辅助诊断, 样本增广

Abstract:

For the problem of time, effort and money consuming to obtain a large number of samples by conventional means faced by Artificial Intelligence (AI) application research in different fields, a variety of sample augmentation methods have been proposed in many AI research fields. Firstly, the research background and significance of data augmentation were introduced. Then, the methods of data augmentation in several common fields (including natural image recognition, character recognition and discourse parsing) were summarized, and on this basis, a detailed overview of sample acquisition or augmentation methods in the field of medical image assisted diagnosis was provided, including X-ray, Computed Tomography (CT), Magnetic Resonance Imaging (MRI) images. Finally, the key issues of data augmentation methods in AI application fields were summarized and the future development trends were prospected. It can be concluded that obtaining a sufficient number of broadly representative training samples is the key to the research and development of all AI fields. Both the common fields and the professional fields have conducted sample augmentation, and different fields or even different research directions in the same field have different sample acquisition or augmentation methods. In addition, sample augmentation is not simply to increase the number of samples, but to reproduce the existence of real samples that cannot be completely covered by small sample size as far as possible, so as to improve sample diversity and enhance AI system performance.

Key words: Artificial Intelligence (AI), medical image, aided diagnosis, sample augmentation

中图分类号: