Journal of Computer Applications ›› 2021, Vol. 41 ›› Issue (12): 3608-3613.DOI: 10.11772/j.issn.1001-9081.2021060886

• The 18th China Conference on Machine Learning • Previous Articles    

Prediction method of liver transplantation complications based on transfer component analysis and support vector machine

Hongliang CAO1,2, Ying ZHANG1,2(), Bin WU1, Fanyu LI1, Xubo NA1   

  1. 1.School of Control and Computer Engineering,North China Electric Power University,Beijing 102206,China
    2.Beijing Key Laboratory of Traf?c Data Analysis and Mining (Beijing Jiaotong University),Beijing 100044,China
  • Received:2021-05-12 Revised:2021-06-14 Accepted:2021-06-23 Online:2021-12-28 Published:2021-12-10
  • Contact: Ying ZHANG
  • About author:CAO Hongliang, born in 2000. His research interests include machine learning, artificial intelligence.
    WU Bin, born in 1996, M. S. candidate. His research interests include deep learning, spatio-temporal big data.
    LI Fanyu, born in 1998, M. S. candidate. Her research interests include reinforcement learning, path planning.
    NA Xubo, born in 1994, M. S. candidate. His research interests include urban computing, spatio-temporal big data.
  • Supported by:
    the Surface Program of National Natural Science Foundation of China(52078212)

基于迁移成分分析和支持向量机的肝移植并发症预测方法

曹鸿亮1,2, 张莹1,2(), 武斌1, 李繁菀1, 那绪博1   

  1. 1.华北电力大学 控制与计算机工程学院,北京 102206
    2.交通数据分析与挖掘北京市重点实验室(北京交通大学),北京 100044
  • 通讯作者: 张莹
  • 作者简介:曹鸿亮(2000—),男,江苏淮安人,主要研究方向:机器学习、人工智能
    武斌(1996—),男,河北邯郸人,硕士研究生,主要研究方向:深度学习、时空大数据
    李繁菀(1998—),女,湖南株洲人,硕士研究生,主要研究方向:强化学习、路径规划
    那绪博(1994—),男,辽宁沈阳人,硕士研究生,主要研究方向:城市计算、时空大数据。
  • 基金资助:
    国家自然基金面上项目(52078212)

Abstract:

Many machine learning algorithms can cope well with prediction and classification, but these methods suffer from poor prediction accuracy and F1 score when they are used on medical datasets with small samples and large feature spaces. To improve the accuracy and F1 score of liver transplantation complication prediction, a prediction and classification method of liver transplantation complications based on Transfer Component Analysis (TCA) and Support Vector Machine (SVM) was proposed. In this method, TCA was used for mapping and dimension reduction of the feature space, and the source domain and the target domain were mapped to the same reproducing kernel Hilbert space, thereby achieving the adaptivity of edge distribution. The SVM was trained in the source domain after transferring, and the complications were predicted in the target domain after training. In the liver transplantation complication prediction experiments for complication Ⅰ, complication Ⅱ, complication Ⅲa, complication Ⅲb, and complication Ⅳ, compared with the traditional machine learning and Heterogeneous Domain Adaptation (HDA), the accuracy of the proposed method was improved by 7.8% to 42.8%, and the F1 score reached 85.0% to 99.0%, while the traditional machine learning and HDA had high accuracy but low recall due to the imbalance of positive and negative samples. Experimental results show that TCA combined with SVM can effectively improve the accuracy and F1 score of liver transplantation complication prediction.

Key words: transfer learning, Transfer Component Analysis (TCA), Support Vector Machine (SVM), liver transplantation, prediction of complications

摘要:

已有很多机器学习算法能够很好地应对预测分类问题,但这些方法在用于小样本、大特征空间的医疗数据集时存在着预测准确率和F1值不高的问题。为改善肝移植并发症预测的准确率和F1值,提出一种基于迁移成分分析(TCA)和支持向量机(SVM)的肝移植并发症预测分类方法。该方法采用TCA进行特征空间的映射和降维,将源领域和目标领域映射到同一再生核希尔伯特空间,从而实现边缘分布自适应;迁移完成之后在源领域上训练SVM,训练完成后在目标领域上实现并发症的预测分析。在肝移植并发症预测实验中,针对并发症Ⅰ、并发症Ⅱ、并发症Ⅲa、并发症Ⅲb、并发症Ⅳ进行预测,与传统机器学习和渐进式对齐异构域适应(HDA)相比,所提方法的准确率提升了7.8%~42.8%,F1值达到85.0%~99.0%,而传统机器学习和HDA由于正负样本不均衡出现了精确率很高而召回率很低的情况。实验结果表明TCA结合SVM能够有效提高肝移植并发症预测的准确率和F1值。

关键词: 迁移学习, 迁移成分分析, 支持向量机, 肝移植, 并发症预测

CLC Number: