基于自适应阈值学习的时序因果推断方法

doi:10.11772/j.issn.1001-9081.2023091278

《计算机应用》唯一官方网站

• • 下一篇

基于自适应阈值学习的时序因果推断方法

赵秦壮,谭红叶

山西大学计算机与信息技术学院

收稿日期:2023-09-18 修回日期:2024-03-13 发布日期:2024-04-16 出版日期:2024-04-16
通讯作者: 赵秦壮
作者简介:赵秦壮(1998—)，男，山西运城人，博士研究生，CCF会员，主要研究方向：因果推断；谭红叶(1971—)，女，广西灵山人，教授，博士，CCF会员，主要研究方向：自然语言处理。
基金资助:
国家自然科学基金资助项目（62076155）

Time series causal inference method based on adaptive threshold learning

ZHAO Qinzhuang, TAN Hongye

School of Computer and Information Technology, Shanxi University

Received:2023-09-18 Revised:2024-03-13 Online:2024-04-16 Published:2024-04-16
About author:ZHAO Qinzhuang, born in 1998, Ph.D. candidate. His research interests include causal inference. TAN Hongye, born in 1971, Ph.D., professor. Her research interests include natural language processing.
Supported by:
National Natural Science Foundation of China (62076155)

摘要/Abstract

摘要： 时序数据存在近因性特点，即变量值普遍依赖近期的历史信息，而现有方法没有充分考虑时序数据的这种特性，在通过假设检验推断不同延迟的因果关系时使用统一的阈值，难以有效推断较弱的因果关系。针对上述问题，提出了基于自适应阈值学习的时序因果推断方法：首先提取数据特性，然后根据不同延迟下数据呈现的性质，自动地学习假设检验过程中使用的阈值组合，最后将该阈值组合用于PC（Peter-Clark）算法、PCMCI（Peter-Clark and Momentary Conditional Independence）算法和VAR-LINGAM（Vector Autoregressive Linear non-Gaussian Acyclic Model）算法的假设检验过程，以得到更为准确的因果关系结构。在仿真数据集上进行了实验验证：在数据集a上，采用所提方法的自适应PC算法、自适应PCMCI算法、自适应VAR-LINGAM算法的F1值分别提高了约、1、0.03个百分点；在数据集b上分别提高了约0.53、1.16、2.36个百分点；在数据集c上分别提高了约0.22、3.56、0.98个百分点。

关键词: 因果推断, 时间序列, 假设检验, 参数优化, 自适应

Abstract: The recency characteristic was exhibited by time-series data, where variable values were generally dependent on recent historical information. This characteristic was not fully considered by existing methods, which used a uniform threshold when causal relationships with different delays were inferred through hypothesis testing, making it difficult to effectively infer weaker causal relationships. To address the aforementioned issue, a method for time-series causal inference based on adaptive threshold learning was proposed: first, data characteristics were extracted, then, based on the nature of the data at different delays, a combination of thresholds used in the hypothesis testing process was automatically learned. Finally, this threshold combination was applied to the hypothesis testing processes of the PC (Peter-Clark) algorithm, PCMCI (Peter-Clark and Momentary Conditional Independence) algorithm, and VAR-LINGAM (Vector Autoregressive Linear non-Gaussian Acyclic Model) algorithm to obtain a more accurate causal relationship structure. Experimental verification was conducted on a simulated dataset: on dataset a, the adaptive PC algorithm, adaptive PCMCI algorithm, and adaptive VAR-LINGAM algorithm using the proposed method improved the F1 score by approximately 1.31, 1, and 0.03 percentage points respectively; on dataset b, they improved by approximately 0.53, 1.16, and 2.36 percentage points respectively; on dataset c, they improved by approximately 0.22, 3.56, and 0.98 percentage points respectively.

Key words: causal inference, time series, hypothesis test, parameter optimization, adaptive

中图分类号:

TP391

赵秦壮谭红叶. 基于自适应阈值学习的时序因果推断方法[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2023091278.

ZHAO Qinzhuang, TAN Hongye. Time series causal inference method based on adaptive threshold learning[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2023091278.

[1]	佘维, 李阳, 钟李红, 孔德锋, 田钊. 基于改进实数编码遗传算法的神经网络超参数优化[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 671-676.
[2]	刘一迪, 温自豪, 任富香, 李诗音, 唐德玉. 自适应球形演化的药物-靶标相互作用预测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 989-994.
[3]	任帅, 纪元法, 孙希延, 韦照川, 林子安. 基于改进灰狼优化与支持向量回归的滑坡位移预测[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 972-982.
[4]	刘勇, 杨锟. 新能源汽车电池回收网点竞争选址模型及算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 595-603.
[5]	李俊杰, 望育梅, 李志军, 刘雨. 全景视频基于块的视口自适应传输方案综述[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 536-547.
[6]	王震, 张珊珊, 邬斌扬, 苏万华. 基于自适应粒子群优化算法的串联复合涡轮储能优化策略[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 611-618.
[7]	欧云, 周恺卿, 尹鹏飞, 刘雪薇. 双收敛因子策略下的改进灰狼优化算法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2679-2685.
[8]	何添, 沈宗鑫, 黄倩倩, 黄雁勇. 基于自适应学习的多视图无监督特征选择方法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2657-2664.
[9]	徐丽, 符祥远, 李浩然. 基于门控卷积的时空交通流预测模型[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2760-2765.
[10]	于碧辉, 蔡兴业, 魏靖烜. 基于提示学习的小样本文本分类方法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2735-2740.
[11]	陆辉, 黄瑞章, 薛菁菁, 任丽娜, 林川. 深度动态文本聚类模型DDDC[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2370-2375.
[12]	韩春港, 刘永辉. 基于GhostNet和特征融合的人脸活体检测算法[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2588-2592.
[13]	周寅莹, 周允升, 余敦辉, 孙军. 基于消极相似性的自适应社会化推荐[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2439-2447.
[14]	李豆豆, 李汪根, 夏义春, 束阳, 高坤. 基于特征交互与自适应融合的骨骼动作识别[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2581-2587.
[15]	刘安阳, 赵怀慈, 蔡文龙, 许泽超, 解瑞灯. 基于主动判别机制的自适应生成对抗网络图像去模糊算法[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2288-2294.

基于自适应阈值学习的时序因果推断方法

Time series causal inference method based on adaptive threshold learning

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics