《计算机应用》唯一官方网站 ›› 2020, Vol. 40 ›› Issue (2): 465-472.DOI: 10.11772/j.issn.1001-9081.2019081900

• 第36届CCF中国数据库学术会议(NDBC 2019) • 上一篇    下一篇

空间亚频繁co-location模式的主导特征挖掘

马董, 陈红梅(), 王丽珍, 肖清   

  1. 云南大学 信息学院,昆明 650504
  • 收稿日期:2019-08-12 修回日期:2019-11-06 接受日期:2019-11-08 发布日期:2019-11-18 出版日期:2020-02-10
  • 通讯作者: 陈红梅
  • 作者简介:马董(1992—),男,云南曲靖人,硕士研究生,主要研究方向:空间数据挖掘
    王丽珍(1962-),女,山东博兴人,教授,博士,CCF高级会员,主要研究方向:数据库、空间数据挖掘
    肖清(1975-),女,江西吉水人,讲师,硕士,CCF会员,主要研究方向:空间数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61662086);云南省创新团队项目(2018HC019);云南大学“东陆中青年骨干教师”培养计划项目(WX069051)

Dominant feature mining of spatial sub-prevalent co-location patterns

Dong MA, Hongmei CHEN(), Lizhen WANG, Qing XIAO   

  1. School of Information Science and Engineering,Yunnan University,Kunming Yunnan 650504,China
  • Received:2019-08-12 Revised:2019-11-06 Accepted:2019-11-08 Online:2019-11-18 Published:2020-02-10
  • Contact: Hongmei CHEN
  • About author:MA Dong, born in 1992, M. S. candidate. His research interests include spatial data mining.
    WANG Lizhen, born in 1962, Ph. D., professor. Her research interests include databaase, spatial data mining.
    XIAO Qing, born in 1975, M. S., lecturer. Her research interests include spatial data mining.
  • Supported by:
    the National Natural Science Foundation of China(61662086);the Project of Innovative Research Team of Yunnan Province(2018HC019);the Program for Donglu Young and Middle-aged Skeleton Teachers of Yunnan University(WX069051)

摘要:

空间co-location模式是一组空间特征的子集,它们的实例在邻域内频繁并置出现。通常,空间co-location模式挖掘方法假设空间实例相互独立,并采用空间实例参与到模式实例的频繁性(参与率)来度量空间特征在模式中的重要性,采用空间特征的最小参与率(参与度)来度量模式的有趣程度,忽略了空间特征间的某些重要关系。因此为了揭示空间特征间的主导关系而提出主导特征co-location模式。现有主导特征模式挖掘方法是基于传统频繁模式及其团实例模型进行挖掘,然而,团实例模型可能会忽略非团的空间特征间的主导关系。因此,基于星型实例模型,研究空间亚频繁co-location模式的主导特征挖掘,以更好地揭示空间特征间的主导关系,挖掘更有价值的主导特征模式。首先,定义了两个度量特征主导性的指标;其次,设计了有效的主导特征co-location模式挖掘算法;最后,在合成数据集和真实数据集上通过大量实验验证了所提算法的有效性以及主导特征模式的实用性。

关键词: 空间数据挖掘, 空间co-location模式, 亚频繁co-location模式, 主导特征, 主导特征co-location模式

Abstract:

The spatial co-location pattern is a subset of spatial features whose instances frequently appear together in the neighborhoods. Co-location pattern mining methods usually assume that spatial instances are independent to each other, adopt a participation rate, which is the frequency of spatial instances participating in pattern instances, to measure the importance of spatial features in the co-location pattern, and adopt a participation index, which is the minimal participation rate of spatial features, to measure the interest of patterns. These methods neglect some important relationships between spatial features. Therefore, the co-location pattern with dominant feature was proposed to reveal the dominant relationship between spatial features. The existing method for mining co-location pattern with dominant feature is based on the traditional co-location pattern mining and its clique instance model. However, the clique instance model may neglect the non-clique dominant relationship between spatial features. Motivated by the above, the dominant feature mining of spatial sub-prevalent co-location patterns was studied based on the star instance model to better reveal the dominant relationship between spatial features and mine more valuable co-location patterns with dominant feature. Firstly, two metrics to measure feature’s dominance were defined. Secondly, an effective algorithm for mining co-location pattern with dominant feature was designed. Finally, the experimental results on both synthetic and real datasets show that the proposed mining algorithm is efficient and the co-location pattern with dominant feature is pratical.

Key words: spatial data mining, spatial co-location pattern, sub-prevalent co-location pattern, dominant feature, co-location pattern with dominant feature

中图分类号: