基于查询集空间分布的聚合最近邻查询算法

doi:10.3724/SP.J.1087.2011.02402

计算机应用 ›› 2011, Vol. 31 ›› Issue (09): 2402-2404.DOI: 10.3724/SP.J.1087.2011.02402

基于查询集空间分布的聚合最近邻查询算法

徐超,张东站,郑艳红,饶丽丽

厦门大学计算机科学系，福建厦门 361005

收稿日期:2011-03-31 修回日期:2011-06-16 发布日期:2011-09-01 出版日期:2011-09-01
通讯作者: 徐超
作者简介:徐超(1986-)，男，山东临沂人，硕士研究生，主要研究方向：XML数据库、数据挖掘、Web数据管理；
张东站(1974-)，男，江苏新沂人，副教授，博士，主要研究方向：数据挖掘；
郑艳红(1988-)，女，福建莆田人，硕士研究生，主要研究方向：数据挖掘；
饶丽丽(1988-)，女，江西上饶人，硕士研究生，主要研究方向：数据挖掘。
基金资助:
国家自然科学基金资助项目(50604012)

Aggregate nearest neighbor query algorithm based on spatial distribution of query set

XU Chao,ZHANG Dong-zhan,ZHENG Yan-hong,RAO Li-li

Computer Science Department, Xiamen University, Xiamen Fujian 361005, China

Received:2011-03-31 Revised:2011-06-16 Online:2011-09-01 Published:2011-09-01
Contact: XU Chao

摘要/Abstract

摘要： 聚合最近邻查询涉及到多个查询对象，因此比传统最近邻查询更复杂,而且其查询集空间分布特征暗含了查询集聚合最近邻的区域分布信息。充分考虑查询集分布特征，给出了利用分布特征指导聚合最近邻搜索的方法，并以此提出了一种新的聚合最近邻查询算法——AM算法。AM算法能动态地捕捉并利用查询集空间分布特征，使得对数据点的搜索按正确的次序进行，避免对不必要数据点的搜索。最后通过实验验证了AM算法的高效性。

关键词: 聚合最近邻查询, 优势组, 劣势点, 优先扩展

Abstract: Aggregate nearest neighbor query involves many query points, so it is more complicated than traditional nearest neighbor query, and the distribution characteristic of query set implies the region where its aggregate nearest neighbor exists. Taking full account of the distribution characteristic of query set, a method by utilizing distribution characteristic to direct the way of aggregate nearest neighbor searching was given. Based on the method, a new algorithm named AM was presented for aggregate nearest neighbor query. AM algorithm can dynamically capture and use the distribution characteristic of query set, which enables it to search data points in a right order, and avoid unnecessary searching to data points. The experimental results show the efficiency of the algorithm.

Key words: aggregate nearest neighbor query, superiority group, inferior point, extension with high priority

中图分类号:

TP311.13

徐超张东站郑艳红饶丽丽. 基于查询集空间分布的聚合最近邻查询算法[J]. 计算机应用, 2011, 31(09): 2402-2404.

XU Chao ZHANG Dong-zhan ZHENG Yan-hong RAO Li-li. Aggregate nearest neighbor query algorithm based on spatial distribution of query set[J]. Journal of Computer Applications, 2011, 31(09): 2402-2404.

[1]	赵全, 汤小春, 朱紫钰, 毛安琪, 李战怀. 大规模短时间任务的低延迟集群调度框架[J]. 计算机应用, 2021, 41(8): 2396-2405.
[2]	冯钧王秉发陆佳民. 分布式资源描述框架数据管理系统查询性能评价[J]. 计算机应用, 0, (): 0-0.
[3]	李国荣, 冶继民, 甄远婷. 基于新的鲁棒相似性度量的时间序列聚类[J]. 计算机应用, 2021, 41(5): 1343-1347.
[4]	林定康颜嘉麒巴·楠登符朕皓姜皓晨. 门罗币匿名及追踪技术综述[J]. 计算机应用, 0, (): 0-0.
[5]	沈忱, 邰凌翔, 彭煜玮. 面向自动参数调优的动态负载匹配方法[J]. 计算机应用, 2021, 41(3): 657-661.
[6]	杨程, 陆佳民, 冯钧. 分布式环境下大规模资源描述框架数据划分方法综述[J]. 计算机应用, 2020, 40(11): 3184-3191.
[7]	兰海, 韩珂, 申砾, 崔秋, 彭煜玮. TiDB的多索引访问优化[J]. 计算机应用, 2020, 40(2): 410-415.
[8]	崔艺馨, 陈晓东. Spark框架优化的大规模谱聚类并行算法[J]. 计算机应用, 2020, 40(1): 168-172.
[9]	万静, 郑龙君, 何云斌, 李松. 高维不确定数据的子空间聚类算法[J]. 计算机应用, 2019, 39(11): 3280-3287.
[10]	李博, 张晓, 颜靖艺, 李可威, 李恒, 凌玉龙, 张勇. 基于值差度量和聚类优化的K最近邻算法在银行客户行为预测中的应用[J]. 计算机应用, 2019, 39(9): 2784-2788.
[11]	李耘书, 滕飞, 李天瑞. 基于微操作的Hadoop参数自动调优方法[J]. 计算机应用, 2019, 39(6): 1589-1594.
[12]	霍峥, 张坤, 贺萍, 武彦斌. 满足本地化差分隐私的众包位置数据采集[J]. 计算机应用, 2019, 39(3): 763-768.
[13]	朱跃龙, 朱晓晓, 王继民. 基于子序列全连接和最大团的时间序列模体发现算法[J]. 计算机应用, 2019, 39(2): 414-420.
[14]	尹远, 张昌, 文凯, 郑云俊. 基于DiffNodeset结构的最大频繁项集挖掘算法[J]. 计算机应用, 2018, 38(12): 3438-3443.
[15]	曲立平, 吴家喜. 基于评分可靠性的跨域个性化推荐方法[J]. 计算机应用, 2018, 38(11): 3081-3083.

基于查询集空间分布的聚合最近邻查询算法

Aggregate nearest neighbor query algorithm based on spatial distribution of query set

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics