计算机应用 ›› 2011, Vol. 31 ›› Issue (09): 2402-2404.DOI: 10.3724/SP.J.1087.2011.02402

• 数据库技术 • 上一篇    下一篇

基于查询集空间分布的聚合最近邻查询算法

徐超,张东站,郑艳红,饶丽丽   

  1. 厦门大学 计算机科学系,福建 厦门 361005
  • 收稿日期:2011-03-31 修回日期:2011-06-16 发布日期:2011-09-01 出版日期:2011-09-01
  • 通讯作者: 徐超
  • 作者简介:徐超(1986-),男,山东临沂人,硕士研究生,主要研究方向:XML数据库、数据挖掘、Web数据管理;
    张东站(1974-),男,江苏新沂人,副教授,博士,主要研究方向:数据挖掘;
    郑艳红(1988-),女,福建莆田人,硕士研究生,主要研究方向:数据挖掘;
    饶丽丽(1988-),女,江西上饶人,硕士研究生,主要研究方向:数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(50604012)

Aggregate nearest neighbor query algorithm based on spatial distribution of query set

XU Chao,ZHANG Dong-zhan,ZHENG Yan-hong,RAO Li-li   

  1. Computer Science Department, Xiamen University, Xiamen Fujian 361005, China
  • Received:2011-03-31 Revised:2011-06-16 Online:2011-09-01 Published:2011-09-01
  • Contact: XU Chao

摘要: 聚合最近邻查询涉及到多个查询对象,因此比传统最近邻查询更复杂,而且其查询集空间分布特征暗含了查询集聚合最近邻的区域分布信息。充分考虑查询集分布特征,给出了利用分布特征指导聚合最近邻搜索的方法,并以此提出了一种新的聚合最近邻查询算法——AM算法。AM算法能动态地捕捉并利用查询集空间分布特征,使得对数据点的搜索按正确的次序进行,避免对不必要数据点的搜索。最后通过实验验证了AM算法的高效性。

关键词: 聚合最近邻查询, 优势组, 劣势点, 优先扩展

Abstract: Aggregate nearest neighbor query involves many query points, so it is more complicated than traditional nearest neighbor query, and the distribution characteristic of query set implies the region where its aggregate nearest neighbor exists. Taking full account of the distribution characteristic of query set, a method by utilizing distribution characteristic to direct the way of aggregate nearest neighbor searching was given. Based on the method, a new algorithm named AM was presented for aggregate nearest neighbor query. AM algorithm can dynamically capture and use the distribution characteristic of query set, which enables it to search data points in a right order, and avoid unnecessary searching to data points. The experimental results show the efficiency of the algorithm.

Key words: aggregate nearest neighbor query, superiority group, inferior point, extension with high priority

中图分类号: