Journal of Computer Applications (计算机应用)

• Database technology (invited special topic)

Survey on query processing based on uncertain data

Bing CUI (崔斌), Yang LU (卢阳)

  1. School of Electronics Engineering and Computer Science (信息科学技术学院), Peking University
  • Received: 2008-07-08 Online: 2008-11-01 Published: 2008-11-01
  • Contact: Bing CUI

Abstract: Uncertain data is inherent in several important application domains, such as sensor networks and moving-object tracking. Applying traditional query methods to uncertain data biases the results, so they cannot satisfy users' needs. Query processing over uncertain data has therefore attracted increasing attention. Unlike queries over certain data, research on uncertain data introduces probability into the data model to measure the likelihood that an uncertain object belongs to the answer set. Because problem definitions and data models differ, queries over uncertain data take many forms. This survey reviews range queries, top-k queries, and skyline queries over uncertain data from the perspectives of problem definition, data model, pruning strategy, and algorithm.

Key words: uncertain data, range queries, top-k queries, skyline queries