论文标题

验证分析查询结果的正确性

Verifying the Correctness of Analytic Query Results

论文作者

Nosrati, Masoud, Cai, Ying

论文摘要

数据外包是一个具有成本效益的解决方案,可以解决数据所有者解决大量数据,大量用户以及数据分析所需的密集计算等问题。他们可以简单地将数据库上传到云中,并让其执行所有管理工作,包括查询处理。此服务模型的一个问题是查询发行者如何验证他们收到的查询结果的确是正确的。这种担忧是合法的,因为作为第三方,云可能并不完全值得信赖,并且作为大型数据中心,云是黑客的理想目标。在查询结果验证方面已经进行了重要的工作,但是大多数仅考虑简单查询,可以通过直接检查原始数据直接在查询条件下检查查询结果。在本文中,我们考虑了使用户能够验证分析查询结果的正确性的问题。与简单的查询不同,分析查询涉及对数据库进行评分函数的排名,这使得很难构建用于验证目的的数据结构。我们提出了两种方法,即一个签名和多签名,并表明它们在分析和实验中都可以在包括TOP-K,范围和KNN查询在内的三种代表性类型的分析性查询中运作良好。

Data outsourcing is a cost-effective solution for data owners to tackle issues such as large volumes of data, huge number of users, and intensive computation needed for data analysis. They can simply upload their databases to a cloud and let it perform all management works, including query processing. One problem with this service model is how query issuers can verify the query results they receive are indeed correct. This concern is legitimate because, as a third party, clouds may not be fully trustworthy, and as a large data center, clouds are ideal targets for hackers. There has been significant work on query result verification, but most consider only simple queries where query results can be attained by checking the raw data against the query conditions directly. In this paper, we consider the problem of enabling users to verify the correctness of the results of analytic queries. Unlike simple queries, analytic queries involve ranking functions to score a database, which makes it difficult to build data structures for verification purposes. We propose two approaches, namely one-signature and multi-signature, and show that they work well on three representative types of analytic queries, including top-k, range, and KNN queries, through both analysis and experiments.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源