论文标题
敏感组的隐私意识到实验:一般的Chi Square方法
Privacy Aware Experimentation over Sensitive Groups: A General Chi Square Approach
论文作者
论文摘要
我们研究了一个新的隐私模型,用户属于某些敏感群体,我们想就各组之间的结果存在显着差异进行统计推断。特别是,我们不认为用户的结果是敏感的,而只是对某些组的成员资格。这与以前考虑了当地私人统计测试的工作形成鲜明对比的是,结果和团体共同私有化,以及私人A/B测试,在该群体被视为公共(对照组和治疗组)的同时,结果私有化。在样本中将小组成员私有化之后,我们涵盖了几种不同的假设检验设置,包括二进制和实际有价值的结果。我们采用了在不同隐私模型中的其他假设测试中使用的广义$χ^2 $测试框架,这使我们能够覆盖$ z $ - 检验,$χ^2 $测试,用于独立,t检验和ANOVA测试,采用单个统一方法。在考虑两组时,我们会为均值的真正差异提供置信区间,并显示传统的计算置信区间的方法,当引入隐私时会错过真正的差异。对于两个以上的组,我们考虑了将小组成员私有化的几种机制,这表明我们可以改善统计能力,而不是忽略由于隐私而忽略噪音的传统测试。我们还考虑对私人A/B测试的应用,以确定对照组和治疗之间敏感组之间的平均值差异是否存在显着变化。
We study a new privacy model where users belong to certain sensitive groups and we would like to conduct statistical inference on whether there is significant differences in outcomes between the various groups. In particular we do not consider the outcome of users to be sensitive, rather only the membership to certain groups. This is in contrast to previous work that has considered locally private statistical tests, where outcomes and groups are jointly privatized, as well as private A/B testing where the groups are considered public (control and treatment groups) while the outcomes are privatized. We cover several different settings of hypothesis tests after group membership has been privatized amongst the samples, including binary and real valued outcomes. We adopt the generalized $χ^2$ testing framework used in other works on hypothesis testing in different privacy models, which allows us to cover $Z$-tests, $χ^2$ tests for independence, t-tests, and ANOVA tests with a single unified approach. When considering two groups, we derive confidence intervals for the true difference in means and show traditional approaches for computing confidence intervals miss the true difference when privacy is introduced. For more than two groups, we consider several mechanisms for privatizing the group membership, showing that we can improve statistical power over the traditional tests that ignore the noise due to privacy. We also consider the application to private A/B testing to determine whether there is a significant change in the difference in means across sensitive groups between the control and treatment.