论文标题
通过因果关系恢复的条件动态推断
Inference in conditioned dynamics through causality restoration
论文作者
论文摘要
从条件动力学中计算可观察到的物体通常很难计算,因为尽管从无条件的动力学中有效地获得独立样品通常是可行的,但通常必须丢弃大多数样本(以重要性采样形式),因为它们不满足所施加的条件。直接从条件分布进行采样是非平凡的,因为条件打破了动力学的因果特性,最终使采样过程有效。实现它的标准方法是通过大都会蒙特卡洛程序,但是此过程通常很慢,需要大量的蒙特卡罗步骤来获得少量统计独立的样本。在这项工作中,我们提出了一种从条件分布中产生独立样本的替代方法。该方法了解了广义动力学模型的参数,该模型以各种意义上最佳描述条件分布。结果是一个有效的,无条件的动力学模型,可以从中可以从中获得独立的样本,从而有效地恢复条件分布的因果关系。后果是双重的:一方面,它使我们能够通过简单地在独立的样本上平均从条件动力学中有效地计算可观察力。另一方面,该方法给出了有效的无条件分布,更易于解释。该方法是灵活的,可以实际应用于任何动态。我们讨论了该方法的重要应用,即(不完美)临床测试的流行风险评估问题,对于赋予吉莱斯皮式样本样本的大型时间连续的流行模型。我们表明,该方法与最新技术的比较,包括软砂方法和平均场方法。
Computing observables from conditioned dynamics is typically computationally hard, because, although obtaining independent samples efficiently from the unconditioned dynamics is usually feasible, generally most of the samples must be discarded (in a form of importance sampling) because they do not satisfy the imposed conditions. Sampling directly from the conditioned distribution is non-trivial, as conditioning breaks the causal properties of the dynamics which ultimately renders the sampling procedure efficient. One standard way of achieving it is through a Metropolis Monte-Carlo procedure, but this procedure is normally slow and a very large number of Monte-Carlo steps is needed to obtain a small number of statistically independent samples. In this work, we propose an alternative method to produce independent samples from a conditioned distribution. The method learns the parameters of a generalized dynamical model that optimally describe the conditioned distribution in a variational sense. The outcome is an effective, unconditioned, dynamical model, from which one can trivially obtain independent samples, effectively restoring causality of the conditioned distribution. The consequences are twofold: on the one hand, it allows us to efficiently compute observables from the conditioned dynamics by simply averaging over independent samples. On the other hand, the method gives an effective unconditioned distribution which is easier to interpret. The method is flexible and can be applied virtually to any dynamics. We discuss an important application of the method, namely the problem of epidemic risk assessment from (imperfect) clinical tests, for a large family of time-continuous epidemic models endowed with a Gillespie-like sampler. We show that the method compares favorably against the state of the art, including the soft-margin approach and mean-field methods.