论文标题
在文本和社交图上自审学习的谣言检测
Rumor Detection with Self-supervised Learning on Texts and Social Graph
论文作者
论文摘要
近年来,谣言检测已成为一个新兴和积极的研究领域。核心是对丰富信息中固有的谣言特征进行建模,例如社交网络中的传播模式和邮政内容中的语义模式,并将其与事实区分开。但是,现有关于谣言检测的工作在建模异质信息的建模方面缺乏,要么仅使用一个信息源(例如社交网络或发布内容),要么忽略了多个来源之间的关系(例如,通过简单串联融合社交和内容功能)。因此,他们可能有缺点,可以全面理解谣言,并准确地检测它们。在这项工作中,我们探讨了对异质信息来源的对比度自我监督学习,以揭示他们的关系并更好地描述谣言。从技术上讲,我们通过辅助自我监督的任务补充了主要监督的检测任务,该任务通过自我歧视来丰富邮政的表示。具体而言,鉴于对帖子的两个异质观点(即编码社会模式和语义模式的表示形式),与其他帖子相比,歧视是通过最大化同一帖子不同观点之间的相互信息来完成的。我们根据信息源的不同关系,设计了集群和实例的方法来产生观点并进行歧视。我们将此框架称为自我监督的谣言检测(SRD)。在三个现实世界数据集上进行的广泛实验验证了SRD在社交媒体上自动谣言检测的有效性。
Rumor detection has become an emerging and active research field in recent years. At the core is to model the rumor characteristics inherent in rich information, such as propagation patterns in social network and semantic patterns in post content, and differentiate them from the truth. However, existing works on rumor detection fall short in modeling heterogeneous information, either using one single information source only (e.g. social network, or post content) or ignoring the relations among multiple sources (e.g. fusing social and content features via simple concatenation). Therefore, they possibly have drawbacks in comprehensively understanding the rumors, and detecting them accurately. In this work, we explore contrastive self-supervised learning on heterogeneous information sources, so as to reveal their relations and characterize rumors better. Technically, we supplement the main supervised task of detection with an auxiliary self-supervised task, which enriches post representations via post self-discrimination. Specifically, given two heterogeneous views of a post (i.e. representations encoding social patterns and semantic patterns), the discrimination is done by maximizing the mutual information between different views of the same post compared to that of other posts. We devise cluster-wise and instance-wise approaches to generate the views and conduct the discrimination, considering different relations of information sources. We term this framework as Self-supervised Rumor Detection (SRD). Extensive experiments on three real-world datasets validate the effectiveness of SRD for automatic rumor detection on social media.