论文标题
一种快速算法,用于通过在线社交平台中的影响来对用户进行排名
A Fast Algorithm for Ranking Users by their Influence in Online Social Platforms
论文作者
论文摘要
衡量用户在社交网络中的影响是众多应用程序的关键。最近提出的影响度量指标(以$ψ$分数为单位)可以超越传统的中心度指标,仅通过进一步纳入用户发布和重新启动活动提供的丰富信息来评估结构图的重要性。实际上,$ψ$ -SCORE概括了Pagerank以进行非均匀节点活动。尽管它具有重要意义,但它的扩展比大数据集的范围很差。对于$ n $用户的网络,它需要求解$ n $尺寸$ n $的方程式的$ n $线性系统。为了解决这个问题,这项工作引入了一种新颖的可扩展算法,用于快速近似$ψ$ -SCORE,名为\ textit {power} - $ψ$。所提出的算法基于一个新颖方程式,表明它可以求解一个大小$ n $的方程式系统来计算$ψ$ - 分数。然后,我们的算法利用了这样一个事实,即可以递归并分布近似于任何所需误差的事实。这允许$ψ$ -SCORE总结节点的结构和行为信息,可以像Pagerank一样快地运行。我们在几个现实世界中的数据集上验证了提出的算法的有效性,该算法是开源Python库。
Measuring the influence of users in social networks is key for numerous applications. A recently proposed influence metric, coined as $ψ$-score, allows to go beyond traditional centrality metrics, which only assess structural graph importance, by further incorporating the rich information provided by the posting and re-posting activity of users. The $ψ$-score is shown in fact to generalize PageRank for non-homogeneous node activity. Despite its significance, it scales poorly to large datasets; for a network of $N$ users, it requires to solve $N$ linear systems of equations of size $N$. To address this problem, this work introduces a novel scalable algorithm for the fast approximation of $ψ$-score, named \textit{Power}-$ψ$. The proposed algorithm is based on a novel equation indicating that it suffices to solve one system of equations of size $N$ to compute the $ψ$-score. Then, our algorithm exploits the fact that such a system can be recursively and distributedly approximated to any desired error. This permits the $ψ$-score, summarizing both structural and behavioral information for the nodes, to run as fast as PageRank. We validate the effectiveness of the proposed algorithm, which we release as an open source Python library, on several real-world datasets.