可解释，稳定和可扩展的图形卷积网络，用于学习图表表示

论文标题

可解释，稳定和可扩展的图形卷积网络，用于学习图表表示

Explainable, Stable, and Scalable Graph Convolutional Networks for Learning Graph Representation

论文作者

Lu, Ping-En, Chang, Cheng-Shang

论文摘要

将图中的节点映射到欧几里得空间中向量的网络嵌入问题对于解决图上的几个重要任务非常有用。最近，已经提出了图形神经网络（GNN）来解决此类问题。但是，大多数嵌入算法和GNN都难以解释，并且不能很好地扩展以处理数百万个节点。在本文中，我们从新的角度解决了问题，基于三个约束优化问题的等效性：网络嵌入问题，模块化图中模块化矩阵的痕量最大化问题以及采样图中模块化矩阵的矩阵分数问题。这三个问题的最佳解决方案是模块化矩阵的主要特征向量。我们提出了两种属于特殊类别的卷积网络（GCN）的算法，以解决这些问题：（i）作为嵌入GCN（CAFE-GCN）和（ii）球形GCN的特征聚类。两种算法都是稳定的痕量最大化算法，它们产生了良好的主要特征向量近似值。此外，还有用于稀疏图的线性时间实现。除了解决网络嵌入问题外，两个提出的GCN都能够降低维度。进行了各种实验，以评估我们提出的GCN，并表明我们所提出的GCN的表现几乎超过所有基线方法。此外，CAFE-GCN可以从标记的数据中受益，并在各种性能指标方面取得了巨大改进。

The network embedding problem that maps nodes in a graph to vectors in Euclidean space can be very useful for addressing several important tasks on a graph. Recently, graph neural networks (GNNs) have been proposed for solving such a problem. However, most embedding algorithms and GNNs are difficult to interpret and do not scale well to handle millions of nodes. In this paper, we tackle the problem from a new perspective based on the equivalence of three constrained optimization problems: the network embedding problem, the trace maximization problem of the modularity matrix in a sampled graph, and the matrix factorization problem of the modularity matrix in a sampled graph. The optimal solutions to these three problems are the dominant eigenvectors of the modularity matrix. We proposed two algorithms that belong to a special class of graph convolutional networks (GCNs) for solving these problems: (i) Clustering As Feature Embedding GCN (CAFE-GCN) and (ii) sphere-GCN. Both algorithms are stable trace maximization algorithms, and they yield good approximations of dominant eigenvectors. Moreover, there are linear-time implementations for sparse graphs. In addition to solving the network embedding problem, both proposed GCNs are capable of performing dimensionality reduction. Various experiments are conducted to evaluate our proposed GCNs and show that our proposed GCNs outperform almost all the baseline methods. Moreover, CAFE-GCN could be benefited from the labeled data and have tremendous improvements in various performance metrics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题