论文标题
GNN-XML:极端多标签文本分类的图形神经网络
GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification
论文作者
论文摘要
极端的多标签文本分类(XMTC)旨在标记文本实例,其标签集中最相关的标签子集。 XMTC由于现代应用所产生的大量标签集(例如新闻注释和产品推荐)引起了人们的关注。 XMTC的主要挑战是数据可伸缩性和稀疏性,从而导致两个问题:i)扩展到极端标签设置的棘手性,ii)存在长尾标签的分布,这意味着很大一部分标签具有很少的积极培训实例。为了克服这些问题,我们提出了GNN-XML,这是一个针对XMTC问题量身定制的可扩展图神经网络框架。具体而言,我们通过挖掘其共发生模式来利用标签相关性,并基于相关矩阵构建标签图。然后,我们通过使用低通图滤波器执行图形卷积来进行归因图聚类,以共同模型标签依赖项和标签特征,从而诱导语义标签簇。我们进一步提出了一个双边分支图同构网络,以将表示和分类器学习解),以更好地建模尾标。多个基准数据集的实验结果表明,GNN-XML在保持可比较的预测效率和模型大小的同时显着胜过最先进的方法。
Extreme multi-label text classification (XMTC) aims to tag a text instance with the most relevant subset of labels from an extremely large label set. XMTC has attracted much recent attention due to massive label sets yielded by modern applications, such as news annotation and product recommendation. The main challenges of XMTC are the data scalability and sparsity, thereby leading to two issues: i) the intractability to scale to the extreme label setting, ii) the presence of long-tailed label distribution, implying that a large fraction of labels have few positive training instances. To overcome these problems, we propose GNN-XML, a scalable graph neural network framework tailored for XMTC problems. Specifically, we exploit label correlations via mining their co-occurrence patterns and build a label graph based on the correlation matrix. We then conduct the attributed graph clustering by performing graph convolution with a low-pass graph filter to jointly model label dependencies and label features, which induces semantic label clusters. We further propose a bilateral-branch graph isomorphism network to decouple representation learning and classifier learning for better modeling tail labels. Experimental results on multiple benchmark datasets show that GNN-XML significantly outperforms state-of-the-art methods while maintaining comparable prediction efficiency and model size.