论文标题
有效,高效的查询感知摘要提取用于网络搜索
Effective and Efficient Query-aware Snippet Extraction for Web Search
论文作者
论文摘要
查询感知的网页段提取被广泛用于搜索引擎中,以帮助用户在单击之前更好地了解返回的网页的内容。尽管很重要,但很少研究。在本文中,我们提出了一个名为DeepQse的有效的查询网页摘要提取方法,旨在选择一些句子,这些句子可以最好地在输入查询的上下文中汇总网页内容。 DeepQse首先学习每个句子的查询句子表示,以捕获查询和句子之间的细粒度相关性,然后学习摘要提取的文档含义句子句子相关性表示。由于查询和每个句子是在DeepQse中共同建模的,因此其在线推断可能很慢。因此,我们进一步提出了一个高效的DeepQSe,称为“有效的点数”,它可以显着提高DeepQSe的推理速度而不会影响其性能。有效的点数的核心思想是将查询意识的摘要提取任务分解为两个阶段,即可以缓存句子表示的粗粒度候选句子选择阶段,以及一个细粒度的相关性建模阶段。在两个现实世界数据集上的实验验证了我们方法的有效性和效率。
Query-aware webpage snippet extraction is widely used in search engines to help users better understand the content of the returned webpages before clicking. Although important, it is very rarely studied. In this paper, we propose an effective query-aware webpage snippet extraction method named DeepQSE, aiming to select a few sentences which can best summarize the webpage content in the context of input query. DeepQSE first learns query-aware sentence representations for each sentence to capture the fine-grained relevance between query and sentence, and then learns document-aware query-sentence relevance representations for snippet extraction. Since the query and each sentence are jointly modeled in DeepQSE, its online inference may be slow. Thus, we further propose an efficient version of DeepQSE, named Efficient-DeepQSE, which can significantly improve the inference speed of DeepQSE without affecting its performance. The core idea of Efficient-DeepQSE is to decompose the query-aware snippet extraction task into two stages, i.e., a coarse-grained candidate sentence selection stage where sentence representations can be cached, and a fine-grained relevance modeling stage. Experiments on two real-world datasets validate the effectiveness and efficiency of our methods.