Paper Title

Aligning Visual and Lexical Semantics

Authors

Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

Abstract

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.
