论文标题
印度语言网格及其与普林斯顿文字的联系
Indian Language Wordnets and their Linkages with Princeton WordNet
论文作者
论文摘要
WordNet是丰富的词典语义资源。链接的WordNet是WordNet的扩展,它们在不同语言的WordNet中链接了相似的概念。这些资源在许多自然语言处理(NLP)应用程序中非常有用,主要是基于知识的方法的应用程序。在这种方法中,这些资源被视为黄金标准/甲骨文。因此,这些资源拥有正确的信息至关重要。因此,它们是由人类专家创建的。但是,很难获得多种语言的人类专家。因此,社区将受益于共享此类手动创建的资源。在本文中,我们发布了与普林斯顿文章相关的18种印度语言WordNet的映射。我们认为,此类资源的可用性将直接影响这些语言的NLP进度。
Wordnets are rich lexico-semantic resources. Linked wordnets are extensions of wordnets, which link similar concepts in wordnets of different languages. Such resources are extremely useful in many Natural Language Processing (NLP) applications, primarily those based on knowledge-based approaches. In such approaches, these resources are considered as gold standard/oracle. Thus, it is crucial that these resources hold correct information. Thereby, they are created by human experts. However, human experts in multiple languages are hard to come by. Thus, the community would benefit from sharing of such manually created resources. In this paper, we release mappings of 18 Indian language wordnets linked with Princeton WordNet. We believe that availability of such resources will have a direct impact on the progress in NLP for these languages.