1.
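Supervised machine-learning methods for word sense disambiguation require extensive manual sense annotation, which makes them hard to apply to large-scale disambiguation tasks. This paper proposes an unsupervised disambiguation method that avoids manual sense annotation. The method draws on multiple knowledge sources in the WordNet knowledge base (sense definitions, usage examples, structured semantic relations, and domain attributes) to describe the sense information of an ambiguous word. It generates a "representative word set" and a "domain representative word set" for each sense, then determines the sense by combining these with word frequency distributions and the surrounding context. In open experiments on the standard Senseval 3 test set covering six typical unsupervised word sense disambiguation methods, the proposed method achieved an average accuracy of 49.93%.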
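The overlap idea described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' scoring formula: the hand-written word sets stand in for sets that would be derived from WordNet, and the inverse-frequency weighting, the `domain_weight` parameter, and all function names are assumptions made for the example.

```python
import math

def overlap_score(context, word_set, word_freq, corpus_size):
    """Sum an inverse-frequency weight over context words found in word_set."""
    shared = set(context) & word_set
    # Rarer words count more; this weighting is an assumption for the sketch.
    return sum(math.log(corpus_size / (1 + word_freq.get(w, 0))) for w in shared)

def disambiguate(context, senses, word_freq, corpus_size, domain_weight=0.5):
    """Pick the sense whose word sets overlap most with the context."""
    best_sense, best_score = None, float("-inf")
    for sense, sets in senses.items():
        score = overlap_score(context, sets["representative"], word_freq, corpus_size)
        score += domain_weight * overlap_score(context, sets["domain"], word_freq, corpus_size)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense

# Hypothetical toy data; real sets would come from WordNet glosses, examples,
# semantic relations, and domain labels.
senses = {
    "bank#finance": {"representative": {"money", "deposit", "loan"},
                     "domain": {"economy", "account"}},
    "bank#river": {"representative": {"river", "shore", "slope"},
                   "domain": {"geography", "water"}},
}
word_freq = {"money": 40, "deposit": 10, "account": 25, "river": 30, "water": 60}
context = ["she", "opened", "an", "account", "to", "deposit", "money"]
chosen = disambiguate(context, senses, word_freq, corpus_size=10000)
```

Here `chosen` is the financial sense, because only that sense's word sets share any words with the context.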
2.
3.
4.
In machine translation, collocation dictionaries are important for selecting accurate target words. However, a dictionary that is too large can decrease the efficiency of translation. This paper presents a method to develop a compact collocation dictionary for transitive verb–object pairs in English–Korean machine translation without losing translation accuracy. We use WordNet to calculate the semantic distance between words, and k-nearest-neighbor learning to select the translations. The entries in the dictionary are minimized to balance the trade-off between translation accuracy and time. We have performed several experiments on a selected set of verbs extracted from a raw corpus of over 3 million words. The results show that in real-time translation environments the size of a collocation dictionary can be reduced up to 40% of its original size without a significant decrease in its accuracy.
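A minimal sketch of k-nearest-neighbor translation selection over a semantic distance is given below. It assumes a tiny hand-made taxonomy (`PARENT`) in place of WordNet, path length through the lowest shared ancestor as the distance, and placeholder translation labels (`T_eat`, `T_perform`); none of these details come from the paper.

```python
from collections import Counter

# Hypothetical toy taxonomy standing in for WordNet hypernym links.
PARENT = {
    "entity": None,
    "food": "entity", "artifact": "entity",
    "fruit": "food", "bread": "food",
    "apple": "fruit", "pear": "fruit",
    "instrument": "artifact",
    "piano": "instrument", "violin": "instrument", "guitar": "instrument",
}

def ancestors(word):
    """Return the chain from word up to the root, word first."""
    chain = []
    while word is not None:
        chain.append(word)
        word = PARENT[word]
    return chain

def semantic_distance(a, b):
    """Count edges on the path from a to b through their lowest shared ancestor."""
    steps_from_b = {node: i for i, node in enumerate(ancestors(b))}
    for steps_from_a, node in enumerate(ancestors(a)):
        if node in steps_from_b:
            return steps_from_a + steps_from_b[node]
    raise ValueError("no common ancestor")

def select_translation(obj, entries, k=3):
    """Vote among the k stored objects closest to obj."""
    nearest = sorted(entries, key=lambda e: semantic_distance(obj, e[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Stored (object noun, translation label) pairs for one verb; labels are placeholders.
entries = [("apple", "T_eat"), ("pear", "T_eat"),
           ("piano", "T_perform"), ("violin", "T_perform")]
```

With this setup, an unseen object such as `guitar` borrows the translation of its nearest stored neighbors, which is why a compact dictionary does not need an entry for every object noun.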
5.
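A word sense disambiguation method based on semantic network structure is proposed. Every sense of each word occurring in a text fragment is treated as a node, the semantic relation between any two senses of two words is treated as an edge, and the closeness of that relation is taken as the edge weight, yielding an undirected weighted network. Google's PageRank algorithm is applied to this undirected weighted graph to evaluate the importance of the nodes in the network, and, combined with coreferent senses and sense frequency, the nouns in the text are disambiguated. Experiments show that the method is effective for word sense disambiguation of text.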
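The graph-ranking step described above might look roughly like the sketch below. It covers only the weighted PageRank part; the coreferent-sense and sense-frequency components mentioned in the abstract are omitted. The edge weights, sense labels, damping factor, and function names are illustrative assumptions.

```python
def weighted_pagerank(edges, damping=0.85, iterations=50):
    """Run PageRank on an undirected weighted graph given as {(u, v): weight}."""
    adj = {}
    for (u, v), w in edges.items():
        # Store each undirected edge in both directions.
        adj.setdefault(u, {})[v] = w
        adj.setdefault(v, {})[u] = w
    nodes = sorted(adj)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    strength = {node: sum(adj[node].values()) for node in nodes}
    for _ in range(iterations):
        new_rank = {}
        for node in nodes:
            # Each neighbor passes on rank in proportion to the edge weight.
            incoming = sum(rank[nb] * w / strength[nb] for nb, w in adj[node].items())
            new_rank[node] = (1 - damping) / n + damping * incoming
        rank = new_rank
    return rank

def pick_senses(rank, senses_by_word):
    """For each word, keep the candidate sense with the highest rank."""
    return {word: max(senses, key=lambda s: rank.get(s, 0.0))
            for word, senses in senses_by_word.items()}

# Hypothetical sense graph for the words "bank" and "deposit".
edges = {
    ("bank#finance", "deposit#money"): 0.9,
    ("bank#river", "deposit#sediment"): 0.4,
    ("bank#finance", "deposit#sediment"): 0.1,
    ("bank#river", "deposit#money"): 0.1,
}
senses_by_word = {
    "bank": ["bank#finance", "bank#river"],
    "deposit": ["deposit#money", "deposit#sediment"],
}
rank = weighted_pagerank(edges)
chosen = pick_senses(rank, senses_by_word)
```

In this toy graph the strongly connected pair `bank#finance` and `deposit#money` reinforce each other and end up with the highest ranks, so both words resolve to their financial senses.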
6.
7.
This paper presents a method for automatically constructing a Korean WordNet from pre-existing lexical resources. We develop a set of automatic word sense disambiguation techniques to link a Korean word sense collected from a bilingual machine-readable dictionary to a single corresponding English WordNet synset. We show how the individual links provided by each word sense disambiguation method can be non-linearly combined to produce a Korean WordNet for nouns from the existing English WordNet.
8.
9.
10.
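Retrieval systems can introduce ontologies to compensate for the lack of semantics in traditional keyword retrieval. However, ontology construction by domain experts is complex, slow, and hard to update. To address this, the paper comprehensively analyzes a range of ontology construction methods and techniques and presents a semi-automatic ontology construction scheme tailored to the characteristics of patent data. On that basis, it proposes an architecture for a patent information retrieval system based on semi-automatically constructed ontologies, and describes the design of the system prototype and its retrieval workflow. Experiments verify that the system expands and extends query terms well and markedly improves retrieval efficiency and recall.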