首页 | 本学科首页   官方微博 | 高级检索  
     

基于共同语境的近义词/同义词短语查找模型
引用本文:石晨,张宇,胡博. 基于共同语境的近义词/同义词短语查找模型[J]. 计算机工程与应用, 2021, 57(14): 142-147. DOI: 10.3778/j.issn.1002-8331.2006-0269
作者姓名:石晨  张宇  胡博
作者单位:1.东南大学,南京 2111892.浙江警察学院,杭州 310053
摘    要:为了实现大型语料库中近义词/同义词短语的查找,提出了一种基于共同语境的近义词/同义词短语查找模型,它通过n-gram分布式方法捕获语义相似性,不需要解析就能隐式地保存局部句法结构,使底层方法语言独立;具体实现分为两个阶段:第一阶段是上下文收集和过滤,即用围绕查询短语的本地上下文作为条件模型的特征来捕获语义和语法信息.第...

关 键 词:近义词/同义词  查询短语  语义相似性  上下文  评分函数

Model for Near-Synonym/Synonym Phrase Finding Based on Common Surrounding Context
SHI Chen,ZHANG Yu,HU Bo. Model for Near-Synonym/Synonym Phrase Finding Based on Common Surrounding Context[J]. Computer Engineering and Applications, 2021, 57(14): 142-147. DOI: 10.3778/j.issn.1002-8331.2006-0269
Authors:SHI Chen  ZHANG Yu  HU Bo
Affiliation:1.Southeast?University, Nanjing 211189, China2.Zhejiang Police College, Hangzhou 310053, China
Abstract:In order to find near-synonyms/synonyms phrases in large corpus, a near-synonym/synonym phrase finding model based on common surrounding context is proposed in this paper. It captures semantic similarity via [n]-gram distribu-ted method, and implicitly preserves local syntactic structure without parsing, making the underlying method language independent. The specific implementation is divided into two phases:The first phase is context collection and filtering, that is, it uses the local contexts surrounding the query phrase as features to the conditional model to capture both semantic and syntactic information. The second phase is the collection and screening of candidate phrases, that is, it iterates over all the instances of each “left”, “right” and “pairing” in the data to collect a set of near-synonym/synonym candidate phrases. And the elements that make up the model and the scoring functions used to evaluate the performance of the model are also given. The experimental results based on different large corpus show that the proposed modeling method is superior to other common finding method models in terms of statistical scoring finding performance and overall scalability.
Keywords:near-synonyms/synonyms  query phrases  semantic similarity  context  scoring function  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号