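Research on Improved Algorithm of Word Semantic Similarity Based on HowNet (基于知网的词语语义相似度改进算法研究)
Cite this article: WANG Hui, Marius.Petrescu, PAN Junhui, WANG Haochang, ZHANG Qiang. Research on Improved Algorithm of Word Semantic Similarity Based on HowNet [J]. Computer and Digital Engineering (计算机与数字工程), 2022, 50(2): 225-228, 293.
Authors: WANG Hui (王辉)  Marius.Petrescu  PAN Junhui (潘俊辉)  WANG Haochang (王浩畅)  ZHANG Qiang (张强)
Affiliations: School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318; Petroleum-Gas University of Ploiesti, Ploiesti 100680
Funding: Natural Science Foundation of Heilongjiang Province; Northeast Petroleum University Guiding Innovation Fund; National Natural Science Foundation of China
Abstract: Word semantic similarity computation is widely used in many fields related to natural language processing. Existing HowNet-based methods for computing word semantic similarity do not fully consider the effects of sememe distance, sememe depth, sememe density, and primary/secondary relationships within the same sememe hierarchy tree, so their similarity results are not accurate enough. To address this problem, an improved word semantic similarity algorithm is proposed. By analyzing the sense expressions and sememe hierarchy trees in HowNet, it uses a weighted average over the set in place of the maximum sense similarity...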

Keywords: HowNet  word semantic similarity  sememe density  sememe depth

Research on Improved Algorithm of Word Semantic Similarity Based on HowNet
WANG Hui, Marius.Petrescu, PAN Junhui, WANG Haochang, ZHANG Qiang. Research on Improved Algorithm of Word Semantic Similarity Based on HowNet [J]. Computer and Digital Engineering, 2022, 50(2): 225-228, 293.
Authors: WANG Hui  Marius.Petrescu  PAN Junhui  WANG Haochang  ZHANG Qiang
Affiliation: (Department of Computer and Information Technology, Northeast Petroleum University, Daqing 163318; Petroleum-Gas University of Ploiesti, Ploiesti 100680)
Abstract: Semantic similarity of words is widely used in many fields related to NLP. Existing HowNet-based algorithms for word semantic similarity do not adequately consider the distance, depth, and density of sememes in the same sememe hierarchy tree, or the primary and secondary relationships between them, so their similarity results are not accurate enough. To solve this problem, the paper proposes an improved HowNet-based algorithm for word semantic similarity. By analyzing the sense expressions and sememe hierarchy trees in HowNet, the algorithm replaces the maximum sense similarity with a weighted average over the set, introduces sememe density into a new edge weight function, and constrains the influence of sememe depth and sememe density on sememe similarity through a weight factor. Experimental results show that the algorithm effectively improves the accuracy of word semantic similarity and gives more reasonable results than existing methods.
Keywords:HowNet  semantic similarity of words  density of sememe  depth of sememe
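
The aggregation change described in the abstract, replacing the maximum with a weighted average over the set of sense-pair similarities, can be illustrated with a minimal Python sketch. The abstract does not give the paper's weighting scheme, so the function names, the example similarities, and the weights below are hypothetical and chosen only for illustration; this is not the authors' formula.

```python
from typing import Sequence


def max_similarity(pair_sims: Sequence[float]) -> float:
    """Baseline aggregation: keep only the largest pairwise sense similarity."""
    if not pair_sims:
        raise ValueError("pair_sims must not be empty")
    return max(pair_sims)


def weighted_average_similarity(
    pair_sims: Sequence[float], weights: Sequence[float]
) -> float:
    """Alternative aggregation: weighted average over all pairwise similarities.

    The weights are an assumption of this sketch; the source does not say
    how the paper assigns them.
    """
    if len(pair_sims) != len(weights):
        raise ValueError("pair_sims and weights must have the same length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(s * w for s, w in zip(pair_sims, weights))
    return weighted_sum / total_weight


# Hypothetical similarities between the sense pairs of two words.
example_sims = [0.9, 0.3, 0.6]
# Hypothetical weights, e.g. favoring primary over secondary senses.
example_weights = [0.5, 0.3, 0.2]

baseline = max_similarity(example_sims)
averaged = weighted_average_similarity(example_sims, example_weights)
print(f"max: {baseline:.2f}, weighted average: {averaged:.2f}")
```

With these made-up inputs the maximum is driven entirely by the single best-matching sense pair, while the weighted average also reflects the weaker pairs, which is the kind of effect the abstract attributes to the change.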
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).