首页 | 本学科首页   官方微博 | 高级检索  
     

CNN_BiLSTM双通道维吾尔语名词短语指代消解
引用本文:张江,田生伟,禹龙.CNN_BiLSTM双通道维吾尔语名词短语指代消解[J].计算机仿真,2020,37(4):255-259.
作者姓名:张江  田生伟  禹龙
作者单位:新疆大学软件学院,新疆乌鲁木齐830008;新疆大学网络中心,新疆乌鲁木齐市830046
基金项目:人才培养项目;新疆天山青年计划项目;国家自然科学基金
摘    要:提出了一种利用CNN_BiLSTM双重通道模型的维吾尔语名词短语指代消解。利用包含维语语言特点的Hand-crafted特征初步筛选先行语和照应语,减少不必要的负例,然后使用wordembedding将先行语和照应语向量化,并作为CNN_BiLSTM双重通道模型的输入,使用双通道模型提取空间语义特征和时间语义特征。两种特征融合之后训练softmax分类器,最终完成指代消解任务。上述方法在维吾尔语名词短语指代消解任务中的准确率为84.3召回率为78.1,F1值为81。实验结果表明,充分利用CNN和BiLSTM分别提取时间和空间双重特征的,可以有效提高维吾尔语名词短语指代消解的性能。

关 键 词:卷积神经网络  长短时记忆网络  指代消解  词向量

Coreference Resolution of Uyghur Noun Phrases Based on CNN_BiLSTM
ZHANG Jiang,TIAN Sheng-wei,YU Long.Coreference Resolution of Uyghur Noun Phrases Based on CNN_BiLSTM[J].Computer Simulation,2020,37(4):255-259.
Authors:ZHANG Jiang  TIAN Sheng-wei  YU Long
Affiliation:(School of Software Xinjiang University,Urumqi Xinjiang 830008,China;Network Center,Xinjiang University,Urumqi Xinjiang 830046,China)
Abstract:We proposed a dual channel model for coreference resolution of Uyghur noun phrase. First, we screened the antecedent and anaphora by using the Hand-crafted features which were full of language characteristics of Uygur, In this way, we can reduce the unnecessary negative cases. Then, they were vectorized by using word embedding. These vectors can be the input to the CNN_BiLSTM dual channel model. So we can get the spatial semantic features and temporal semantic features. At last, we integrated these features and training softmax to finish the anaphora resolution task. Our experiment show that the precision rate, the recall rate and F value respectively reach 84.3%, 78.1% and 81%, which demonstrates the efficiency of the dual channel model.
Keywords:CNN  BiLSTM  Coreference resolution  Word embedding
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号