
Research on Algorithms of Automatic Recognition and Disambiguation of Tibetan Free Functional Words in Tibetan Historical Documents Recognition
Citation: ZHUO Ma-ji. Research on Algorithms of Automatic Recognition and Disambiguation of Tibetan Free Functional Words in Tibetan Historical Documents Recognition[J]. Computer & Telecommunication, 2018, 1(12): 20-22
Author: ZHUO Ma-ji
Affiliation: College of Computer Science, Qinghai Minzu University
Abstract: Function words are an important component of Tibetan texts, and they make document recognition considerably more difficult. Drawing on traditional Tibetan grammar and syntactic rules, this paper studies and proposes three algorithms for recognizing the large number of free function words that occur in Tibetan historical documents, and builds a disambiguation rule base of 284 rules for Tibetan free function words. During document digitization, these make it possible to quickly identify and resolve the ambiguity of non-free function words in Tibetan sentences, improving the accuracy of automatic recognition of Tibetan documents.

Keywords: Tibetan function words; automatic recognition; disambiguation; algorithm