首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于免疫遗传算法的网络新词识别方法
引用本文:丁建立,慈祥,黄剑雄. 一种基于免疫遗传算法的网络新词识别方法[J]. 计算机科学, 2011, 38(1): 240-245
作者姓名:丁建立  慈祥  黄剑雄
作者单位:1. 中国民航大学计算机科学与技术学院,天津,300300;中国民航信息技术科研基地,天津,300300
2. 中国国际航空股份有限公司信息管理部,北京,100071
基金项目:本文受国家高技术研究发展计划(863)(2006AA12A106),国家自然科学基金(60879015,60572167)资助。
摘    要:随着互联网的发展,网络新词不断涌现,但是目前的分词方法很难及时、准确地对其做出识别。对此提出一种应用免疫遗传算法的网络新词识别方法。在分析网络新词特点的基础上,利用汉语词群现象和词位的概念提取出示范抗体,在遗传算法进行的过程中有针对性地注入该抗体。实验表明,该方法对于分词碎片中符合词群现象的新词有着极高的识别率,对于一般网络新词的识别率也基本令人满意。

关 键 词:免疫遗传算法,汉语词群,词位,杭体,网络新词识别

Approach of Internet New Word Identification Based on Immune Genetic Algorithm
DING Jian-li,CI Xiang,HUANG Jian-xiong. Approach of Internet New Word Identification Based on Immune Genetic Algorithm[J]. Computer Science, 2011, 38(1): 240-245
Authors:DING Jian-li  CI Xiang  HUANG Jian-xiong
Affiliation:(College of Computer Science and Technology, Civil Aviation University oI China,Tianjin 300300,China);(Information Technology Research Base,Civil Aviation Administration of China,Tianjin 300300,China);(Information Management Department in Air China,Beijing 100071,China)
Abstract:The development of Internet leads the Internet new word coming into being. These unknown words are difficult to identify timely and accurately by the current Word Segmentation Method, therefore Internet new word identification method using Immune genetic algorithm was brought forward. This method is based on the analysis of characteristics of Internet new word, using the phenomenon of Chinese words and word groups to extract exemplary antibody, and injecting the antibody targeted during the process of genetic algorithm.The experiment results show that the methodnot only has a higher recognition rates of the new words consistent with the phenomenon of word groups in word fragments but the result of identifying ordinary Internet new word is adequate.
Keywords:Immune genetic algorithm   Word group   Word position   Antibody   Internet new word identification
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号