首页 | 本学科首页   官方微博 | 高级检索  
     

基于文本挖掘的搭配词典自动架构探讨
引用本文:张辉,薛贵荣.基于文本挖掘的搭配词典自动架构探讨[J].上海工程技术大学学报,2004,18(4):323-326.
作者姓名:张辉  薛贵荣
作者单位:1. 上海工程技术大学电子电气工程学院,上海,200065
2. 扬州大学计算机科学系,扬州,225009
基金项目:上海工程技术大学青年基金资助项目(2003Q03)
摘    要:研究词语搭配的关系对于自然语言处理有很大的帮助。目前对计算机用的搭配词典是用人工方法实现的,它由人工进行维护,有更新慢、收藏的词少等缺点。为此,利用文本挖掘技术对大规模语料库进行分析,挖掘词语搭配的深层关系,在此基础上自动建立词语搭配词典,实验结果显示该方法是有效的。

关 键 词:文本挖掘  互信息  关联规则挖掘  搭配词典
文章编号:1009-444X(2004)04-0323-04
修稿时间:2004年3月27日

Automatic Construction of Collocations Dictionary Based on Text Mining
ZHANG Hui,XUE Gui-rong.Automatic Construction of Collocations Dictionary Based on Text Mining[J].Journal of Shanghai University of Engineering Science,2004,18(4):323-326.
Authors:ZHANG Hui  XUE Gui-rong
Affiliation:ZHANG Hui~1,XUE Gui-rong~2
Abstract:A collocations dictionary is the useful component to many natural language and spoken language processing application such as grammar checking, text-speech conversion and machine translation. Currently The collocations dictionary is constructed artificially, firstly it may not be updated frequently and many lexicon entries may be not available. Secondly to construct a dictionary may need lots of human resources. In this paper, text-mining approach for constructing a collocations dictionary is surveyed. The main purpose is to enable cheap and quick acquisition of a collocations dictionary from a large text corpus. Experimental results show that the approach is effective and suitable.
Keywords:text mining  mutual information  association rule mining  collocations dictionary
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号