首页 | 本学科首页   官方微博 | 高级检索  
     

A Study of the Techniques of Automatic Abstracting and Knowledge Acquisition Systems
作者姓名:SUN Chun kui directed by ZHONG Yi  xin
摘    要:A Study of the Techniques of Automatic Abstracting and Knowledge Acquisition Systems


A Study of the Techniques of Automatic Abstracting and Knowledge Acquisition Systems
SUN Chun kui directed by ZHONG Yi,xin.A Study of the Techniques of Automatic Abstracting and Knowledge Acquisition Systems[J].The Journal of China Universities of Posts and Telecommunications,2001,8(4).
Authors:SUN Chun-kui  ZHONG Yi-xin
Affiliation:SUN Chun kui directed by ZHONG Yi xin
Abstract:ing; automatic knowledge acquisition; machine learning; natural language processing Abstract One of the most important signs of the information society is the explosion of information. The information in Internet is out of order and is mostly written in natural languages which need to be processed by the technology of natural language processing. When you search for some certain information on Internet through a search engine, you might be confused by the huge amount of results which the search engine provides. However, if a search engine is embedded with Automatic Abstracting (AA) processing systems, you could locate the information quickly or you could get more information within a limited time. So, the AA technology is valuable both in science and application. The work of this thesis was begun when we took over a project that is called "The Key Technology Research of Computer Networks Providing Intelligent Information Services" which belongs to the national 863 plan. One of the tasks is "The Key Technology Research of Automatic Abstracting Systems of Chinese Text". As a member of this research group, I took part in designing and implementing an AA system called Literature Abstract and Digest Information Extract System(LADIES). From then on, I have been working in this field and this paper is the conclusion of my work. The main topic of the thesis is AA technology. There are two parts of it. One is about the research of understanding based AA systems, and the other is about the invcestigation of Automatic Knowledge Acquistion(AKA) in AA systems. In the first part, the contents of AA technology are introduced and an understanding based AA model is put forward. Based on this model, LADIES is implemented. There are two major features of LADIES: (1) it understands text with the grammar, semantic and pragmatic information of words; (2) it chunks words into a relatively independent entity with chunking rules which are substitutes of syntactic analyzing rules. The results demonstrate that it performs better than those statistical based AA systems. However, the application of LADIES is limited for its knowledge bases. And it is difficult to use in other fields because the knowledge bases are setup manually. So we investigate the techniques of automatic knowledge acquisition in order to solve the above problems to some extent. In the second part, we introduce the basic ideas of AKA and some Machine Learning (ML) methods which AKA applies. Then we propose a comprehensive dictionary model that contains grammar, semantic and pragmatic information of words. And we investigate a strategy of automatic learning pragmatic information for words. Also we put forward another strategy of automatic learning rule of salience sentences in texts and based on it, we establish an AA system LADIES NEW. Eventually, we suggest a AKA based AA system model called hierarchical feature extracting AA system model.
Keywords:automatic abstracting  automatic knowledge acquisition  machine learning  natural language processing
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号