An experimental model of Chinese textual database |
| |
Authors: | Shih‐Shyeng Tseng Chen‐Chau Yang Ching‐Chun Hsieh |
| |
Affiliation: | 1. Computing Center , Academia Sinica , Taipei, Taiwan, 11529, R.O.C.;2. Department of Electronic Engineering , National Taiwan Institute of Technology , Taipei, Taiwan, 10772, R.O.C.;3. Institute of Information Science , Academia Sinica , Taipei, Taiwan, 11529, R.O.C. |
| |
Abstract: | Abstract A textual database deals with retrieval and manipulation of documents. It allows a user to search on‐line complete documents or parts of documents rather than attributes of documents. Resembling a formatted database which uses a data model as its underlying structure, a textual database has to base its development upon a document model. In this paper, a document model, called the ECHO model, is proposed. The ECHO model provides a document representation, called the ECHO structure, for expressing documents and operations on the representation that serve to express queries and manipulations on documents. It has the ability to provide multiple document structures for a document, a flexible search unit for retrieving textual information, and a subrange search on a textual database. In addition, the ECHO structure is relatively easy to maintain. An architecture of a textual database based on the ECHO model is also proposed. In order to improve the query performance, a refined character inversion method, called ARCIM, is proposed as the text‐access method of the Chinese textual database. The ARCIM can retrieve texts faster than a simple inversion method and requires less space overhead. |
| |
Keywords: | |
|
|