首页 | 本学科首页   官方微博 | 高级检索  
     


An effective 3-in-1 keyword search method over heterogeneous data sources
Authors:Guoliang Li  Jianhua Feng  Beng Chin Ooi  Jianyong Wang  Lizhu Zhou
Affiliation:1. Department of Computer Science, Tsinghua University, Beijing 100084, China;2. School of Computing, National University of Singapore, 117543 Singapore, Singapore
Abstract:Conventional keyword search engines are restricted to a given data model and cannot easily adapt to unstructured, semi-structured or structured data. In this paper, we propose an efficient and adaptive keyword search method, called EASE, for indexing and querying large collections of heterogeneous data. To achieve high efficiency in processing keyword queries, we first model unstructured, semi-structured and structured data as graphs, and then summarize the graphs and construct graph indices instead of using traditional inverted indices. We propose an extended inverted index to facilitate keyword-based search, and present a novel ranking mechanism for enhancing search effectiveness. We have conducted an extensive experimental study using real datasets, and the results show that EASE achieves both high search efficiency and high accuracy, and outperforms the existing approaches significantly.
Keywords:Keyword search  Inverted index  Extended inverted index  Graph index  Ranking  Unstructured data  Semi-structured data  Structured data
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号