Web表格信息抽取的研究 Research on Web Table Extraction期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Web表格信息抽取的研究

引用本文：	林科锵,左志宏,林琳.Web表格信息抽取的研究[J].通讯和计算机,2005,2(8):27-31.

作者姓名：	林科锵左志宏林琳

作者单位：	电子科技大学计算机科学与工程学院,成都610054

摘要：	Web表格信息抽取是信息抽取在Web表格上的一种应用，是当今的一个研究热点。本文首先分析了Web表格信息抽取的过程，包括表格识别、结构识别以及“属性-值”对的提取；然后对当前国内外在基于特定域和独立城两种表格信息抽取研究方法上的动态及成果追行了比较和分析。在此基础上，提出了表格抽取的关键技术——表格结构识别上的一些想法；最后展望了Web表格信息抽取技术的发展趋势。
关键词：	信息抽取 Web表格特定域独立域
Research on Web Table Extraction

Keqiang Lin, ZhihongZuo, Lin Lin.Research on Web Table Extraction[J].Journal of Communication and Computer,2005,2(8):27-31.

Authors:	Keqiang Lin ZhihongZuo Lin Lin

Abstract:	Web table extraction, which is a current research hotspot, is an application of information extraction on Web table. In this paper, we first analyze the flow of Web table extraction, including table detection, structure recognition and attribute-value pair extraction. Then we compare what others have done with both domain-specific and domain-independent methodologies ir this field. Based on the above survey and analysis, we put forward some ideas in the table structure recognition, which is one of the key steps in the flow of whole extraction. At last, we present the tendency of development of Web table extraction.

Keywords:	Information Extraction Web Table Domain-specific Domain-independent
本文献已被维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏