
A DOM-Based Web Information Extraction Method

Cite this article:	DENG Chao, XIONG Xuan-dong. A DOM-Based Web Information Extraction Method[J]. Microcomputer Applications, 2007, 23(3): 49-52.

Authors:	DENG Chao, XIONG Xuan-dong

Affiliation:	Research Institute, Institute of Electronic Technology, PLA Information Engineering University, Zhengzhou 450004

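Abstract:	This paper proposes a DOM-based method for Web information extraction. The location path of the information to be extracted is obtained through inductive learning. Extraction patterns are then written by exploiting XPath for data location and XSLT for data transformation. Finally, the correspondence between Web page elements and DOM nodes is used to judge whether a newly obtained information source is suitable for an existing extraction pattern.
Keywords:	Web information extraction; DOM-based; Web page structure judgment
Article ID:	1007-757X(2007)03-0049-04
Received:	2006-05-08
Revised:	2006-05-08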
A DOM-based Web Information Extraction

DENG Chao, XIONG Xuan-dong. A DOM-based Web Information Extraction[J]. Microcomputer Applications, 2007, 23(3): 49-52.

Authors:	DENG Chao, XIONG Xuan-dong

Abstract:	This paper proposes a DOM-based Web information extraction method. The location path of the information to be extracted is obtained by inductive learning; the extraction pattern is written using the strengths of XPath in data location and XSLT in data transformation; and the mapping between a Web page's elements and the DOM's nodes is used to judge whether a newly obtained information source fits an existing extraction pattern.

Keywords:	DOM; XPath; XSLT
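
The workflow summarized in the abstract (locate data by path, extract it, and check whether a page fits an existing pattern) can be illustrated with a minimal sketch. This is not the authors' implementation: the sample page, the `PATTERN` structure, and the function names `fits_pattern` and `extract` are all hypothetical. It also assumes well-formed XHTML input and uses the limited XPath subset in Python's standard `xml.etree.ElementTree`. The standard library has no XSLT processor, so plain dictionaries stand in for the XSLT transformation step.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample page; well-formed XHTML so the stdlib parser can build a tree.
PAGE = """<html><body>
  <table id="books">
    <tr><td>XML Basics</td><td>29.00</td></tr>
    <tr><td>DOM in Practice</td><td>35.50</td></tr>
  </table>
</body></html>"""

# Hypothetical extraction pattern: one location path selecting the records,
# plus one relative location path per output field.
PATTERN = {
    "record": "./body/table[@id='books']/tr",
    "fields": {"title": "./td[1]", "price": "./td[2]"},
}


def fits_pattern(root, pattern):
    """Return True if the record path matches and every record has all fields."""
    records = root.findall(pattern["record"])
    if not records:
        return False
    return all(
        rec.find(path) is not None
        for rec in records
        for path in pattern["fields"].values()
    )


def extract(root, pattern):
    """Return one dict per record, mapping field name to stripped node text."""
    return [
        {
            name: (rec.find(path).text or "").strip()
            for name, path in pattern["fields"].items()
        }
        for rec in root.findall(pattern["record"])
    ]


root = ET.fromstring(PAGE)
if fits_pattern(root, PATTERN):
    for row in extract(root, PATTERN):
        print(row)
```

Checking `fits_pattern` before `extract` mirrors the abstract's idea of first judging whether an information source is suitable for an existing pattern. A page whose DOM lacks the expected table is rejected rather than yielding empty or misaligned records.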
This article is indexed in databases including CNKI, VIP, and Wanfang Data.