首页 | 本学科首页   官方微博 | 高级检索  
     

基于HTTP1.1的WEB信息还原关键技术研究
引用本文:陈晓苏,邓硕,熊兵.基于HTTP1.1的WEB信息还原关键技术研究[J].电脑开发与应用,2007,20(4):48-49,67.
作者姓名:陈晓苏  邓硕  熊兵
作者单位:华中科技大学,武汉,430074
摘    要:对因特网中一些重要数据信息进行还原和提取,是保证网络应用的健康发展和打击网络犯罪的一个重要手段。根据因特网中的实时网络数据大多基于HTTP协议进行构建与传输这一应用背景,针对HTTP1.1中增加的持续性连接(Persist Connection)与块编码(Chunked Encoding)技术,探讨了在新技术下的信息还原方法,实现了对某些应用场合所关心数据的还原、提取和存储。所给出的还原方法经实践验证,处理效率和效果达到了预期目标,对识别因特网中的不良网页信息具有较大的实用价值。

关 键 词:WEB信息还原  HTTP协议  持续性连接  块编码
文章编号:1003-5850(2007)04-0048-03
收稿时间:2006-10-24
修稿时间:2006-10-242007-02-24

Research on WEB Information Extraction Technology based on HTTP 1.1 Protocol
Chen Xiaosu.Research on WEB Information Extraction Technology based on HTTP 1.1 Protocol[J].Computer Development & Applications,2007,20(4):48-49,67.
Authors:Chen Xiaosu
Abstract:To guarantee the normally development of network application and attack the crime of using network, one of the most important aspects is monitoring and restoring the HTTP information. As most network applications’ data transmission is now based on HTTP 1.1, this paper studies the extraction of WEB information that used Persist Connection and Chunked Encoding technologies which are added in HTTP 1.1, implementing the reduction, extraction and store of data that are concerned in some application fields. The result of the experiment shows that: both efficiency and effect of the process are achieved and has a great meaning in distinguish the information from the Internet.
Keywords:WEB information extraction  HTTP protocol  persist connection  chunked encoding
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号