首页 | 本学科首页   官方微博 | 高级检索  
     

基于 Heritrix 视频资源抓取的研究与实现
引用本文:徐 枫,归伟夏. 基于 Heritrix 视频资源抓取的研究与实现[J]. 集成技术, 2014, 3(3): 85-91
作者姓名:徐 枫  归伟夏
作者单位:广西大学计算机与电子信息学院;广西银行学校信息与管理教学部;
摘    要:教学视频资源是教学资源库的重要组成部分,对视频资源的添加是系统平台的一项重要工作。目前很多教学资源库对视频资源的添加采用手工方式进行,效率不理想且工作量极大。通过引入网络爬虫,利用Heritrix的扩展功能,可以定制相应的模块,使其自动抓取网络上的课程视频资源。而通过优化其抓取算法,可以提高资源库中视频的抓取效率和准确率。

关 键 词:视频资源  Heritrix抓取  主题爬虫  垂直搜索

Research and Implementation of Video Resource Capture Based on Heritrix
XU Feng and GUI Weixia. Research and Implementation of Video Resource Capture Based on Heritrix[J]. , 2014, 3(3): 85-91
Authors:XU Feng and GUI Weixia
Abstract:The video teaching resource is an important part of the teaching resource library, and it is important to addvideo resources for the system platform. At present, the adding of video resources for many teaching resource librariesis done by hand, which is of low efficiency and produces heavy workload. By introducing the network crawler and usingthe extended function of Heritrix, the corresponding module was customized to make it automatically grasp course videoresources from the network. And it could improve the video grasping efficiency and accuracy of the resource library byoptimizing its grasping algorithm.
Keywords:video resources   Heritrix grasp   the theme crawler   vertical search
本文献已被 CNKI 等数据库收录!
点击此处可从《集成技术》浏览原始摘要信息
点击此处可从《集成技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号