首页 | 本学科首页   官方微博 | 高级检索  
     

6搜-高效的专用IPv6搜索引擎
引用本文:黄皓凌,张凡.6搜-高效的专用IPv6搜索引擎[J].电子设计工程,2011,19(23):34-37,40.
作者姓名:黄皓凌  张凡
作者单位:深圳大学信息中心,广东深圳,518060
摘    要:基于开源搜索引擎Nutch,通过修改、调整和创新研制了文中介绍的6搜——一个专门搜索支持IPv6协议网站的专用IPv6搜索引擎。6搜的特点和创新点有:采集IPv6网页的速度在每秒100页以上;采集了54 195个IPv6网站,存储有2 000万IPv6网页,并且网页在不断更新和增加;有中文分词功能和自主创新的搜索网站功能。通过运行,6搜为用户提供了优质IPv6搜索服务;通过对6搜采集数据的分析,得到世界IPv6网站的分布。展现了IPv6网络的发展。

关 键 词:IPv6  搜索  搜索引擎  搜索网站  网络爬虫  顶级域名  Nutch  Lucene

6 sou-A highly effective specialized IPv6 search engine
HUANG Hao-ling,ZHANG Fan.6 sou-A highly effective specialized IPv6 search engine[J].Electronic Design Engineering,2011,19(23):34-37,40.
Authors:HUANG Hao-ling  ZHANG Fan
Affiliation:(Information Technology Center,Shenzhen University,Shenzhen 518060,China)
Abstract:Based on open source search engine Nutch, through modification, tuning and innovation, 6sou, a search engine that only searches IPv6 protocol supporting web sites, is developed. It contains following features and innovations: 6sou crawls IPv6 web sites at more than 100 pages per second; 6sou has crawled 54195 web sites and has stored 20 million IPv6 web pages; the number of pages is increasing and the pages are being updated continuously; 6sou has Chinese word segmentation feature and independently innovated search web site feature. After going online, 6sou has provided users with high quality IPv6 search service. Through the analysis of data collected by 6sou, world IPv6 web site distribution is presented. It reflects the development of IPv6 network.
Keywords:IPv6  search  search engine  search web site  web crawler  top level domain name  Nutch  Lucene
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号