首页 | 本学科首页   官方微博 | 高级检索  
     

基于搜索引擎的模糊字频统计
引用本文:李世明,李铮,苑志伟,尤枫,赵恒永. 基于搜索引擎的模糊字频统计[J]. 计算机工程与设计, 2010, 31(2)
作者姓名:李世明  李铮  苑志伟  尤枫  赵恒永
作者单位:1. 中国石油化工股份有限公司催化荆分公司,北京,100011
2. 一零二四互动营销顾问(北京)有限公司,北京,100029
3. 北京化工大学信息科学与技术学院,北京,100029
摘    要:针对传统字频统计方法周期长、代价高的弱点,提出了一种利用互联网内容并借助搜索引擎检索进行汉字模糊字频统计的全新方法,有效利用了网络时代的相关技术和发展成果,在一定程度上缓解了字频统计需求频繁的和传统统计方法的低效且代价高昂之间的矛盾,同时对该方法进行了实例化的分析、验证和改进.

关 键 词:中文信息处理  模糊字频统计  搜索引擎  互联网  汉字字频

Fuzzy frequency statistics of Chinese characters based on search engine
LI Shi-ming,LI Zheng,YUAN Zhi-wei,YOU Feng,ZHAO Heng-yong. Fuzzy frequency statistics of Chinese characters based on search engine[J]. Computer Engineering and Design, 2010, 31(2)
Authors:LI Shi-ming  LI Zheng  YUAN Zhi-wei  YOU Feng  ZHAO Heng-yong
Affiliation:LI Shi-ming1,LI Zheng2,YUAN Zhi-wei1,YOU Feng3,ZHAO Heng-yong3(1.China Petroleum , Chemical Corporation Sinopec Catalyst Company,Beijing 100011,China,2.1024 Interactive Marketing Consultant(Beijing) Co Ltd,Beijing 100029,3.School of Information Science , Technology,Beijing University of Chemical Technology,China)
Abstract:Considering the traditional frequency statistics methods have drawbacks that may take longer time and higher spending,a brand new way of fuzzy frequency statistics of Chinese characters is presented by utilizing content of Internet and relying on search engines.To a certain degree the new method relieves the inconsistency between excessive demand of frequency statistics and ineffectiveness,expensiveness of traditional statistical method.Meanwhile,the analysis,verification and improvement of this new method ...
Keywords:Chinese information process  fuzzy frequency statistics  search engine  Internet  Chinese characters frequency
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号