首页 | 本学科首页   官方微博 | 高级检索  
     

基于局部词频指纹的论文抄袭检测算法
引用本文:秦玉平,冷强奎,王秀坤,王春立.基于局部词频指纹的论文抄袭检测算法[J].计算机工程,2011,37(6):193-194.
作者姓名:秦玉平  冷强奎  王秀坤  王春立
作者单位:1. 渤海大学信息科学与工程学院,辽宁,锦州,121000
2. 大连理工大学电子与信息工程学院,辽宁,大连,116024
3. 大连海事大学信息科学技术学院,辽宁,大连,116026
基金项目:国家自然科学基金资助项目,国家"973"计划基金资助项目
摘    要:提出一种基于局部词频指纹的论文抄袭检测算法。将句子看成文档的基本构成元素,对其进行有效关键词提取排序重构,根据编码和词频联合方式获取句子指纹,以此计算文本间相似度。在新闻网页精简集SOGOU-T上的实验结果表明,该算法在一定程度上克服了现有论文抄袭检测算法检测精度低的缺点,具有较快的检测速度。

关 键 词:抄袭检测  数字指纹  局部词频  相似度

Plagiarism-detection Algorithm for Scientific Papers Based on Local Word-frequency Fingerprint
QIN Yu-ping,LENG Qiang-kui,WANG Xiu-kun,WANG Chun-li.Plagiarism-detection Algorithm for Scientific Papers Based on Local Word-frequency Fingerprint[J].Computer Engineering,2011,37(6):193-194.
Authors:QIN Yu-ping  LENG Qiang-kui  WANG Xiu-kun  WANG Chun-li
Affiliation:1.College of Information Science and Engineering,Bohai University,Jinzhou 121000,China;2.School of Electronic and Information Engineering,Dalian University of Technology,Dalian 116024,China;3.College of Information Science and Technology,Dalian Maritime University,Dalian 116026,China)
Abstract:An algorithm for plagiarism-detection of scientific papers based on local word-frequency fingerprint is presented.Sentence is regarded as the basic component elements of a document,and extracting efficient keywors,sorting and reconstructing them.According to the code and word-frequency,the fingerprints are get to compute text similarity degree.The identification experiments on SOGOU-T database are done with the algorithm.Experimental results show that it partly overcomes the shortage of existing plagiarism-detection of scientific papers,and it has better performance on identification precision and identification speed.
Keywords:plagiarism-detection  digital fingerprint  local word-frequency  similarity
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号