首页 | 本学科首页   官方微博 | 高级检索  
     

基于AST的代码抄袭检测方法研究
引用本文:刘呈龙,贾胜颖,张丽萍,刘东升. 基于AST的代码抄袭检测方法研究[J]. 计算机工程与设计, 2012, 33(4): 1660-1664
作者姓名:刘呈龙  贾胜颖  张丽萍  刘东升
作者单位:内蒙古师范大学计算机与信息工程学院,内蒙古呼和浩特,010022
基金项目:国家自然科学基金项目(60940027);内蒙古自然科学基金项目(2010MS0906)
摘    要:为了检测程序设计类课程中出现的作业抄袭行为,提出了基于抽象语法树的抄袭检测方法.运用语法分析工具对代码进行语法分析生成抽象语法树(AST),通过计算生物学中序列匹配的算法进行程序相似度的计算.提取程序相似部分的AST特征,生成空间向量,聚类分析找出“抄袭团伙”.实验结果表明,该方法对抄袭行为具有较好的检测效果,并能比较准确地找到“抄袭团伙”.

关 键 词:抄袭检测  抽象语法树  序列匹配  空间向量模型  聚类

AST-based plagiarism detection method
LIU Cheng-long , JIA Sheng-ying , ZHANG Li-ping , LIU Dong-sheng. AST-based plagiarism detection method[J]. Computer Engineering and Design, 2012, 33(4): 1660-1664
Authors:LIU Cheng-long    JIA Sheng-ying    ZHANG Li-ping    LIU Dong-sheng
Affiliation:(College of Computer and Information Engineering,Inner Mongolia Normal University,Hohhot 010022,China)
Abstract:To detect plagiarism on programming course,a AST-based plagiarism detection method is proposed.Firstly,the code is parsed by syntax analysis tool to generate the corresponding abstract syntax tree(AST).Biology sequence matching algorithms are used to calculate the similarity of the program.The similar part of the code is found,and then the AST feature is extracted in this part.A vector space model is generated,and then "copy cluster" is found by clustered the feature.Experiments show that this method has a good effect on the detection of plagiarism and can find the "copy cluster" accurate.
Keywords:plagiarism detection  AST  sequence alignment  VSM  cluster
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号