首页 | 本学科首页   官方微博 | 高级检索  
     

基于LSI的代码-文档可追溯关联挖掘研究
引用本文:杨雪敏,张毅坤,崔颖安,张保卫,夏辉.基于LSI的代码-文档可追溯关联挖掘研究[J].计算机工程,2011,37(8):34-36.
作者姓名:杨雪敏  张毅坤  崔颖安  张保卫  夏辉
作者单位:西安理工大学计算机科学与工程学院,西安,710048
基金项目:陕西省自然科学基金资助项目,陕西省教育厅专项基金资助项目
摘    要:软件过程产品间可追溯关联挖掘对软件维护及需求跟踪等众多领域至关重要。基于此,提出一种基于潜在语义索引提取程序代码和中文文档关联信息的方法,该方法是对向量空间模型的改进,通过分析文本间隐含的语义结构来确定关联度,而不依赖于词项的匹配。实验结果表明,该方法不依赖于代码和文档预先定义的同义词库和知识库,并能一定程度上提高查全率和查准率。

关 键 词:软件维护  可追溯关联挖掘  隐含语义索引  信息检索  跨语言信息检索

Research on Code and Documentation Traceability Association Mining Based on LSI
YANG Xue-min,ZHANG Yi-kun,CUI Ying-an,ZHANG Bao-wei,XIA Hui.Research on Code and Documentation Traceability Association Mining Based on LSI[J].Computer Engineering,2011,37(8):34-36.
Authors:YANG Xue-min  ZHANG Yi-kun  CUI Ying-an  ZHANG Bao-wei  XIA Hui
Affiliation:(School of Computer Science and Engineering,Xi’an University of Technology,Xi’an 710048,China)
Abstract:Traceability link recovery among software process products is very important in many fields, such as software maintenance, as well as requirement trac. Based on Latent Semantic lndexing(LSI), the traceability recovery information can be extracted automatically from program source code and the related Chinese documentation. The obvious advantage is that the presented method does not rely on the pre-defined thesaurus and knowledge for the code and documentation, and to some extent, it improves the recall and precision.
Keywords:software maintenance  traceability association mining  Latent Semantic Indexing(LSl)  Information Retrieval(IR)  Cross-Language Information Retrieval(CLIR)
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号