
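A requirements traceability method based on named entity recognition
Cite this article: WANG Jinshui, XUE Xingsi, TANG Zhengyi. A requirements traceability method based on named entity recognition[J]. Application Research of Computers, 2016, 33(1).
Authors: WANG Jinshui, XUE Xingsi, TANG Zhengyi
Affiliation: College of Information Science and Engineering, Fujian University of Technology (all three authors)
Funding: National Natural Science Foundation of China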
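Abstract: Text-based requirements traceability methods depend heavily on the quality of the text. To address this problem, a requirements traceability method is proposed that uses named entity recognition to annotate keywords in artefact documents. The method builds a named entity recognition model from the context of code entities, which solves the problem that abstract syntax trees and regular expressions cannot parse software artefacts that are not in source code form. After the model has identified the code entities in the software artefacts, the method converts the artefacts into a document set and performs semantic clustering, and finally creates requirements trace links between artefacts through a mapping algorithm. Experimental results show that, compared with traceability methods based on all terms and on high-weight terms, the method effectively improves the quality of requirements tracing results.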

Keywords: requirements traceability; named entity recognition; semantic clustering; natural language processing; term weighting
Received: 2015/1/13
Revised: 2015/11/21

Recovering traceability links using named entity recognition
WANG Jinshui, XUE Xingsi, TANG Zhengyi. Recovering traceability links using named entity recognition[J]. Application Research of Computers, 2016, 33(1).
Authors: WANG Jinshui, XUE Xingsi and TANG Zhengyi
Affiliation: College of Information Science and Engineering, Fujian University of Technology (all three authors)
Abstract: Requirements traceability approaches based on textual information rely heavily on the quality of the text. To address this problem, this paper proposes a traceability approach that uses named entity recognition to identify keywords in software artefacts. First, the method builds a named entity recognition model from the context of code entities, which addresses the inability of abstract syntax trees and regular expressions to parse software artefacts that are not source code. After identifying the code entities in the artefacts with this model, the method converts the software artefacts into a document set and clusters the documents semantically. Finally, it creates trace links between software artefacts using a mapping algorithm. Experimental results show that, compared with traceability approaches based on all terms and on high-weight terms, the proposed approach effectively improves the quality of requirements tracing results.
Keywords: requirements traceability; named entity recognition; semantic clustering; natural language processing; term weighting
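
The overall flow summarized in the abstract (tag code entities, turn artefacts into term documents, relate documents, emit trace links) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every name in it is hypothetical. Three simplifications are assumed: `tag_code_entities` uses a shape-and-context heuristic as a stand-in for the trained named entity recognition model; TF-IDF cosine similarity stands in for the semantic clustering and mapping steps; and the `threshold` value and the sample artefacts are invented for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical cue words that often precede a code entity in prose.
CONTEXT_CUES = {"class", "method", "function", "table", "module", "interface"}
TOKEN = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def tag_code_entities(text):
    """Stand-in for an NER model: keep tokens that look like code
    (camelCase or snake_case) or that directly follow a cue word."""
    tokens = TOKEN.findall(text)
    entities = []
    for i, tok in enumerate(tokens):
        looks_like_code = "_" in tok or re.search(r"[a-z][A-Z]", tok) is not None
        follows_cue = i > 0 and tokens[i - 1].lower() in CONTEXT_CUES
        if looks_like_code or follows_cue:
            entities.append(tok.lower())
    return entities


def tfidf_vectors(docs):
    """docs maps a name to a list of terms; returns name -> {term: weight}."""
    n = len(docs)
    df = Counter(term for terms in docs.values() for term in set(terms))
    vectors = {}
    for name, terms in docs.items():
        tf = Counter(terms)
        # Smoothed inverse document frequency, so weights stay positive.
        vectors[name] = {
            t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()
        }
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dictionaries."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def recover_links(requirements, artifacts, threshold=0.3):
    """Return (requirement, artifact, score) triples at or above the threshold.
    Assumes requirement and artifact names do not collide."""
    texts = {**requirements, **artifacts}
    docs = {name: tag_code_entities(text) for name, text in texts.items()}
    vecs = tfidf_vectors(docs)
    links = []
    for r in requirements:
        for a in artifacts:
            score = cosine(vecs[r], vecs[a])
            if score >= threshold:
                links.append((r, a, round(score, 3)))
    return sorted(links, key=lambda link: -link[2])


if __name__ == "__main__":
    # Invented sample artefacts, for illustration only.
    reqs = {"R1": "The user resets a password through the method resetPassword."}
    arts = {
        "A1": "class AccountService calls resetPassword and send_mail",
        "A2": "The module ReportBuilder exports render_chart",
    }
    for link in recover_links(reqs, arts):
        print(link)
```

Restricting the vectors to tagged entities rather than all terms mirrors the abstract's comparison against all-term and high-weight-term baselines, although this sketch makes no claim about reproducing the reported results.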
This article is indexed in Wanfang Data and other databases.