首页 | 本学科首页   官方微博 | 高级检索  
     

一种适用于多样性环境的业务流程挖掘方法
引用本文:杨丽琴,康国胜,郭立鹏,田朝阳,张亮,张笑楠,高翔.一种适用于多样性环境的业务流程挖掘方法[J].软件学报,2015,26(3):550-561.
作者姓名:杨丽琴  康国胜  郭立鹏  田朝阳  张亮  张笑楠  高翔
作者单位:复旦大学 计算机科学技术学院, 上海 201203;上海市数据科学重点实验室复旦大学, 上海 201203;上海中医药大学 图书信息中心, 上海 201203,复旦大学 计算机科学技术学院, 上海 201203;上海市数据科学重点实验室复旦大学, 上海 201203,复旦大学 计算机科学技术学院, 上海 201203;上海市数据科学重点实验室复旦大学, 上海 201203,复旦大学 计算机科学技术学院, 上海 201203;上海市数据科学重点实验室复旦大学, 上海 201203,复旦大学 计算机科学技术学院, 上海 201203;上海市数据科学重点实验室复旦大学, 上海 201203,中国移动有限公司 信息系统管理部, 北京 100084,中国移动有限公司 信息系统管理部, 北京 100084
基金项目:国家自然科学基金(60873115); 教育部-中国移动科研基金(MCM20123011); 上海市科技发展基金(13dz2260200, 13511504300); 上海中医药大学预算内项目(2013JW30)
摘    要:从运行日志挖掘业务流程模型的流程挖掘方法研究方兴未艾,然而,复杂多变的运行环境使流程日志也不可避免地呈现出多样性.传统的流程挖掘算法各有其适用对象,因此,如何挑选适合多样性流程日志的流程挖掘算法成为了一项挑战.提出一种适用于多样性环境的业务流程挖掘方法 So Fi(survival of fittest integrator).该方法基于领域知识对日志进行分类,使用多种现有的挖掘算法对每一类子日志产生一组流程模型作为遗传算法的初始种群,借助遗传算法的优化能力,从中整合得到高质量的业务流程模型.针对模拟日志和某通信公司真实日志的实验结果表明:相对于任何单一的挖掘算法,So Fi产生的流程模型具有更高的综合质量,即重现度、精确度、通用性和简单性.

关 键 词:流程挖掘  流程整合  遗传算法  日志分类  Pro  M
收稿时间:7/1/2014 12:00:00 AM
修稿时间:2014/11/21 0:00:00

Process Mining Approach for Diverse Application Environments
YANG Li-Qin,KANG Guo-Sheng,GUO Li-Peng,TIAN Zhao-Yang,ZHANG Liang,ZHANG Xiao-Nan and GAO Xiang.Process Mining Approach for Diverse Application Environments[J].Journal of Software,2015,26(3):550-561.
Authors:YANG Li-Qin  KANG Guo-Sheng  GUO Li-Peng  TIAN Zhao-Yang  ZHANG Liang  ZHANG Xiao-Nan and GAO Xiang
Affiliation:School of Computer Science, Fudan University, Shanghai 201203, China;Shanghai Key Laboratory of Data Science Fudan University, Shanghai 201203, China;Library and Information Center, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China,School of Computer Science, Fudan University, Shanghai 201203, China;Shanghai Key Laboratory of Data Science Fudan University, Shanghai 201203, China,School of Computer Science, Fudan University, Shanghai 201203, China;Shanghai Key Laboratory of Data Science Fudan University, Shanghai 201203, China,School of Computer Science, Fudan University, Shanghai 201203, China;Shanghai Key Laboratory of Data Science Fudan University, Shanghai 201203, China,School of Computer Science, Fudan University, Shanghai 201203, China;Shanghai Key Laboratory of Data Science Fudan University, Shanghai 201203, China,Department of Management Information System, China Mobile Communications Corporation, Beijing 100084, China and Department of Management Information System, China Mobile Communications Corporation, Beijing 100084, China
Abstract:Mining business process models from running logs is in its ascendant. Inevitably, the ever changing operational environment makes these log records diverse. Considering every mining algorithm has its pros and cons, this paper focuses on the challenge to apply a best mining algorithm against diverse logs. A novel approach, SoFi (survival of fittest integrator), is proposed to mine business process models effectively in such a diverse environment. SoFi tackles the diversity issue by utilizing domain knowledge to classify the cases in a log and applying various mining algorithms on these categories to obtain comprehensive process models as candidates for optimization. A genetic algorithm (GA) based optimizer takes these candidates as initial population for purpose of both genetic quality as well as genetic diversity. Under the principle of survival of fittest, the GA optimizer can aggregate best process fragments with context into the final process model for the entire log. Experiments on synthetic data and real cases from a telecommunication firm demonstrate the effectiveness of SoFi and comprehensive quality of mined process models in terms of replay fitness, accuracy, generalization, and simplicity.
Keywords:process mining  process consolidation  genetic algorithm  log classification  ProM
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号