首页 | 本学科首页   官方微博 | 高级检索  
     


S3Mining: A model-driven engineering approach for supporting novice data miners in selecting suitable classifiers
Affiliation:1. Wake Research Group, Universidad Tecnológica de Chile INACAP, Chile;2. Dpto. de Ingeniería Informática y Electrónica, Universidad de Cantabria, Santander, Spain;3. Wake Research Group, Dpto. Ciencias del Mar y Biología Aplicada, Universidad de Alicante, Spain;4. Wake Research Group, Dpto. Lenguajes y Sistemas Informáticos, Instituto Universitario de Investigación Informática, Universidad de Alicante, Spain;1. Universidad de Murcia, Facultad de Informatica, Campus de Espinardo, 30100 Murcia, Spain;2. iMinds-DistriNet-KU Leuven, Belgium;1. ITK, 5 rue de la cavalerie, Montpellier F-34000, France;2. ISIMA/LIMOS, UMR 6158 CNRS, Blaise Pascal University, BP 10125, Aubiere F-63177, France;1. Qingdao University, Qingdao 266001, China;2. Department of Transportation Information and Control Engineering, Tongji University, Shanghai 201800, China
Abstract:Data mining has proven to be very useful in order to extract information from data in many different contexts. However, due to the complexity of data mining techniques, it is required the know-how of an expert in this field to select and use them. Actually, adequately applying data mining is out of the reach of novice users which have expertise in their area of work, but lack skills to employ these techniques. In this paper, we use both model-driven engineering and scientific workflow standards and tools in order to develop named S3Mining framework, which supports novice users in the process of selecting the data mining classification algorithm that better fits with their data and goal. To this aim, this selection process uses the past experiences of expert data miners with the application of classification techniques over their own datasets. The contributions of our S3Mining framework are as follows: (i) an approach to create a knowledge base which stores the past experiences of experts users, (ii) a process that provides the expert users with utilities for the construction of classifiers’ recommenders based on the existing knowledge base, (iii) a system that allows novice data miners to use these recommenders for discovering the classifiers that better fit for solving their problem at hand, and (iv) a public implementation of the framework’s workflows. Finally, an experimental evaluation has been conducted to shown the feasibility of our framework.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号