首页 | 本学科首页   官方微博 | 高级检索  
     

基于区间模糊匹配函数的数据清洗算法研究及其在问卷调查中的应用
引用本文:米允龙,李金海,米春桥,,刘文奇,刘 佳,王 添.基于区间模糊匹配函数的数据清洗算法研究及其在问卷调查中的应用[J].南京师范大学学报,2017,0(3).
作者姓名:米允龙  李金海  米春桥    刘文奇  刘 佳  王 添
作者单位:(1.怀化学院计算机科学与工程学院,湖南 怀化 418000)(2.昆明理工大学理学院,云南 昆明 650500)(3.武陵山片区生态农业智能控制技术湖南省重点实验室,湖南 怀化 418000)
摘    要:数据清洗是保证数据质量的重要步骤. 由于人类的活动通常带有一定的主观性与情绪性,因此现实中部分数据往往存在不合理性甚至错误. 而此类不合理数据常具有不确定性、模糊性与隐藏性,这给数据清洗带来了困难. 传统的数据清洗方法对此类数据难以充分发挥作用. 结合区间值模糊集理论与匹配函数提出一种区间模糊匹配函数方法,构建区间模糊匹配算法来清洗数据、提高数据质量,并将其应用在问卷调查数据中. 实验结果表明本算法具有较高的准确度及运行效率,适应处理数据中的不合理数据.

关 键 词:数据清洗  匹配函数  区间模糊集  区间模糊匹配函数  问卷调查数据

Reasearch into Data Cleaning Algorithm Based on Interval FuzzyMatching Functions and Its Application to Questionnaire Data
Mi Yunlong,Li Jinhai,Mi Chunqiao,' target="_blank" rel="external">,Liu Wenqi,Liu Jia,Wang Tian.Reasearch into Data Cleaning Algorithm Based on Interval FuzzyMatching Functions and Its Application to Questionnaire Data[J].Journal of Nanjing Nor Univ: Eng and Technol,2017,0(3).
Authors:Mi Yunlong  Li Jinhai  Mi Chunqiao  " target="_blank">' target="_blank" rel="external">  Liu Wenqi  Liu Jia  Wang Tian
Affiliation:(1.School of Computer Science and Engineering,Huaihua University,Huaihua 418000,China)(2.Faculty of Science,Kunming University of Science and Technology,Kunming 650500,China)(3.Hunan Provincial Key Laboratory of Ecological Agriculture Intelligent Control Technology,Huaihua 418000,China)
Abstract:Data cleaning is a very important step to ensure data quality. The real-world data often has some unreasonable data even error because of human activites usually with subjectivity and emotionality,such as the questionare data. However,there are some difficulties to process data cleaning due to these unreasonable data often being uncertainty,ambiguity and hidding. For this type of data,the traditional data cleaning methods have difficulty in handling the unreasonable data. Therefore,by combining the basic theories of interval-valued fuzzy set and mathcing function,we propose an interval fuzzy matching function method. Based on this method we construct a new algorithm to clean data and improve data quality,and then apply it to questionaire data. Experiments show that our algorithm have a good precision and running efficiency,and that it is adaptable to process the unreasonable data.
Keywords:data cleaning  matching function  interval-valued fuzzy set  interval-valued fuzzy matching function  questionnaire data
本文献已被 CNKI 等数据库收录!
点击此处可从《南京师范大学学报》浏览原始摘要信息
点击此处可从《南京师范大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号