Discovering Important Locations From Massive and Low-Quality Cell Phone Trajectory Data
Cite this article: ZHANG Zhi-Gang, JIN Che-Qing, WANG Xiao-Ling, ZHOU Ao-Ying. Discovering Important Locations From Massive and Low-Quality Cell Phone Trajectory Data[J]. Journal of Software, 2016, 27(7): 1700-1714
Authors: ZHANG Zhi-Gang  JIN Che-Qing  WANG Xiao-Ling  ZHOU Ao-Ying
Affiliation: Institute for Data Science and Engineering, School of Computer Science and Software Engineering, East China Normal University, Shanghai 200062, China (all authors)
Funding: National Basic Research Program of China (973 Program) (2012CB316203); National Natural Science Foundation of China (61370101); Key Scientific Research Innovation Project of the Shanghai Municipal Education Commission (14ZZ045).
Abstract: Important locations are the places where people spend most of their time in daily life, such as their home and workplace. The development and popularization of smartphones have brought great convenience to daily life. Besides traditional uses such as making calls and accessing the Internet, the logs generated automatically when a phone connects to base stations are an important data source for mining user behavior patterns, for example important location discovery. However, this work faces several challenges: the huge volume of trajectory data, low positional accuracy, and the diversity of cell phone users. This paper therefore proposes a general framework to improve the usability of trajectory data. The framework consists of a state-based filtering module, which improves data usability, and an important location mining module. Two distributed mining algorithms are designed on top of this framework: GPMA (Grid-based Parallel Mining Algorithm) and SPMA (Station-based Parallel Mining Algorithm). To further improve the accuracy and precision of the mining results, three optimizations are developed: (i) fusion of data from multiple sources, (ii) an algorithm to find users who have no workplace, and (iii) an algorithm to find people who work at night and correct their important locations. Theoretical analysis and extensive experimental results on real datasets show that the proposed algorithms are efficient and scalable, and achieve higher accuracy.

Keywords: low quality  trajectory mining  important locations  data repairing
Received: 2015-09-25
Revised: 2016-01-12
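
The abstract does not spell out how GPMA or SPMA work. As a rough illustration only of what grid-based discovery of home and work locations from base-station logs could look like, here is a minimal single-machine sketch. It is not the authors' algorithm: the record layout `(user_id, timestamp, lat, lon)`, the cell size `CELL_DEG`, the hour windows `NIGHT_HOURS` and `WORK_HOURS`, and the function names are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict
from datetime import datetime

CELL_DEG = 0.01                            # assumed grid cell size, in degrees
NIGHT_HOURS = set(range(0, 6)) | {22, 23}  # assumed hours spent at home
WORK_HOURS = set(range(9, 17))             # assumed weekday working hours


def to_cell(lat, lon, size=CELL_DEG):
    """Map a coordinate to an integer (row, col) grid cell index."""
    return (math.floor(lat / size), math.floor(lon / size))


def important_locations(records):
    """Guess a home and a work cell per user by counting visits.

    records: iterable of (user_id, datetime, lat, lon) log rows.
    Returns {user_id: {"home": cell or None, "work": cell or None}}.
    """
    night = defaultdict(Counter)  # user -> visit counts per cell at night
    day = defaultdict(Counter)    # user -> visit counts per cell in work hours
    users = set()
    for user, ts, lat, lon in records:
        users.add(user)
        cell = to_cell(lat, lon)
        if ts.hour in NIGHT_HOURS:
            night[user][cell] += 1
        elif ts.hour in WORK_HOURS and ts.weekday() < 5:
            day[user][cell] += 1

    result = {}
    for user in users:
        home = night[user].most_common(1)  # [] when no night records exist
        work = day[user].most_common(1)    # [] when no work-hour records exist
        result[user] = {
            "home": home[0][0] if home else None,
            "work": work[0][0] if work else None,
        }
    return result
```

Calling `important_locations` on a list of log rows returns one entry per user. A user with no weekday work-hour records gets `"work": None`, which is only a crude stand-in for the paper's dedicated algorithm for users without a workplace. The state-based filtering, the distributed execution, and the night-shift correction described in the abstract are not modeled here.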