首页 | 本学科首页   官方微博 | 高级检索  
     

基于信令数据的轨迹驻留点识别算法研究
引用本文:李万林,王超,许国良,雒江涛,张轩.基于信令数据的轨迹驻留点识别算法研究[J].电子与信息学报,2020,42(12):3013-3020.
作者姓名:李万林  王超  许国良  雒江涛  张轩
作者单位:1.重庆邮电大学通信与信息工程学院 重庆 4000652.重庆邮电大学电子信息与网络工程研究院 重庆 400065
基金项目:重庆市自然科学基金 (cstc2018jcyjAX0587),新型感知技术、信息融合处理及其应用(A2017-10)
摘    要:针对密度聚类算法只能识别密度相近的簇类且计算复杂度高等问题,该文提出一种基于信令数据中时空轨迹信息的密度峰值快速聚类(ST-CFSFDP)算法。首先对低采样密度的信令数据进行预处理,消除轨迹震荡现象;然后基于密度峰值快速聚类(CFSFDP)算法显式地增加时间维度限制,将局部密度由2维扩展到3维,并提出高密度时间间隔以表征簇中心在时间维度上的数据特征;接着设计筛选策略以选取聚类中心;最后识别用户出行轨迹中的驻留点,完成出行链的划分。实验结果表明,所提算法适用于采样密度低且定位精度差的信令数据,相比CFSFDP算法更适用于时空数据,相比基于密度的时空聚类算法(ST-DBSCAN)召回率提升14%,准确率提升8%,同时降低计算复杂度。

关 键 词:信令数据    时空聚类    密度峰值快速聚类算法    驻留点识别    出行链
收稿时间:2019-11-14

Research of Track Resident Point Identification Algorithm Based on Signaling Data
Wanlin LI,Chao WANG,Guoliang XU,Jiangtao LUO,Xuan ZHANG.Research of Track Resident Point Identification Algorithm Based on Signaling Data[J].Journal of Electronics & Information Technology,2020,42(12):3013-3020.
Authors:Wanlin LI  Chao WANG  Guoliang XU  Jiangtao LUO  Xuan ZHANG
Affiliation:1.Institute of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China2.Electronic Information and Networking Research Institute, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Abstract:For the problem that the density-based clustering algorithm can only identify clusters with similar density and high computational complexity, a Clustering by Fast Search and Find of Density Peaks based on Spatio-Temporal trajectory information in mobile phone signaling data, namely ST-CFSFDP, is proposed. Firstly, the low sampling density signaling data are pre-processed to eliminate the trajectory oscillation phenomenon in the data. Then, based on the Clustering by Fast Search and Find of Density Peaks(CFSFDP) algorithm, the time dimension limitation is explicitly increased, and the local density is extended from two-dimension to three-dimension. Moreover, in order to characterize the cluster center point in the time dimension, the concept of high-density time interval is defined. Secondly, the suitable cluster center screening strategy is developed to select automatically the appropriate cluster center. Finally, the resident points are identified in the travel trajectory of individual users over a period of time and the division of the travel chains is completed. The experimental results show that the algorithm is suitable for signaling data with low sampling density and poor positioning accuracy. It is more suitable for spatio-temporal data than CFSFDP algorithm. Compared with Density-Based Spatial Clustering of Applications with Noise based on Spatio-Temporal data (ST-DBSCAN) algorithm, the recall rate is improved by 14%, the accuracy rate is increased by 8%, and the computational complexity is also reduced.
Keywords:
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号