SharkDB: an in-memory column-oriented storage for trajectory analysis期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

SharkDB: an in-memory column-oriented storage for trajectory analysis

Authors:	Bolong?Zheng,Haozhou?Wang,Kai?Zheng author-information" > author-information__contact u-icon-before" > mailto:zhengkai@suda.edu.cn" title=" zhengkai@suda.edu.cn" itemprop=" email" data-track=" click" data-track-action=" Email author" data-track-label=" " >Email author,Han?Su,Kuien?Liu,Shuo?Shang

Affiliation:	1.The University of Queensland,Brisbane,Australia;2.Pivotal Incorporated,San Francisco,USA;3.School of Computer Science and Techonology,Soochow University,Suzhou,China;4.Big Data Research Center, University of Electronic Science and Technology of China,Chengdu,China;5.King Abdullah University of Science and Technology,Thuwal,Saudi Arabia

Abstract:	The last decade has witnessed the prevalence of sensor and GPS technologies that produce a high volume of trajectory data representing the motion history of moving objects. However some characteristics of trajectories such as variable lengths and asynchronous sampling rates make it difficult to fit into traditional database systems that are disk-based and tuple-oriented. Motivated by the success of column store and recent development of in-memory databases, we try to explore the potential opportunities of boosting the performance of trajectory data processing by designing a novel trajectory storage within main memory. In contrast to most existing trajectory indexing methods that keep consecutive samples of the same trajectory in the same disk page, we partition the database into frames in which the positions of all moving objects at the same time instant are stored together and aligned in main memory. We found this column-wise storage to be surprisingly well suited for in-memory computing since most frames can be stored in highly compressed form, which is pivotal for increasing the memory throughput and reducing CPU-cache miss. The independence between frames also makes them natural working units when parallelizing data processing on a multi-core environment. Lastly we run a variety of common trajectory queries on both real and synthetic datasets in order to demonstrate advantages and study the limitations of our proposed storage.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏