Trajectory Big Data Processing Based on Frequent Activity

Abstract

With the rapid development and wide use of the Global Positioning System in devices such as smartphones and touch pads, many people share their personal experiences through the trajectories they record while visiting places of interest. Trajectory query processing has therefore emerged in recent years to help users find their best trajectories. However, given the huge number of trajectory points and their text descriptions, such as the activities users carry out at these points, organizing these data in an index becomes tedious, and a parallel method becomes indispensable. In this paper, we investigate the problem of distributed trajectory query processing based on distance and frequent activities. A query is specified by the start and final points of the trajectory, a distance threshold, and a set of frequent activities associated with the points of interest of the trajectory. The query returns the shortest trajectory that includes the most frequent activities with high support and high confidence. To simplify query processing, we implemented the Distributed Mining Trajectory R-Tree index (DMTR-Tree). In this method, we first organize the large trajectory dataset in distributed R-Tree indexes. Then, within each index, we apply the Apriori frequent itemset algorithm at each point to select the frequent activity set. To speed up these computations, we use the Apache Spark cluster computing framework with MapReduce as the programming model. The experimental results show that the DMTR-Tree index and the query-processing algorithm are efficient and scalable.
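To make the frequent-activity step more concrete, the following is a minimal single-machine sketch of Apriori applied to the activities recorded at one point of interest. It is only an illustration under stated assumptions: the data layout (one set of activity labels per visit), the function names `frequent_activity_sets` and `confidence`, and the sample threshold are all hypothetical, and the sketch leaves out the R-Tree index and the Spark distribution described in the abstract. It should not be read as the authors' DMTR-Tree implementation.

```python
from itertools import combinations


def frequent_activity_sets(visits, min_support):
    """Return {frozenset(activities): support} for itemsets with support >= min_support.

    visits: list of sets of activity labels, one set per visit to a point
            (a hypothetical layout chosen for this sketch).
    """
    n = len(visits)
    if n == 0:
        return {}
    # Level 1 candidates: every single activity seen at this point.
    candidates = {frozenset([a]) for v in visits for a in v}
    frequent = {}
    k = 1
    while candidates:
        # Keep the candidates whose support reaches the threshold.
        level = {}
        for c in candidates:
            support = sum(1 for v in visits if c <= v) / n
            if support >= min_support:
                level[c] = support
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-candidates and
        # prune any candidate that has an infrequent k-subset.
        k += 1
        candidates = set()
        for a, b in combinations(list(level), 2):
            union = a | b
            if len(union) == k and all(
                frozenset(s) in level for s in combinations(union, k - 1)
            ):
                candidates.add(union)
    return frequent


def confidence(frequent, antecedent, consequent):
    """confidence(A -> B) = support(A union B) / support(A)."""
    a = frozenset(antecedent)
    return frequent[a | frozenset(consequent)] / frequent[a]


# Hypothetical visits to a single point of interest.
visits = [
    {"shopping", "dining"},
    {"shopping", "dining", "cinema"},
    {"shopping"},
    {"dining", "cinema"},
]
frequent = frequent_activity_sets(visits, min_support=0.5)
for itemset, support in sorted(frequent.items(), key=lambda kv: -kv[1]):
    print(sorted(itemset), support)
```

In a distributed setting, one plausible arrangement would be to run a function like this independently on each partition of points, but that mapping onto Spark is an assumption and is not specified in the abstract.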

Author and article information

Journal

Journal ID (publisher-id): TST

Title: Tsinghua Science and Technology

Publisher: Tsinghua University Press (Xueyan Building, Tsinghua University, Beijing 100084, China)

ISSN (Print): 1007-0214

Publication date (Print): 05 June 2019

Volume: 24

Issue: 3

Pages: 317-332

Affiliations

[1] Amina Belhassena and Hongzhi Wang are with the School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China. E-mail: wangzh@hit.edu.cn.

Author notes

* To whom correspondence should be addressed. E-mail: amina_belhasna@hotmail.fr

Hongzhi Wang is a professor and doctoral supervisor at Harbin Institute of Technology. He received the PhD degree in computer science from Harbin Institute of Technology in 2018. He is a recipient of the CCF outstanding dissertation award, a Microsoft Fellowship, and an IBM PhD Fellowship. His research area is data management, including data quality, XML data management, and graph management. He has published more than 100 papers in refereed journals and conferences.

Amina Belhassena received the PhD degree in computer science from Harbin Institute of Technology, China, in 2018. She received the master's degree in computer science from Abou Bakr Belkaid Tlemcen University, Algeria, in 2012. Her research interests include massive data computing, data mining, large-scale data management, and data indexing.

Article

Publisher ID: 1007-0214-24-3-317

DOI: 10.26599/TST.2018.9010087

SO-VID: 24d23902-bef4-40a8-8ed6-0f360d5a9b37

History

Date received: 22 November 2017

Date accepted: 22 February 2018

ScienceOpen disciplines: Software engineering, Data structures & Algorithms, Applied computer science, Computer science, Artificial intelligence, Hardware architecture

Keywords: frequent activity, trajectory, distributed R-tree, query
