Skip to main navigation Skip to search Skip to main content

Trajectory big data processing based on frequent activity

  • Amina Belhassena*
  • , Hongzhi Wang
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

With the rapid development and wide use of Global Positioning System in technology tools, such as smart phones and touch pads, many people share their personal experience through their trajectories while visiting places of interest. Therefore, trajectory query processing has emerged in recent years to help users find their best trajectories. However, with the huge amount of trajectory points and text descriptions, such as the activities practiced by users at these points, organizing these data in the index becomes tedious. Therefore, the parallel method becomes indispensable. In this paper, we have investigated the problem of distributed trajectory query processing based on the distance and frequent activities. The query is specified by start and final points in the trajectory, the distance threshold, and a set of frequent activities involved in the point of interest of the trajectory. As a result, the query returns the shortest trajectory including the most frequent activities with high support and high confidence. To simplify the query processing, we have implemented the Distributed Mining Trajectory R-Tree index (DMTR-Tree). For this method, we initially managed the large trajectory dataset in distributed R-Tree indexes. Then, for each index, we applied the frequent itemset Apriori algorithm for each point to select the frequent activity set. For the faster computation of the above algorithms, we utilized the cluster computing framework of Apache Spark with MapReduce as the programing model. The experimental results show that the DMTR-Tree index and the query-processing algorithm are efficient and can achieve the scalability.

Original languageEnglish
Article number8620950
Pages (from-to)317-332
Number of pages16
JournalTsinghua Science and Technology
Volume24
Issue number3
DOIs
StatePublished - Jun 2019
Externally publishedYes

Keywords

  • Distributed R-tree
  • Frequent activity
  • Query
  • Trajectory

Fingerprint

Dive into the research topics of 'Trajectory big data processing based on frequent activity'. Together they form a unique fingerprint.

Cite this