Skip to main navigation Skip to search Skip to main content

A performance study of big spatial data systems

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the accelerated growth in spatial data volume, being generated from a wide variety of sources, the need for efficient storage, retrieval, processing and analyzing of spatial data is ever more important. Hence, spatial data processing system has become an important field of research. In recent times a number of Big Spatial Data systems have been proposed by researchers around the world. These systems can be roughly categorized into Apache Hadoopbased and in-memory systems based on Apache Spark. The available features supported by these systems vary widely. However, there has not been any comprehensive evaluation study of these systems in terms of performance, scalability and functionality. To address this need, we propose a benchmark to evaluate Big Spatial Data systems. Although, Spark is a very popular framework, its performance is limited by the overhead associated with distributed resource management and coordination. The Big Spatial Data systems that are based on Spark, are also constrained by these. We introduce SpatialIgnite, a Big Spatial Data system that we have developed based on Apache Ignite. We investigate the present status of the Big Spatial Data systems by conducting a comprehensive feature analysis and performance evaluation of a few representative systems with our benchmark. Our study shows that SpatialIgnite performs better than Hadoop and Spark based systems that we have evaluated.

Original languageEnglish
Title of host publicationProceedings of the 7th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2018
PublisherAssociation for Computing Machinery, Inc
Pages1-9
Number of pages9
ISBN (Electronic)9781450360418
DOIs
StatePublished - 6 Nov 2018
Externally publishedYes
Event7th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2018 - Seattle, United States
Duration: 6 Nov 20186 Nov 2018

Publication series

NameProceedings of the 7th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2018

Conference

Conference7th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2018
Country/TerritoryUnited States
CitySeattle
Period6/11/186/11/18

Keywords

  • Benchmark
  • Big Spatial Data
  • Hadoop
  • Ignite
  • In-Memory
  • Performance Evaluation
  • Spark

Fingerprint

Dive into the research topics of 'A performance study of big spatial data systems'. Together they form a unique fingerprint.

Cite this