Simba: spatial in-memory big data analysis.

Dong Xie,Feifei Li,Bin Yao,Gefei Li,Zhongpu Chen,Liang Zhou,Minyi Guo
DOI: https://doi.org/10.1145/2996913.2996935
2016-01-01
Abstract:We present the Simba (<u>S</u>patial <u>I</u>n-Memory <u>B</u>ig data <u>A</u>nalytics) system, which offers scalable and efficient in-memory spatial query processing and analytics for big spatial data. Simba natively extends the Spark SQL engine to support rich spatial queries and analytics through both SQL and DataFrame API. It enables the construction of indexes over RDDs inside the engine in order to work with big spatial data and complex spatial operations. Simba also comes with an effective query optimizer, which leverages its indexes and novel spatial-aware optimizations, to achieve both low latency and high throughput in big spatial data analysis. This demonstration proposal describes key ideas in the design of Simba, and presents a demonstration plan.
What problem does this paper attempt to address?