Abstract: As Internet-based services and cloud-hosted services grow in number and popularity, they need a more powerful back-end storage system to support them. At present, it is very difficult, if not impossible, to implement a distributed storage system that satisfies every desired property at once. Research therefore focuses on relaxing particular properties in order to design different distributed storage solutions for different usage scenarios. Economic big data requires, at a minimum, high storage efficiency and fast retrieval. The large number of small files and the diversity of file types make storing and retrieving economic big data a severe challenge. This paper addresses the application requirements of cross-modal analysis of economic big data. Based on the sources and characteristics of economic big data, it analyzes the data types and designs the database storage architecture and data storage structure. Taking into account the spatial, temporal, and semantic characteristics of economic big data, this paper proposes a unified coding method based on a multilevel spatiotemporal partitioning strategy that combines Geohash and Hilbert coding with spatiotemporal semantic constraints. A prototype system implementing data storage management functions was built on MongoDB and used to verify the performance of the proposed multilevel partition algorithm. Wiener distributed storage, based on the principle of the Wiener filter, is used to store the workload of each distributed storage window in a distributed manner. The approach targets specific types of workloads: according to its periodicity, a workload is divided into distributed storage windows of a specific duration. At the beginning of each distributed storage window, distributed storage is assigned for the next distributed storage window. Experiments and tests verify the proposed distributed storage strategy and show that the Wiener distributed storage solution can save platform resources and configuration costs while still meeting the Service Level Agreement (SLA).
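To make the idea of a unified spatiotemporal code more concrete, the following is a minimal sketch and not the paper's actual coding method. It implements the standard Geohash encoding and prefixes it with an hourly time bucket to form a single key. The function names, the hourly bucket, and the `_` separator are assumptions made for illustration; the Hilbert component and the semantic constraints mentioned in the abstract are not modelled here.

```python
from datetime import datetime, timezone

# Standard Geohash base-32 alphabet (omits a, i, l, o).
BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"


def geohash_encode(lat: float, lon: float, precision: int = 8) -> str:
    """Encode a latitude/longitude pair as a Geohash string."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars = []
    bits = 0
    bit_count = 0
    use_lon = True  # bits alternate, starting with longitude
    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits = (bits << 1) | 1
                lon_lo = mid
            else:
                bits <<= 1
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits = (bits << 1) | 1
                lat_lo = mid
            else:
                bits <<= 1
                lat_hi = mid
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:  # every 5 bits become one base-32 character
            chars.append(BASE32[bits])
            bits = 0
            bit_count = 0
    return "".join(chars)


def spatiotemporal_key(lat: float, lon: float, ts: datetime,
                       precision: int = 6) -> str:
    """Hypothetical unified key: hourly UTC time bucket + Geohash cell."""
    bucket = ts.astimezone(timezone.utc).strftime("%Y%m%d%H")
    return f"{bucket}_{geohash_encode(lat, lon, precision)}"


if __name__ == "__main__":
    when = datetime(2020, 1, 2, 3, 4, tzinfo=timezone.utc)
    print(spatiotemporal_key(42.6, -5.6, when, precision=5))
```

Because a longer Geohash always extends a shorter one for the same point, records in the same cell and time bucket share a key prefix, which is the property that makes such keys convenient for range scans in a document store.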

Distributed Storage Strategy and Visual Analysis for Economic Big Data
