Abstract:With the increasing popularity of a large number of Internet-based services and a large number of services hosted on cloud platforms, a more powerful back-end storage system is needed to support these services. At present, it is very difficult or impossible to implement a distributed storage to meet all the above assumptions. Therefore, the focus of research is to limit different characteristics to design different distributed storage solutions to meet different usage scenarios. Economic big data should have the basic requirements of high storage efficiency and fast retrieval speed. The large number of small files and the diversity of file types make the storage and retrieval of economic big data face severe challenges. This paper is oriented to the application requirements of cross-modal analysis of economic big data. According to the source and characteristics of economic big data, the data types are analyzed and the database storage architecture and data storage structure of economic big data are designed. Taking into account the spatial, temporal, and semantic characteristics of economic big data, this paper proposes a unified coding method based on the spatiotemporal data multilevel division strategy combined with Geohash and Hilbert and spatiotemporal semantic constraints. A prototype system was constructed based on Mongo DB, and the performance of the multilevel partition algorithm proposed in this paper was verified by the prototype system based on the realization of data storage management functions. The Wiener distributed memory based on the principle of Wiener filter is used to store the workload of each workload distributed storage window in a distributed manner. For distributed storage workloads, this article adopts specific types of workloads. According to its periodicity, the workload is divided into distributed storage windows of specific duration. At the beginning of each distributed storage window, distributed storage is distributed to the next distributed storage window. Experiments and tests have verified the distributed storage strategy proposed in this article, which proves that the Wiener distributed storage solution can save platform resources and configuration costs while ensuring Service Level Agreement (SLA).

Query Optimization and Rebalancing Methods based on CMD.

Optimization Factor Analysis Of Large-Scale Join Queries On Different Platforms

Distributed High-Dimension Matrix Operation Optimization on Spark

Distributed Model Based on Data Partition and Load Balance Algorithm

Data Based Application Partitioning and Workload Balance in Distributed Environment

A Request Skew Aware Heterogeneous Distributed Storage System Based on Cassandra

An Optimized Learning-Based Directory Placement Policy with Two-Rounds Selection in Distributed File Systems

Effective Data Distribution And Reallocation Strategies For Fast Query Response In Distributed Query-Intensive Data Environments

New Distributed Spatial Query Optimization Approach by Using Query Analyzer

A Clustered Dwarf Structure to Speed Up Queries on Data Cubes

Load Rebalancing in Large-Scale Distributed File System

An Efficient and Compact Indexing Scheme for Large-Scale Data Store.

DynaHash: Efficient Data Rebalancing in Apache AsterixDB (Extended Version)

Energy-Aware Disk Storage Management: Online Approach with Application in DBMS

An HBase-Based Optimization Model for Distributed Medical Data Storage and Retrieval

Optimize Multidimensional Arrays Queries with Heterogeneous Replica Method

Distributed Storage Strategy and Visual Analysis for Economic Big Data

Optimizing Data Partition for Scaling out Nosql Cluster

Cost-Based Optimization Of Logical Partitions For A Query Workload In A Hadoop Data Warehouse

MCS-B: an Energy Efficient Storage System for Astronomical Observation Data Based on Logical Block Replacement Strategy

Coexistence of Multiple Partition Plan Based Physical Database Design.