Abstract: The exponential growth of data exerts great pressure on hardware design for database systems. Memory-centric computing (MCC) architectures, which enable compute capabilities near or inside memory storage, show great potential for improving the efficiency of database operations through higher compute parallelism and reduced data movement. However, existing MCC architectures mainly focus on artificial intelligence (AI) computations, and those designed for database applications can run only a limited number of standalone queries such as SORT or JOIN, lacking efficient support for increasingly diverse and complex database workloads. For example, realizing a commercial recommendation engine on a database requires supporting workloads including, but not limited to, vector aggregation, convolution, and $N$-hop neighborhood computation. In this work, we develop a memristor-enabled memory-centric instruction-set architecture (MeMCISA) that aims to efficiently accelerate versatile workloads in modern database systems. MeMCISA features a scalable multi-bank memristor-based storage organization with near-memory circuitry and caches in each bank. An out-of-order (OoO) scheduling scheme is designed for MeMCISA based on a vector instruction set with four types of instructions (bit-level, element-level, vector-level, and control-level), combining memristor-enabled in-memory computing and near-memory computing to efficiently run workloads with varying computational kernels and data sizes. MeMCISA supports parallel instruction execution across different memristor banks as well as across different hardware modules within a memristor bank. Furthermore, we develop data dependency handling mechanisms to support vector dependency scenarios in MeMCISA that do not exist in conventional scalar-based instruction sets. A prototype MeMCISA is implemented in a 40nm CMOS technology with the necessary peripheral hardware, including an instruction buffer and an instruction scheduler.
To accurately study MeMCISA performance in real-world database systems, we create a software-hardware co-designed framework that integrates the reconfigurable MeMCISA prototype and supports end-to-end simulation of database workloads starting from raw software code. Based on this framework, we evaluate MeMCISA performance on standalone database queries as well as on complex database workloads from representative benchmarks including UniBench, neural collaborative filtering (NCF), and ResNet-18. Simulation results demonstrate that MeMCISA achieves speedups of up to 41.84× to 1767.70× over general-purpose processors (CPUs/GPUs).
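
The vector dependency scenarios mentioned above can be illustrated with a minimal sketch. It is not the MeMCISA scheduler: the instruction names, the `(bank, start, length)` operand encoding, and the greedy grouping policy are all assumptions made for illustration. The point it shows is that vector operands cover address ranges, so two instructions can conflict through a partial overlap, a case that scalar register names never produce.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical operand encoding: (bank, start address, length in elements).
Range = Tuple[int, int, int]


def overlaps(a: Optional[Range], b: Optional[Range]) -> bool:
    """True if both ranges exist, lie in the same bank, and share an address."""
    if a is None or b is None:
        return False
    return a[0] == b[0] and a[1] < b[1] + b[2] and b[1] < a[1] + a[2]


@dataclass
class VecInstr:
    name: str                # illustrative mnemonic, not a MeMCISA opcode
    reads: List[Range]
    write: Optional[Range]


def depends_on(later: VecInstr, earlier: VecInstr) -> bool:
    """Range-based hazard check between two instructions in program order."""
    raw = any(overlaps(r, earlier.write) for r in later.reads)
    waw = overlaps(later.write, earlier.write)
    war = any(overlaps(later.write, r) for r in earlier.reads)
    return raw or waw or war


def issue_groups(program: List[VecInstr]) -> List[List[str]]:
    """Assign each instruction to the earliest group after all its dependencies.

    Instructions in the same group have no hazards among themselves and
    could, in principle, be issued in parallel.
    """
    levels: List[int] = []
    for i, ins in enumerate(program):
        level = 0
        for j in range(i):
            if depends_on(ins, program[j]):
                level = max(level, levels[j] + 1)
        levels.append(level)
    groups: List[List[str]] = [[] for _ in range(max(levels, default=-1) + 1)]
    for ins, level in zip(program, levels):
        groups[level].append(ins.name)
    return groups


program = [
    VecInstr("vadd", reads=[(0, 0, 8), (0, 8, 8)], write=(0, 16, 8)),
    VecInstr("vmul", reads=[(1, 0, 8), (1, 8, 8)], write=(1, 16, 8)),
    # Reads addresses 20..27 in bank 0, partially overlapping vadd's output 16..23.
    VecInstr("vsum", reads=[(0, 20, 8)], write=(0, 32, 1)),
]
groups = issue_groups(program)
```

In this example `vadd` and `vmul` touch different banks and land in the same group, while `vsum` must wait for `vadd` because its input range partially overlaps `vadd`'s output range.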

Investigating Memory Failure Prediction Across CPU Architectures

Exploring Error Bits for Memory Failure Prediction: An In-Depth Correlative Study

Highly Efficient Memory Failure Prediction using Mcelog-based Data Mining and Machine Learning

On Workload-Aware DRAM Failure Prediction in Large-Scale Data Centers.

DRAM Failure Prediction in AIOps: Empirical Evaluation, Challenges and Opportunities

A Rising Tide Lifts All Boats: How Memory Error Prediction and Prevention Can Help with Virtualized System Longevity

EXPERT: EXPloiting DRAM ERror Types to Improve the Effective Forecasting Coverage in the Field.

System-level Hardware Failure Prediction Using Deep Learning

Precise Contention-Aware Performance Prediction on Virtualized Multicore System.

Towards CPU Performance Prediction: New Challenge Benchmark Dataset and Novel Approach

Characterizing Application Memory Error Vulnerability to Optimize Datacenter Cost Via Heterogeneous-Reliability Memory

Predicting DRAM-Caused Node Unavailability in Hyper-Scale Clouds

Enabling Effective Error Mitigation in Memory Chips That Use On-Die Error-Correcting Codes

Reinforcement Learning-based Adaptive Mitigation of Uncorrected DRAM Errors in the Field

An Experimental Evaluation of Machine Learning Training on a Real Processing-in-Memory System

A Memory RAS System Design and Engineering Practice in High Temperature Ambient Data Center

MeMCISA: Memristor-Enabled Memory-Centric Instruction-Set Architecture for Database Workloads

Overcoming Memory Constraint for Improved Target Classification Performance on Embedded Deep Learning Systems

Software-Based Detecting and Recovering from ECC-Memory Faults

A Performance Evaluation of DRAM Access for In-Memory Databases