MeG<SUP>2</SUP>: In-Memory Acceleration for Genome Graphs Analysis

Yu Huang,Long Zheng,Haifeng Liu,Zhuoran Zhou,Dan Chen,Pengcheng Yao,Qinggang Wang,Xiaofei Liao,Hai Jin
DOI: https://doi.org/10.1109/DAC56929.2023.10247956
2023-01-01
Abstract:Genome graphs analysis has emerged as an effective means to enable mapping DNA fragments (known as reads) to the reference genome. It replaces the traditional linear reference with a graph-based representation to augment the genetic variations and diversity information, significantly improving the quality of genotyping. The in-depth characterization of genome graphs analysis uncovers that it is bottlenecked by the irregular seed index access and the intensive alignment operation, stressing both the memory system and computing resources. Based on these observations, we propose MeG(2), a lightweight, commodity DRAM-compliant, processing-in-memory architecture to accelerate genome graphs analysis. MeG(2) is specifically integrated with the capabilities of both near-memory processing and bitwise in-situ computation. Specifically, MeG(2) leverages the low access latency of near-memory processing with the indexcentric offload mechanism to alleviate the irregular memory access in the seeding procedure, and harnesses the row-parallel capacity of in-situ computation with the distance-aware technique to exploit the intensive computational parallelism in the alignment process. Results show that MeG(2) outperforms the CPU-, GPU-, and ASIC-based genome graphs analysis solutions by 502x (30.2x), 272x (15.1x), and 5.5x (8.3x) for short (long) reads, while reducing energy consumption by 1628x (85.6x), 1443x (77.1x), and 7.8x (11.7x), respectively. We also demonstrate that MeG(2) offers significant improvements over existing PIM-based genome sequence analysis accelerators.
What problem does this paper attempt to address?