Aligner-D: Leveraging In-DRAM Computing to Accelerate DNA Short Read Alignment
Fan Zhang,Shaahin Angizi,Jiao Sun,Wei Zhang,Deliang Fan
DOI: https://doi.org/10.1109/jetcas.2023.3241545
IF: 5.877
2023-01-01
IEEE Journal on Emerging and Selected Topics in Circuits and Systems
Abstract:DNA short read alignment task has become a major sequential bottleneck to humongous amounts of data generated by next-generation sequencing platforms. In this paper, an energy-efficient and high-throughput Processing-in-Memory (PIM) accelerator based on DRAM (named Aligner-D) is presented to execute DNA short-read alignment with the state-of-the-art BWT alignment algorithm. We first present the PIM design that utilizes DRAM’s internal high parallelism and throughput. It converts each DRAM array to a potent processing unit for alignment tasks. The proposed Aligner-D can efficiently execute the bulk bit-wise XNOR-based matching operation required by the alignment task with only 3-transistor/col overhead. We then introduce a highly parallel and customized read alignment algorithm based on BWT that supports both exact and inexact match tasks. Next, we present how to map the correlated data of the alignment task to utilize the parallelism from both new hardware and algorithm maximumly. The experimental results demonstrate that Aligner-D obtains $\sim 4\times $ , $\sim 2.45\times $ , $\sim 3.26\times $ , and $\sim 1.65\times $ improvement, respectively, compared with other in-memory computing platforms: Ambit (Seshadri et al., 2017), DRISA-1T1C (Li et al., 2017), DRISA-3T1C (Li et al., 2017), and ReDRAM (Angizi and Fan, 2019). As for DNA short read alignment, Aligner-D boosts the alignment throughput per Watt by $\sim 20104\times $ , $\sim 3522\times $ , $\sim 927\times $ , $\sim 88\times $ , $\sim 5.28\times $ , and $\sim 2.34\times $ , over ReCAM, CPU, GPU, FPGA, Ambit, and DRISA, respectively.