Abstract: Read mapping, which maps billions of reads to a reference DNA sequence, is a significant performance bottleneck in genomic analysis. Current read-mapping accelerators are bounded primarily by intensive, random memory accesses to huge datasets. Near-data processing (NDP) infrastructures promise extremely high bandwidth, but existing frameworks fail to reach this potential because of poor locality and high redundancy. Our idea is to introduce prediction, based on the insight that candidate mapping positions become predictable when the reference is organized in coarse-grained slices. We present GEM (Genomic Memory), an ultra-efficient near-memory accelerator for read mapping. GEM adopts a novel data-centric framework, named dividing-and-predictive-scattering (DPS), which synthesizes seed-existence information to predict the target mapping locations and thereby reduce memory access redundancy. During preparation, DPS divides the reference into coarse-grained slices and creates predictive filters that assess the likelihood of a read belonging to each slice. During mapping, DPS predicts the candidate slices and scatters each read to considerably fewer slices than it would without prediction. By employing small, highly accurate on-chip SRAM-based predictors, DPS minimizes unnecessary DRAM accesses and data movement from remote memory. In essence, DPS trades pre-seeding predictors for localized access patterns and low redundancy, achieving high throughput for data-intensive applications. We implement GEM by integrating coarse-grained reconfigurable architectures (CGRAs) in the logic layer of a 3D-stacked DRAM infrastructure, using its large number of banks as slices. GEM leverages CGRAs for their flexibility in supporting various algorithms tailored to different datasets. Bloom filters are used for slice prediction, providing an error rate below 1%. Evaluation results demonstrate that GEM reduces memory requests by 95% and alignments by 87%, achieving throughput improvements of 15.3× and 11.0× over compute-centric and broadcast-based baselines, respectively, on the same NDP platform. Overall, GEM achieves a 3.5× throughput improvement and a 2.1× energy-efficiency improvement over state-of-the-art ASIC accelerators.
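The two DPS phases described in the abstract can be illustrated with a small software sketch. This is not GEM's hardware design or the authors' algorithm: the class `BloomFilter`, the functions `seeds`, `build_slice_filters`, and `predict_slices`, and the parameters `slice_len`, `k`, `num_bits`, `num_hashes`, and `min_fraction` are all hypothetical names and values chosen for illustration. The sketch assumes seeds are plain overlapping k-mers and that a slice is a candidate when a chosen fraction of a read's seeds hit its filter.

```python
import hashlib
import random


class BloomFilter:
    """Fixed-size Bloom filter over strings (illustrative only)."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Double hashing: derive num_hashes bit positions from one digest.
        digest = hashlib.blake2b(item.encode(), digest_size=16).digest()
        h1 = int.from_bytes(digest[:8], "little")
        h2 = int.from_bytes(digest[8:], "little") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(item))


def seeds(sequence, k):
    """Return all overlapping k-mers of a sequence (assumed seed definition)."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def build_slice_filters(reference, slice_len, k, num_bits=1 << 16, num_hashes=4):
    """Preparation: divide the reference into slices, one filter per slice."""
    filters = []
    for start in range(0, len(reference), slice_len):
        # Extend each slice by k - 1 bases so seeds straddling a
        # slice boundary are still indexed in the earlier slice.
        chunk = reference[start:start + slice_len + k - 1]
        bf = BloomFilter(num_bits, num_hashes)
        for seed in seeds(chunk, k):
            bf.add(seed)
        filters.append(bf)
    return filters


def predict_slices(read, filters, k, min_fraction=0.8):
    """Mapping: return indices of slices where enough of the read's seeds hit."""
    read_seeds = seeds(read, k)
    if not read_seeds:
        return []
    needed = min_fraction * len(read_seeds)
    return [idx for idx, bf in enumerate(filters)
            if sum(s in bf for s in read_seeds) >= needed]


# Demo on a synthetic reference (made-up data, not from the paper).
reference = "".join(random.Random(0).choices("ACGT", k=4000))
slice_filters = build_slice_filters(reference, slice_len=1000, k=12)
read = reference[2100:2150]  # error-free read taken from slice 2
candidates = predict_slices(read, slice_filters, k=12)
print("candidate slices:", candidates)  # slice 2 is always among them
```

Because a Bloom filter has no false negatives, the slice a read was drawn from is always in the candidate list for an error-free read that lies inside one slice; false positives can only add extra slices. In this sketch, a read that straddles a slice boundary would need a larger overlap or a lower `min_fraction` to be predicted reliably.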

GEM: Ultra-Efficient Near-Memory Reconfigurable Acceleration for Read Mapping by Dividing and Predictive Scattering