Accelerating Genome Analysis via Algorithm-Architecture Co-Design

Onur Mutlu, Can Firtina
2023-05-31
Abstract: High-throughput sequencing (HTS) technologies have revolutionized the field of genomics, enabling rapid and cost-effective genome analysis for various applications. However, the increasing volume of genomic data generated by HTS technologies presents significant challenges for computational techniques to effectively analyze genomes. To address these challenges, several algorithm-architecture co-design works have been proposed, targeting different steps of the genome analysis pipeline. These works explore emerging technologies to provide fast, accurate, and low-power genome analysis. This paper provides a brief review of the recent advancements in accelerating genome analysis, covering the opportunities and challenges associated with the acceleration of the key steps of the genome analysis pipeline. Our analysis highlights the importance of integrating multiple steps of genome analysis using suitable architectures to unlock significant performance improvements and reduce data movement and energy consumption. We conclude by emphasizing the need for novel strategies and techniques to address the growing demands of genomic data generation and analysis.
Hardware Architecture, Genomics
What problem does this paper attempt to address?
The paper examines how algorithm-architecture co-design can accelerate genome analysis. As High-Throughput Sequencing (HTS) technologies advance, the volume of genomic data has surged, straining genome analysis in terms of processing speed, accuracy, and energy consumption. To address these challenges, researchers have proposed jointly optimizing algorithms and hardware architectures to improve the performance and energy efficiency of key steps in the genome analysis workflow. Specifically, the paper covers several core components of the genome analysis pipeline:

1. **Basecalling**: Converting raw sequencing data into a sequence of genetic characters. For the raw electrical signals produced by nanopore sequencing, researchers use methods such as deep neural networks for basecalling, and improve efficiency through GPU acceleration, elimination of unnecessary computations, and Processing-in-Memory (PIM).
2. **Real-Time Genome Analysis**: Analyzing reads while sequencing is still in progress, in particular using adaptive sampling in nanopore sequencing, which can significantly reduce overall analysis time and cost. The paper reviews algorithms and software-hardware co-design solutions that aim to meet the throughput, noise tolerance, and latency requirements of real-time analysis.
3. **Read Mapping**: Identifying similarities and differences between genomic sequences, such as aligning reads to a reference genome. Researchers have optimized several steps of this process, including sketching, indexing, pre-alignment filtering, and sequence alignment, using algorithm-architecture co-design to reduce data movement overhead and unnecessary computations.
4. **Variant Calling**: Identifying genetic variations between an individual's genome and the reference genome. Researchers improve the speed and accuracy of variant calling by optimizing statistical and machine learning methods and by using hardware accelerators such as FPGAs or ASICs.

The paper emphasizes the importance of co-optimization at the algorithm and architecture levels, noting that this approach can significantly reduce data movement, eliminate unnecessary computations, and avoid processing irrelevant data in downstream analysis, thereby improving the performance and energy efficiency of the entire genome analysis pipeline. The paper also discusses the challenges of real-time genome analysis and how co-developing efficient algorithms and dedicated hardware can enable low-power, high-performance, and low-cost portable sequencing.
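
The idea behind pre-alignment filtering in the read mapping step can be illustrated with a small software sketch: a cheap check rejects clearly dissimilar candidate locations so that the expensive alignment runs only on the survivors. The example below is a minimal illustration, not the method of any specific work covered by the paper. It assumes a filter based on the q-gram lemma (two sequences within `max_edits` edits must share at least `len(read) - q + 1 - q * max_edits` q-grams), and the function names `passes_filter`, `edit_distance`, and `map_read` are hypothetical.

```python
def qgrams(seq, q):
    """Return the set of all length-q substrings of seq."""
    return {seq[i:i + q] for i in range(len(seq) - q + 1)}


def passes_filter(read, ref_segment, max_edits, q=4):
    """Cheap pre-alignment check based on the q-gram lemma (assumed filter).

    Each edit can destroy at most q of the read's q-grams, so a pair within
    max_edits edits must keep at least (len(read) - q + 1) - q * max_edits
    of them. Pairs below that bound cannot align and are rejected.
    """
    n_read_qgrams = len(read) - q + 1
    threshold = n_read_qgrams - q * max_edits
    if threshold <= 0:
        return True  # the bound is vacuous, so the pair must be aligned
    ref_set = qgrams(ref_segment, q)
    shared = sum(1 for i in range(n_read_qgrams) if read[i:i + q] in ref_set)
    return shared >= threshold


def edit_distance(a, b):
    """Expensive step: classic dynamic-programming edit distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # match or substitution
        prev = cur
    return prev[-1]


def map_read(read, candidates, max_edits, q=4):
    """Align only the candidates that survive the filter.

    candidates is a list of (position, reference_segment) pairs.
    Returns the accepted (position, distance) hits and the number of
    alignments that were actually computed.
    """
    hits, aligned = [], 0
    for pos, segment in candidates:
        if not passes_filter(read, segment, max_edits, q):
            continue  # skipped: no alignment work spent on this candidate
        aligned += 1
        dist = edit_distance(read, segment)
        if dist <= max_edits:
            hits.append((pos, dist))
    return hits, aligned


if __name__ == "__main__":
    read = "ACGTACGTTAGC"
    candidates = [
        (0, "ACGTACGTTAGC"),    # exact match
        (100, "ACGTACCTTAGC"),  # one substitution
        (200, "TTTTTTTTTTTT"),  # dissimilar, expected to be filtered out
    ]
    print(map_read(read, candidates, max_edits=1))
```

In this sketch the filter never discards a true match, but it may let some dissimilar pairs through, which the alignment step then rejects. Hardware filters of the kind the paper surveys pursue the same goal with different, more hardware-friendly checks.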