Copy Number Variation Detection Based on Constraint Least Squares

Xiaopu Wang,Xueqin Wang,Aijun Zhang,Canhong Wen
DOI: https://doi.org/10.4310/23-sii814
2024-01-01
Statistics and Its Interface
Abstract:Copy number variations (CNVs) are a form of structural variation of a DNA sequence, including amplification and deletion of a particular DNA segment on chromosomes. Due to the huge amount of data in every DNA sequence, there is a great need for a computationally fast algorithm that accurately identifies CNVs. In this paper, we formulate the detection of CNVs as a constraint least squares problem and show that circular binary segmentation is a greedy ap-proach to solving this problem. To solve this problem with high accuracy and efficiency, we first derived a necessary op-timality condition for its solution based on the alternating minimization technique and then developed a computation-ally efficient algorithm named AMIAS. The performance of our method was tested on both simulated data and two real -world applications using genomic data from diagnosed pri-mal glioblastoma and the HapMap project. Our proposed method has competitive performance in identifying CNVs with high-throughput genotypic data.
What problem does this paper attempt to address?