Forensic efficiency evaluation of a mtDNA whole genome sequencing system constructed with long fragment amplification strategy on DNA nanoball sequencing platform
Man Chen,Chong Chen,Ning Li,Yuerong Su,Wei Cui,Yan Huang,Meiming Cai,Bofeng Zhu
DOI: https://doi.org/10.1016/j.fsigen.2024.103126
IF: 4.453
2024-08-26
Forensic Science International Genetics
Abstract:Mitochondrial DNA (mtDNA) is an important genetic marker for high copy number DNA, degraded biological sample identification, maternal pedigree tracing, and population genetic structure study owing to its characteristics of high copy number, anti-degradable ring structure, and maternal inheritance. Whole mtDNA genome sequencing is an optimal method for the analysis of mtDNA polymorphism and heterogeneity because it allows for the comprehensive use of maternal genetic information. However, because of lacking quantitative evaluations for sequencing data, the scientific interpretation standards for mtDNA sequencing results of the previously used sequencing systems are often different, and false positive or false negative results are prone to occur when faced with the interference of nuclear genomic DNA, or the heterogeneity of mtDNA sequence and structure. In this study, we evaluated a novel mtDNA whole genome sequencing system using long fragment amplification strategy on the DNA nanoball (DNB) sequencing platform. This system demonstrated high sequencing quality and specific mtDNA sequencing efficiencies on positive control DNA and FTA bloodstain samples, as the average Q20 and Q30 values of the corresponding samples were 97.17% and 91.93%; 97.37% and 92.48%, respectively. The mean mapping percentages for the reference sequences of whole genome DNA (wgDNA), mtDNA, and nuclear genomic DNA (ngDNA) in the corresponding samples were 99.98%, 99.97%, 0.03%, and 99.91%, 99.40%, 0.60%, respectively. The average error calling rates for the bases A, C, G, and T of the whole mtDNA genome were 0.2519%, 0.2550%, 0.2906%, and 0.2392%, respectively. The efficacy of heteroplasmy identification was assessed using a set of theoretical sites with predetermined rates. These sites were created by combining the samples with known mtDNA haplotypes in certain proportions. The absolute errors between observed and theoretical heteroplasmy values were 89.59%, 74.68%, 50.20%, 12.65%, 8.31%, and 4.85%, while the theoretical heteroplasmy values were 5%, 10%, 20%, 80%, 90%, and 95%, respectively. The absolute error exhibited relative stability when the mtDNA sequencing depth exceeded 500×. Furthermore, the system sequencing efficiency also was confirmed among different kinds of samples, and these samples included natural samples (e.g., peripheral blood samples preserved on FTA cards for 2 and 11 years, and on filter paper for 6 and 9 years), degraded samples, sensitivity samples, samples derived from various bodily fluids, and maternal pedigree samples. In summary, the whole mtDNA genome sequencing system used for forensic identification demonstrated high performance in analyzing mtDNA sequence information, and showed significant prospects for forensic application and maternal genetic research.
genetics & heredity,medicine, legal