Exploring nanopore direct sequencing performance of forensic STRs, SNPs, InDels, and DNA methylation markers in a single assay

Desiree D S H de Bruin,Martin A Haagmans,Kristiaan J van der Gaag,Jerry Hoogenboom,Natalie E C Weiler,Niccoló Tesi,Alex Salazar,Yaran Zhang,Henne Holstege,Marcel Reinders,Amade Aouatef M'charek,Titia Sijen,Peter Henneman
DOI: https://doi.org/10.1016/j.fsigen.2024.103154
Abstract:Introduction: The field of forensic DNA analysis has undergone rapid advancements in recent decades. The integration of massively parallel sequencing (MPS) has notably expanded the forensic toolkit, moving beyond identity matching to predicting phenotypic traits and biogeographical ancestry. This shift is of particular significance in cases where conventional DNA profiling fails to identify a single suspect. Supplementing forensic analyses with estimated biological age may be valuable but involves a complex and time-consuming DNA methylation analysis. This study explores and validates the performance of a comprehensive forensic third-generation sequencing assay utilizing Oxford Nanopore Technologies (ONT) in an adaptive and direct sequencing approach. We incorporated the most widely used forensic markers, i.e., STRs, SNPs, InDels, mitochondrial DNA (mtDNA), and two methylation-based clock classifiers, thereby combining forensic genetic and epigenetic analysis in one single workflow. Methods and results: In our investigation, DNA from six anonymous individuals was sequenced using the ONT standard adaptive direct sequencing approach, reaching a mean percentage of on-target reads ranging from 6.6 % to 7.7 % per sample. ONT data was compared to standard MPS data and Illumina EPIC DNA methylation profiles. Basecalling employed recommended ONT software packages. TREAT was used for ONT-based analysis of autosomal and Y-chromosome STRs, achieving 90-92 % correct calls depending on allelic read depth thresholds. InDel analyses for two lower-quality samples proved challenging due to inadequate read depth, while the remaining four samples significantly contributed to the observed percentage markers (60.9 %) and correct calls (97.8 %). SNP analysis achieved a 98 % call rate, with only two mismatches and two missed alleles. ONT-generated DNA methylation data demonstrated Pearson's correlation coefficients with EPIC data ranging from 0.67 to 0.97 for Horvath's clock. Additional age-associated markers exhibited Pearson's correlation coefficients with chronological age between 0.14 (ELOVL2) and 0.96 (FHL2) at read depths of <30 and <20, respectively. Despite excluding mtDNA from our targeted sequencing approach, adaptive proof-reading fragments covered the complete mtDNA with an average read depth of 21-72, showing 100 % concordance with reference data. Discussion: Our exploratory study using ONT adaptive sequencing for conventional forensic and age associated DNA methylation markers showed high sequencing accuracy for a significant number of markers, showcasing ONT as a promising (epi)genetic forensic method. Future studies must address three critical aspects: determining clear quantity and quality measures and detection thresholds for accuracy, optimizing input DNA quantity for forensic casework expectations, and addressing ethical considerations associated with phenotype and ancestry analysis to prevent ethnic biases.
What problem does this paper attempt to address?