Abstract:In most National DNA databases (NDNADB), only single source DNA profiles, and sometimes two-person DNA mixtures, can be searched provided a minimum number of loci (or alleles) is available. DNA profiles that do not meet these criteria (about 14% of the traces analyzed in Western Switzerland) can be compared locally with candidates upon request from police services, used for one-off search, or remain unused. With the advent of probabilistic genotyping (PG), such complex DNA profiles can be compared to those stored in NDNADB based on likelihood ratios (LRs). In this pilot study, traces of known contributors and casework DNA profiles were used to evaluate the performance of the DBLRTM "Search database" tool in conjunction with the Swiss NDNADB. First, 40 DNA mixtures (2 to 5 contributors) from 15 volunteers were prepared in the wet laboratory. They were deconvoluted with STRmixTM and compared to a database containing the DNA profiles of these 15 volunteers, along with 174,493 person DNA profiles from the Swiss NDNADB (ground-truth experiments). Using LR thresholds of 10 3 and 10 6 , sensitivity and specificity were respectively 90.0%/57.1% and 99.9%/100.0%. For the lower LR threshold, this resulted in 52 adventitious associations out of more than 24 million pairwise comparisons. Second, 160 DNA mixture profiles from casework (2 to 4 contributors) that had previously been locally compared were searched with DBLRTM using the same conditions as for phase 1. With the 10 3 LR threshold, 380 associations were retrieved: 194 of these corresponded to expected associations, as they were previously made through the local comparisons with known persons, and 186 were new. With the 10 6 LR threshold, 199 associations were recovered of which 180 were expected and 19 new. This demonstrates that even with complex DNA profiles (up to 4 contributors) all expected associations were retrieved with a limited number of candidates per trace. Database searches of complex DNA mixtures allow for the generation of leads early in an investigation for DNA profiles that might otherwise remain underutilized. Next steps for the possible integration of DBLRTM or similar software within an operational context will require discussions on legal, financial, and technical aspects among stakeholders.

Low LRs obtained from DNA mixtures: On calibration and discrimination performance of probabilistic genotyping software

A tale of two PG systems: A comparison of the two most widely used continuous probabilistic genotyping systems in the United States

Complex DNA mixture analysis in a forensic context: evaluating the probative value using a likelihood ratio model

On the limitations of probabilistic claims about the probative value of mixed DNA profile evidence

Statistical methods for discrimination of STR genotypes using high resolution melt curve data

A review of likelihood ratios in forensic science based on a critique of Stiffelman "No longer the Gold standard: Probabilistic genotyping is changing the nature of DNA evidence in criminal trials"

Extending the discussion on inconsistency in forensic decisions and results

An integrated approach to reduce the impact of minor allele frequency and linkage disequilibrium on variable importance measures for genome-wide data

Comments arising from WJ Thompson "Uncertainty in probabilistic genotyping of low template DNA A case study comparing STRmix and TrueAllele"

A collaborative study on the precision of the Markov chain Monte Carlo algorithms used for DNA profile interpretation

Searching national DNA databases with complex DNA profiles: an empirical study using probabilistic genotyping

A comparison of software for analysis of rare and common short tandem repeat (STR) variation using human genome sequences from clinical and population-based samples

An exploratory view into allelic drop‐out of sequenced autosomal STRs

Flexible Mixture Model Approaches That Accommodate Footprint Size Variability for Robust Detection of Balancing Selection

Encoding of low-quality DNA profiles as genotype probability matrices for improved profile comparisons, relatedness evaluation and database searches

On forensic likelihood ratios from low-coverage sequencing

Evidentiary evaluation of single cells renders highly informative forensic comparisons across multifarious admixtures

Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets

MLR-tagging: informative SNP selection for unphased genotypes based on multiple linear regression.

Using simulated microhaplotype genotyping data to evaluate the value of machine learning algorithms for inferring DNA mixture contributor numbers

American forensic DNA practitioners' opinion on activity level evaluative reporting