Fast all versus all genotype comparison using DNA/RNA sequencing data: method and workflow

Steven A. Eschrich, Xiaoqing Yu and Jamie K. Teer
DOI: https://doi.org/10.1186/s12859-023-05288-y
IF: 3.307
2023-04-25
BMC Bioinformatics
Abstract: Massively parallel sequencing involves many liquid handling steps, which introduce the possibility of sample swaps, mixing, and duplication. The unique profile of inherited variants in each human genome allows sample identity to be compared using sequence data. Comparing every sample against every other sample (all vs. all) both identifies mismatched samples and makes it possible to resolve swapped samples. However, the number of comparisons in an all vs. all analysis grows as the square of the number of samples, so efficiency becomes essential.
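To illustrate the quadratic growth described in the abstract, here is a minimal sketch of an all vs. all genotype comparison. It is not the authors' implementation: the function names, the 0/1/2 genotype encoding (with None for a missing call), and the sample data are all assumptions made for illustration.

```python
from itertools import combinations


def concordance(a, b):
    """Fraction of sites with identical calls, skipping sites missing in either sample.

    Genotypes are assumed to be encoded as 0, 1, or 2 alternate alleles,
    with None for a missing call (a hypothetical encoding).
    """
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return None  # no overlapping sites, so no score can be computed
    return sum(x == y for x, y in shared) / len(shared)


def all_vs_all(samples):
    """Score every unordered pair of samples: n * (n - 1) / 2 comparisons."""
    return {
        (s, t): concordance(samples[s], samples[t])
        for s, t in combinations(sorted(samples), 2)
    }


# Hypothetical genotype calls at six sites for three samples.
samples = {
    "tumor_A": [0, 1, 2, 1, 0, None],
    "normal_A": [0, 1, 2, 1, 0, 2],
    "normal_B": [2, 0, 1, 1, 2, 0],
}

scores = all_vs_all(samples)
for pair, score in scores.items():
    print(pair, score)
```

In this toy data, tumor_A and normal_A agree at every shared site, which is the pattern expected for samples from the same individual, while normal_B agrees with the others at only one site. With n samples the loop runs n(n - 1)/2 times, so 1,000 samples already require 499,500 pairwise comparisons, which is why the per-pair cost matters.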
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology