Abstract

Background: Multi-sample comparison is common in cancer genomics studies. With next-generation sequencing (NGS), a mutation's status in a given sample is measured by the number of reads supporting the mutant or wildtype allele. When no mutant reads are detected, the result may be either a true negative or a false negative caused by an insufficient number of reads, that is, inadequate "coverage". To minimize the chance of false negatives, the mutation status should be considered "unknown" rather than "negative" when coverage is too low. There is no established method for determining the coverage threshold that separates negative from unknown statuses. A common solution is to apply a universal minimum coverage (UMC). However, this approach relies on an arbitrarily chosen threshold and does not take into account the mutations' relative abundances, which can vary dramatically by mutation type. The result can be misclassification between negative and unknown statuses.

Methods: We propose an adaptive mutation-specific negative (MSN) method to improve the discrimination between negative and unknown mutation statuses. For a specific mutation, a non-positive sample is compared with every known positive sample to test the null hypothesis that the two samples contain the same frequency of mutant reads. The non-positive sample is called "negative" only when this null hypothesis is rejected against all known positive samples; otherwise, its status is "unknown".

Results: We first compared the performance of the MSN and UMC methods on a simulated dataset containing varying tumor cell fractions. Only the MSN method appropriately assigned negative statuses for samples with both high and low tumor cell fractions. When evaluated on a real dual-platform single-cell sequencing dataset, the MSN method not only provided more accurate assessments of negative status but also yielded three times more available data after excluding the "unknown" statuses, compared with the UMC method.

Conclusions: We developed a new adaptive method for distinguishing unknown from negative statuses in multi-sample comparison NGS data. The method can provide more accurate negative statuses than the conventional UMC method and generates substantially more available data by reducing unnecessary "unknown" calls.
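The decision rule described in the Methods can be illustrated with a minimal sketch. The abstract does not name the statistical test, so the one-sided Fisher exact test used here is an assumption, as are the significance level `alpha=0.05`, the UMC cutoff `min_coverage=10`, and all function names, which are hypothetical. No multiple-testing correction is applied. This is not the authors' implementation.

```python
from math import comb


def p_same_frequency(mut_a, cov_a, mut_b, cov_b):
    """One-sided Fisher exact p-value (assumed test, not named in the abstract).

    Under the null that samples A and B share one mutant-read frequency,
    this is the probability of seeing mut_a or fewer mutant reads in A.
    """
    total_mut = mut_a + mut_b
    total_cov = cov_a + cov_b
    # Hypergeometric lower tail: k mutant reads fall in sample A, k = 0..mut_a.
    tail = sum(
        comb(total_mut, k) * comb(total_cov - total_mut, cov_a - k)
        for k in range(mut_a + 1)
    )
    return tail / comb(total_cov, cov_a)


def classify_umc(cov, min_coverage=10):
    """Universal minimum coverage: one fixed cutoff for every mutation."""
    return "negative" if cov >= min_coverage else "unknown"


def classify_msn(mut, cov, positives, alpha=0.05):
    """Mutation-specific rule: 'negative' only if the null is rejected
    against every known positive sample, given as (mutant_reads, coverage)."""
    if not positives:
        return "unknown"  # assumption: nothing to compare against
    rejected_all = all(
        p_same_frequency(mut, cov, pos_mut, pos_cov) < alpha
        for pos_mut, pos_cov in positives
    )
    return "negative" if rejected_all else "unknown"


if __name__ == "__main__":
    # A sample with 0 mutant reads at coverage 10 passes the UMC cutoff either way.
    print(classify_umc(10))                   # negative
    # High-abundance mutation (50/100 in the positive sample): 10 reads suffice.
    print(classify_msn(0, 10, [(50, 100)]))   # negative
    # Low-abundance mutation (5/100): 10 reads cannot rule it out.
    print(classify_msn(0, 10, [(5, 100)]))    # unknown
    # The same low-abundance mutation at coverage 100 can be ruled out.
    print(classify_msn(0, 100, [(5, 100)]))   # negative
```

In this toy setting the fixed cutoff treats both mutations identically, while the mutation-specific rule asks for more coverage when the known positive samples carry the mutation at low frequency.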

Application of Next-Generation Sequencing in the Detection of Low-Abundance Mutations.

Next Generation Sequencing Technology and Its Application in Detecting Gene Mutations

High efficiency error suppression for accurate detection of low-frequency variants

Detection of ultra-rare mutations by next-generation sequencing

Recent Advances in Biosensors and Sequencing Technologies for the Detection of Mutations

LFMD: a new likelihood-based method to detect low-frequency mutations without molecular tags

Ultrasensitive and high-efficiency screen of de novo low-frequency mutations by o2n-seq

Application of Next Generation Sequencing in Laboratory Medicine

Detection of low-frequency mutations in clinical samples by increasing mutation abundance via the excision of wild-type sequences

A Novel Next-Generation Sequencing–Based Approach for Concurrent Detection of Mitochondrial DNA Copy Number and Mutation

An Adaptive Method of Defining Negative Mutation Status for Multi-Sample Comparison Using Next-Generation Sequencing

Application of Nanotechnology for Sensitive Detection of Low-Abundance Single-Nucleotide Variations in Genomic DNA: A Review

Single-molecule, Quantitative Detection of Low-Abundance Somatic Mutations by High-Throughput Sequencing

Enhanced Error Suppression for Accurate Detection of Low‐Frequency Variants

Targeted Next-Generation Sequencing As a Comprehensive Test for Mendelian Diseases: a Cohort Diagnostic Study.

Detecting somatic point mutations in cancer genome sequencing data: a comparison of mutation callers

[Application of Next Generation Sequencing for AML/MDS Diagnosis and Treatment].

Assessment of the Clinical Application of Detecting EGFR, KRAS, PIK3CA and BRAF Mutations in Patients with Non-Small Cell Lung Cancer Using Next-Generation Sequencing

Applying next-generation sequencing to unravel the mutational landscape in viral quasispecies

The status quo and future prospects of the next generation sequencing technologies in clinical diagnostics

17. Application of Next-Generation Sequencing in Diagnosis of Patients with MDS and AML