Comprehensive evaluation and characterisation of short read general-purpose structural variant calling software

Daniel L. Cameron,Leon Di Stefano,Anthony T. Papenfuss
DOI: https://doi.org/10.1038/s41467-019-11146-4
IF: 16.6
2019-07-19
Nature Communications
Abstract:In recent years, many software packages for identifying structural variants (SVs) using whole-genome sequencing data have been released. When published, a new method is commonly compared with those already available, but this tends to be selective and incomplete. The lack of comprehensive benchmarking of methods presents challenges for users in selecting methods and for developers in understanding algorithm behaviours and limitations. Here we report the comprehensive evaluation of 10 SV callers, selected following a rigorous process and spanning the breadth of detection approaches, using high-quality reference cell lines, as well as simulations. Due to the nature of available truth sets, our focus is on general-purpose rather than somatic callers. We characterise the impact on performance of event size and type, sequencing characteristics, and genomic context, and analyse the efficacy of ensemble calling and calibration of variant quality scores. Finally, we provide recommendations for both users and methods developers.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the performance evaluation and comparison among different software packages when using short - read - length sequencing data to identify structural variants (SVs). Specifically: 1. **Lack of comprehensive benchmarking**: Although multiple software packages for identifying SVs have been released, these software are usually only compared with a few existing methods at the time of release, and such comparisons are often selective and incomplete. This poses challenges for users in choosing methods and for developers in understanding the behavior and limitations of algorithms. 2. **Performance differences among different SV callers**: Different SV callers adopt different detection methods, and their performance in handling different types of SV events, sequencing features, and genomic contexts varies significantly. By comprehensively evaluating 10 SV callers, the paper aims to reveal these differences and provide suggestions for users and developers. 3. **Performance of SV callers on different datasets**: The paper uses high - quality reference cell lines and simulated data to evaluate the performance of SV callers, especially their performance under different event sizes, types, sequencing parameters, and genomic contexts. 4. **Effectiveness of quality scores**: The paper also explores the effectiveness of the quality scores reported by SV callers and finds that the calls with the highest quality scores in most methods are usually false positives, and only Manta performs better in this regard. 5. **Effect of simple integrated calls**: The paper evaluates the effect of a simple integrated calling method based on multiple callers and finds that this method can improve performance in some cases, but the effect is highly dependent on the selected caller combinations. In summary, the main objective of this paper is to provide users with guidance on choosing appropriate SV callers and to provide developers with suggestions for improving algorithms through comprehensive benchmarking.