Twelve years of SAMtools and BCFtools

Petr Danecek, James K Bonfield, Jennifer Liddle, John Marshall, Valeriu Ohan, Martin O Pollard, Andrew Whitwham, Thomas Keane, Shane A McCarthy, Robert M Davies, Heng Li
DOI: https://doi.org/10.1093/gigascience/giab008
IF: 7.658
2021-01-29
GigaScience
Abstract: Background: SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. They include tools for file format conversion and manipulation, sorting, querying, statistics, variant calling, and effect analysis amongst other methods. Findings: The first version appeared online 12 years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. Conclusion: Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed >1 million times via Bioconda. The source code and documentation are available from https://www.htslib.org.
multidisciplinary sciences