AFITbin: a metagenomic contig binning method using aggregate l-mer frequency based on initial and terminal nucleotides

Amin Darabi, Sayeh Sobhani, Rosa Aghdam, Changiz Eslahchi
DOI: https://doi.org/10.1186/s12859-024-05859-7
IF: 3.307
2024-07-19
BMC Bioinformatics
Abstract: Using next-generation sequencing technologies, scientists can sequence complex microbial communities directly from the environment. The study of metagenomics has yielded significant insights into the structure, diversity, and ecology of microbial communities. A crucial step in metagenomic analysis is the assembly of reads into longer contigs, which are then binned into groups of contigs that correspond to different species in the metagenomic sample. These contigs must be organized into operational taxonomic units (OTUs) for further taxonomic profiling and functional analysis. For binning, which is synonymous with the clustering of OTUs, the tetra-nucleotide frequency (TNF) is typically used as a compositional feature for each OTU.
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology
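
To make the tetra-nucleotide frequency (TNF) mentioned in the abstract concrete, the following is a minimal sketch of how such a compositional feature could be computed for one contig. It is a plain 4-mer frequency vector, not the aggregate l-mer feature proposed in the paper; the function name `tnf` and the example sequence are hypothetical, and the choice to skip windows containing characters other than A, C, G, T is an assumption.

```python
from collections import Counter
from itertools import product


def tnf(sequence, k=4):
    """Return normalized k-mer frequencies for one contig (k=4 gives TNF).

    Assumption: windows containing anything other than A, C, G, T
    (for example N) are ignored rather than imputed.
    """
    seq = sequence.upper()
    # All 4**k possible k-mers over the DNA alphabet, in a fixed order.
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only valid k-mers contribute to the normalization.
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return {m: 0.0 for m in kmers}
    return {m: counts[m] / total for m in kmers}


# Hypothetical toy contig, for illustration only.
profile = tnf("ACGTACGTAC")
```

Each contig is thereby mapped to a 256-dimensional vector that a clustering method can group. Binning tools often merge reverse-complementary k-mers to reduce the dimension; that step is omitted here for brevity.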