TASmania: A bacterial Toxin-Antitoxin Systems database

Hatice Akarsu, Patricia Bordes, Moise Mansour, Donna-Joe Bigot, Pierre Genevaux, Laurent Falquet
DOI: https://doi.org/10.1371/journal.pcbi.1006946
2019-04-25
PLoS Computational Biology
Abstract: Bacterial toxin-antitoxin systems (TAS) are involved in key biological functions, including plasmid maintenance, defense against phages, persistence and virulence. They are found in nearly all phyla and are classified into six types based on the mode of inactivation of the toxin, with type II TAS being the best characterized so far. We have developed a new in silico discovery pipeline named TASmania, which mines the >41K assemblies of the EnsemblBacteria database for known and uncharacterized protein components of type I to IV TAS loci. Our pipeline annotates the proteins based on a list of curated HMMs, which leads to >2×10^6 candidate loci, including orphan toxins and antitoxins, and organises the candidates in pseudo-operon structures in order to identify new TAS candidates based on a guilt-by-association strategy. In addition, we classify the two-component TAS with an unsupervised method on top of the pseudo-operon (pop) gene structures, leading to 1567 "popTA" models that offer a more robust classification of the TA families. These results give valuable clues for understanding the toxin/antitoxin modular structures and the TAS phylum specificities. Preliminary in vivo work confirmed six putative new hits in Mycobacterium tuberculosis as promising candidates. The TASmania database is available at https://shiny.bioinformatics.unibe.ch/apps/tasmania/.
TASmania offers an extensive annotation of TA loci in a very large database of bacterial genomes, which represents a resource of crucial importance for the microbiology community. TASmania supports i) the discovery of new TA families; ii) the design of a robust experimental strategy by taking into account potential interference in trans; iii) the comparative analysis of TA loci content, phylogeny and/or phenotypes (pathogenicity, persistence, stress resistance, associated host types) by providing a vast repertoire of annotated assemblies. Our database contains the TA annotations of a given strain mapped not only to its core genome but also to its plasmids, whenever applicable.
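The pseudo-operon and guilt-by-association ideas mentioned in the abstract can be illustrated with a minimal sketch. This is not the TASmania implementation: the `Gene` record, the "toxin"/"antitoxin" labels, the same-strand rule, the 150 bp `max_gap` threshold and the restriction to two-gene groups are all assumptions chosen for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Gene:
    """Hypothetical gene record; field names are illustrative, not TASmania's schema."""
    gene_id: str
    start: int
    end: int
    strand: str                      # "+" or "-"
    hmm_hit: Optional[str] = None    # e.g. "toxin" or "antitoxin"; None if no HMM hit


def pseudo_operons(genes: List[Gene], max_gap: int = 150) -> List[List[Gene]]:
    """Group neighbouring genes on the same strand whose gap is at most max_gap bp.

    The 150 bp default is an assumed value, not taken from the paper.
    """
    groups: List[List[Gene]] = []
    for gene in sorted(genes, key=lambda g: g.start):
        last = groups[-1][-1] if groups else None
        if last and last.strand == gene.strand and gene.start - last.end <= max_gap:
            groups[-1].append(gene)
        else:
            groups.append([gene])
    return groups


def guilt_by_association(groups: List[List[Gene]]) -> List[str]:
    """Return IDs of unannotated genes paired with an HMM hit in a two-gene group."""
    candidates = []
    for group in groups:
        if len(group) != 2:
            continue
        first, second = group
        if first.hmm_hit and not second.hmm_hit:
            candidates.append(second.gene_id)
        elif second.hmm_hit and not first.hmm_hit:
            candidates.append(first.gene_id)
    return candidates


# Toy input: two putative pairs and one isolated gene.
genes = [
    Gene("g1", 100, 400, "+", "toxin"),
    Gene("g2", 420, 700, "+"),
    Gene("g3", 2000, 2500, "-"),
    Gene("g4", 2510, 2900, "-", "antitoxin"),
    Gene("g5", 5000, 5600, "+"),
]
print(guilt_by_association(pseudo_operons(genes)))  # ['g2', 'g3']
```

In this toy input, `g2` and `g3` are flagged because each sits next to an annotated partner, while the isolated gene `g5` is ignored. A real pipeline would also need to handle overlapping genes, contig boundaries and groups of more than two genes.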
biochemical research methods, mathematical & computational biology