Estimating the Number of Essential Genes in Random Transposon Mutagenesis Libraries

Oliver Will, Michael A Jacobs
DOI: https://doi.org/10.48550/arXiv.q-bio/0608005
2006-08-03
Other Quantitative Biology
Abstract: Biologists use random transposon mutagenesis to construct knockout libraries for bacteria. Random mutagenesis offers cost and efficiency benefits over standard site-directed mutagenesis, but one can no longer ensure that every nonessential gene will appear in the library. In random libraries for haploid organisms, there is always a class of genes for which no knockout clones have been made, and each member of this class is either essential or nonessential. Statistical methods are therefore required to estimate the number of essential genes. Two groups of researchers, Blades and Broman, and Jacobs et al., independently and simultaneously developed methods to do this. Blades and Broman used a Gibbs sampler, and Jacobs et al. used a parametric bootstrap. We compare the performance of these two methods and find that both depend on having either an accurate probabilistic model for transposon insertion or a library with a large number of clones. At this point, we do not have sufficiently accurate probabilistic models, so we must build libraries that have at least five clones per open reading frame to accurately estimate the number of essential genes.
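The following minimal sketch illustrates why the class of genes without knockout clones mixes essential and nonessential genes. It is not the Gibbs sampler of Blades and Broman nor the parametric bootstrap of Jacobs et al. It assumes, purely for illustration, that every viable clone lands uniformly at random in one of the nonessential genes (equal gene lengths, no insertion bias). The function names and the genome sizes in the usage lines are hypothetical.

```python
import random


def simulate_unhit_genes(n_genes, n_essential, n_clones, seed=0):
    """Simulate one library; return (unhit_total, unhit_nonessential).

    Assumption (not from the paper): each viable clone disrupts one
    nonessential gene chosen uniformly at random.
    """
    rng = random.Random(seed)
    n_nonessential = n_genes - n_essential
    # Set of nonessential genes that received at least one insertion.
    hit = {rng.randrange(n_nonessential) for _ in range(n_clones)}
    unhit_nonessential = n_nonessential - len(hit)
    # Essential genes never appear in the library, so they are always unhit.
    return n_essential + unhit_nonessential, unhit_nonessential


def expected_unhit_nonessential(n_genes, n_essential, n_clones):
    """Expected count of nonessential genes missed under the uniform model."""
    n_nonessential = n_genes - n_essential
    return n_nonessential * (1.0 - 1.0 / n_nonessential) ** n_clones


if __name__ == "__main__":
    # Hypothetical genome: 1000 genes, 100 of them essential.
    for clones_per_orf in (1, 2, 5):
        n_clones = clones_per_orf * 1000
        total, missed = simulate_unhit_genes(1000, 100, n_clones)
        expected = expected_unhit_nonessential(1000, 100, n_clones)
        print(clones_per_orf, total, missed, round(expected, 2))
```

Under this idealized model the number of nonessential genes missed by chance shrinks roughly exponentially as the number of clones per open reading frame grows, so the unhit class increasingly consists of essential genes. Real transposon insertion is not uniform, which is why the abstract stresses the need for an accurate insertion model or a large library.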