Fosmid library end sequencing reveals a rarely known genome structure of marine shrimp Penaeus monodon

Shiao-Wei Huang,You-Yu Lin,En-Min You,Tze-Tze Liu,Hung-Yu Shu,Keh-Ming Wu,Shih-Feng Tsai,Chu-Fang Lo,Guang-Hsiung Kou,Gwo-Chin Ma,Ming Chen,Dongying Wu,Takashi Aoki,Ikuo Hirono,Hon-Tsen Yu
DOI: https://doi.org/10.1186/1471-2164-12-242
IF: 4.547
2011-05-17
BMC Genomics
Abstract:Abstract Background The black tiger shrimp ( Penaeus monodon ) is one of the most important aquaculture species in the world, representing the crustacean lineage which possesses the greatest species diversity among marine invertebrates. Yet, we barely know anything about their genomic structure. To understand the organization and evolution of the P. monodon genome, a fosmid library consisting of 288,000 colonies and was constructed, equivalent to 5.3-fold coverage of the 2.17 Gb genome. Approximately 11.1 Mb of fosmid end sequences (FESs) from 20,926 non-redundant reads representing 0.45% of the P. monodon genome were obtained for repetitive and protein-coding sequence analyses. Results We found that microsatellite sequences were highly abundant in the P. monodon genome, comprising 8.3% of the total length. The density and the average length of microsatellites were evidently higher in comparison to those of other taxa. AT-rich microsatellite motifs, especially poly (AT) and poly (AAT), were the most abundant. High abundance of microsatellite sequences were also found in the transcribed regions. Furthermore, via self-BlastN analysis we identified 103 novel repetitive element families which were categorized into four groups, i.e ., 33 WSSV-like repeats, 14 retrotransposons, 5 gene-like repeats, and 51 unannotated repeats. Overall, various types of repeats comprise 51.18% of the P. monodon genome in length. Approximately 7.4% of the FESs contained protein-coding sequences, and the Inhibitor of Apoptosis Protein (IAP) gene and the Innexin 3 gene homologues appear to be present in high abundance in the P. monodon genome. Conclusions The redundancy of various repeat types in the P. monodon genome illustrates its highly repetitive nature. In particular, long and dense microsatellite sequences as well as abundant WSSV-like sequences highlight the uniqueness of genome organization of penaeid shrimp from those of other taxa. These results provide substantial improvement to our current knowledge not only for shrimp but also for marine crustaceans of large genome size.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?