Development of the high-throughput tool HighCodon for high-frequency codons

Yu Jincong,Fang Baishan
DOI: https://doi.org/10.3969/j.issn.1001-4160.2011.05.004
2011-01-01
Abstract:The identification of special codons is one of important issues for the codon usage research.It plays a key role in the experiment design of the codon optimization in genetic engineering.High-frequency codons(HFC) are a kind of these codons.However,there are some problems in the existed identification standard of HFC.In this paper,the high-throughput high-frequency codons software named HighCodon was developed as a powerful tool,which solved the bottle-neck to test the applicability of the standard in mass sequence data.The software was mainly consist of three modules,that is,input analyzing module,codon usage table generating module and high-frequency codon identifying module.HighCodon had three remarkable features,(i) multi-data sources that included the local and the remote,local FASTA format sequence files,local codon usage table(CUT) format CUT files and remote CUT address in codon usage database(CUD) were acceptable;(ii) high-flexibility,the mixed input of above three sources was supportable;(iii) high-throughput,batch processing was applied for dealing multi-input records.The tool was integrated well with the important online server CUD.That means gaining up to 35799 species' CUTs meanwhile analyzing HFC is rather convenient.Besides,the paper proposed a kind of CUT format based on FASTA format,in order to achieve the storage and exchange of CUT data.
What problem does this paper attempt to address?