A Brief Survey on the Probability and Statistics Method in Bioinformatics
钱敏平,沈世镒
DOI: https://doi.org/10.3969/j.issn.1000-0917.2004.06.002
IF: 1.675
2004-01-01
Advances in Mathematics
Abstract:Along with continously improving and developing of the biotechnology, especially completing of various whole genome projects of human (HGP), rice, mouse and rat, etc. In the near future, the data for amino acids, proteins, and their interaction will accumulate exponentially. The bioinformatics becomes very hot in biology, and it can be expected being hotter and hotter. This is due to that obtaining tremendous amount of data only provides conditions to reach knowledges, and which can only be acquired after rules and laws being found from data by analyzing. For example, HGP only provides the sequences of 4 amino acids (A, T, C, G), the blue print of our body, which means nothing for persons who do not know much about genes and proteins. This just like getting a Chinese book only a very little step to know what it says for a person knowing very little characters. In fact, the understanding about genes by the mankind looks like only in the elementary school level. On the other hand, we are facing an extremely active era of biotechnology, and many scientists call it the harvest age of genome projects. It not only makes it possible to obtain important results in pure science, but also provides opportunities for applications with great economical and social benefits. We should utilize these opportunities without hesitancy to exert the cooperation of multi-disciplines for going to the frontier of international science. In this marching, the mathematical modeling, ideas and algorithms, especially those of the probability theory and statistics will play key roles.