Applications of Statistics on Researches in Human Genome Project

唐晓静,张新生

DOI: https://doi.org/10.3969/j.issn.1672-1454.2003.06.002

2003-01-01

Abstract:We give four examples to show the important applications of probability and statistics in the researches of (Human) Genome Project. Some basic concepts of gene, such as, chromosomes, DNA and genetic codes ect. are also (introduced).

What problem does this paper attempt to address?

A Brief Survey on the Probability and Statistics Method in Bioinformatics

钱敏平,沈世镒

DOI: https://doi.org/10.3969/j.issn.1000-0917.2004.06.002

IF: 1.675

2004-01-01

Advances in Mathematics

Abstract:Along with continously improving and developing of the biotechnology, especially completing of various whole genome projects of human (HGP), rice, mouse and rat, etc. In the near future, the data for amino acids, proteins, and their interaction will accumulate exponentially. The bioinformatics becomes very hot in biology, and it can be expected being hotter and hotter. This is due to that obtaining tremendous amount of data only provides conditions to reach knowledges, and which can only be acquired after rules and laws being found from data by analyzing. For example, HGP only provides the sequences of 4 amino acids (A, T, C, G), the blue print of our body, which means nothing for persons who do not know much about genes and proteins. This just like getting a Chinese book only a very little step to know what it says for a person knowing very little characters. In fact, the understanding about genes by the mankind looks like only in the elementary school level. On the other hand, we are facing an extremely active era of biotechnology, and many scientists call it the harvest age of genome projects. It not only makes it possible to obtain important results in pure science, but also provides opportunities for applications with great economical and social benefits. We should utilize these opportunities without hesitancy to exert the cooperation of multi-disciplines for going to the frontier of international science. In this marching, the mathematical modeling, ideas and algorithms, especially those of the probability theory and statistics will play key roles.
The Application of Population Genomics in Crop Research

Fan-Jing Yang,Wei Ma,Chu-Yu Ye

DOI: https://doi.org/10.3390/agronomy13102480

2023-01-01

Abstract:Population genomics is a rapidly developing discipline at the crossroads of population genetics and genomics [...]
The Application of the High Throughput Sequencing Technology in the Transposable Elements

Zhen Liu,Jian-hong Xu

DOI: https://doi.org/10.16288/j.yczz.15-140

2015-01-01

Abstract:High throughput sequencing technology has dramatically improved the efficiency of DNA sequencing, and decreased the costs to a great extent. Meanwhile, this technology usually has advantages of better specificity, higher sensitivity and accuracy. Therefore, it has been applied to the research on genetic variations, transcriptomics and epigenomics. Recently, this technology has been widely employed in the studies of transposable elements and has achieved fruitful results. In this review, we summarize the application of high throughput sequencing technology in the fields of transposable elements, including the estimation of transposon content, preference of target sites and dis-tribution, insertion polymorphism and population frequency, identification of rare copies, transposon horizontal transfers as well as transposon tagging. We also briefly introduce the major common sequencing strategies and algo-rithms, their advantages and disadvantages, and the corresponding solutions. Finally, we envision the developing trends of high throughput sequencing technology, especially the third generation sequencing technology, and its ap-plication in transposon studies in the future, hopefully providing a comprehensive understanding and reference for related scientific researchers.
Summary of Genetic Algorithms Research

葛继科,邱玉辉,吴春明,蒲国林

DOI: https://doi.org/10.3969/j.issn.1001-3695.2008.10.008

2008-01-01

Abstract:This paper introduced the history,basic principle and main characters of genetic algorithms,discussed the theory,technology,limitation and improving measures about genetic algorithm.Then summarized the implementation techniques and applications of genetic algorithms,analyzed the research state of genetic algorithms in China during the past five years,and pointed out the genetic algorithms' research directions in the future.
[Analysis and Application of SNP and Haplotype in the Human Genome].

Jing Li,Yu-Chun Pan,Yi-Xue Li,Tie-Liu Shi

2005-01-01

Abstract:Single nucleotide polymorphism (SNP) is the most common type of genetic variant in human genome. Haplotype, defined as a specific set of alleles observed on a single chromosome, or a part of a chromosome,has been an integral part of human genetics for decades. The goal of the international HapMap project is to determine the common patterns of DNA sequence variation and find the Tag SNPs representing all SNPs in the human genome. Some studies demonstrated that the analyses of haplotype defined by the grouping and interaction of several variants rather than any individual SNP correlated with complex phenotypes. Here, we describe the definitions of SNPs, genotype, haplotype and some information of the HapMap project. In this review, we summarize the current three haplotype-inference methods, including Clark' method, EM algorithm and Byes approach, and the different defining methods for haplotype block, as well as the methods for choosing tag SNPs and association studies of complex diseases using haplotype. The major public SNP databases and applications of SNPs and haplotype in common complex diseases and drug response are also introduced in the paper.
Next Generation Sequencing Technology and Its Application in Detecting Gene Mutations

ZHOU Fan,LIN Biao-yang

DOI: https://doi.org/10.16605/j.cnki.1007-7847.2012.05.013

2012-01-01

Abstract:Gene mutations can lead to human diseases,and play an important role in diagnosis and treat-ment of them.Next generation sequencing technology,with features of high throughput,rapid sequencing and low cost,brings revolutionary changes in the fields of detecting gene mutations.The process of using this method to dectect mutations is sample.Researchers can combine whole genome resequencing,target sequenc-ing and transcriptome sequencing to dectect mutations in all-around,high accurate way.
Statistics in the Genomic Era

Hui Jiang,Kevin He

DOI: https://doi.org/10.3390/genes11040443

IF: 4.141

2020-01-01

Genes

Abstract:In recent years, technology breakthroughs have greatly enhanced our ability to understand the complex world of molecular biology [...]
[The Application of Human Mutation Databases].

Yong-Long Zuang,Min Zhou,Yan-Da Li,Yan Shen

DOI: https://doi.org/10.3321/j.issn:0253-9772.2004.04.020

2004-01-01

Abstract:Researches on genome mutation are becoming more and more important with the finish of human genome DNA draft. This review is to classify the existing human mutation databases, including mutation database, SNP(single nucleotide polymorphisms) databases, mutation databases about disease, mutation databases about proteins, mutation databases about map and mutation information about specific gene. We also give advice on how to utilize these mutation databases, and discuss problems of existing databases.
Statistical Methods for the Analysis of Genomic Data

Hui Jiang,Zhi Qiang He

2020-01-01

Abstract:In recent years, technological breakthroughs have greatly enhanced our ability to understand the complex world of molecular biology. Rapid developments in genomic profiling techniques, such as high-throughput sequencing, have brought new opportunities and challenges to the fields of computational biology and bioinformatics. Furthermore, by combining genomic profiling techniques with other experimental techniques, many powerful approaches (e.g., RNA-Seq, Chips-Seq, single-cell assays, and Hi-C) have been developed in order to help explore complex biological systems. As a result of the increasing availability of genomic datasets, in terms of both volume and variety, the analysis of such data has become a critical challenge as well as a topic of great interest. Therefore, statistical methods that address the problems associated with these newly developed techniques are in high demand. This book includes a number of studies that highlight the state-of-the-art statistical methods for the analysis of genomic data and explore future directions for improvement.
Research Progress on Application of Microhaplotype in Forensic Genetics.

Jing Zhou,Yan Wang,Enping Xu

DOI: https://doi.org/10.3724/zdxbyxb-2021-0180

2021-01-01

Abstract:As a novel genetic marker, microhaplotype can be applied in the field of forensic genetics. Microhaplotype has the advantages of high polymorphism, low mutation rate, no stutter products and short amplification fragments. Microhaplotype can effectively detect mixture, and quantitatively analyze the contributors of mixture. DNA with severe fragmentation can be successfully genotyped by microhaplotype. It can be used as ancestry informative marker to effectively divide the global continental population according to genetic structure. Microhaplotype system can provide more information than traditional short tandem repeat and help to identify complex relationships. It can provide new ideas for tumor source identification, cell line identification and prenatal paternity testing. Here we review the applications of microhaplotype, intending to provide references for forensic practice.
Applying Genetic Analysis in Anthropological Studies——An Interview with Anthropologists (39)

XU Jie-shun,JIN Li

DOI: https://doi.org/10.3969/j.issn.1673-8179.2006.03.010

2006-01-01

Abstract:This paper explores the application of genetics in anthropological studies and the future prospect, including in such sub-branches as physical anthropology, medical anthropology; molecular anthropology; and archaeological anthropology.
Teaching Examples of Applied Bioinformatics Course

Luo Jingchu

DOI: https://doi.org/10.13560/j.cnki.biotech.bull.1985.2015.07.001

2015-01-01

Abstract:In this article, we introduce the basic bioinformatics analysis methods and tools taking the hemoglobin as an example. The methods include:1)protein and DNA sequence alignment;2)advanced search for UniProt and RefSeq database;3)Blast database similarity search;4)phylogenetic tree construction under MEGA;5)protein structure comparison using Swiss-PdbViewer.
The application of recombinant DNA technology for genetic probing in epidemiology.

C. Caskey,R. Gibbs

DOI: https://doi.org/10.1146/ANNUREV.PU.10.050189.000331

IF: 21.87

Annual Review of Public Health

Abstract:Advances in recombinant DNA technology have greatly improved the pros pects for large scale genetic screening. The developments inclu�e better methods for obtaining DNA probes that are linked to disease traits �nd novel procedures for rapidly identifying DNA variation. The new methods are simple, reliable and can be automated. Because DNA is the common sub strate, a single format can be used to analyze a wide variety of different traits. Much of the very recent technological advancement is due to the develop ment of the polymerase chain reaction (PCR), a procedure for the in vitro amplification of nucleic acid sequences (69). In less than two years the method has found application in almost every facet of molecular biology. In the near future, the PCR will be used both in the elucidation of the molecular basis of genetic disease and in widespread screening for the disease alleles. Several groups are appropriate subjects for genetic screening by DNA methods. Prenatal and carrier screening will continue in families where risk for inherited disease has been established. Similarly, genetically isolated populations with high frequencies of particular traits will continue to be examined closely. However, the greatest increase in the application of DNA screening will be in testing the general population for recurrent alleles that cause or are associated with disease. Carriers of traits with simple modes of inheritance will be identified and individuals at risk for late onset disorders

Biology,Medicine
On Statistical Analysis of Forensic DNA: Theory, Methods and Computer Programs.

Wing K. Fung,Yue-Qing Hu,Yuk-Ka Chung

DOI: https://doi.org/10.1016/j.forsciint.2006.06.025

IF: 2.676

2006-01-01

Forensic Science International

Abstract:Statistics plays an important role in evaluating the evidential weight of forensic DNA. In this paper, general statistical principles for forensic DNA analysis are presented. We introduce the theory and methods for the statistical assessment in kinship determination and DNA mixture evaluation. In particular, analytical formulas for testing for biological relationship among three individuals and for assessing the DNA mixture evidence in the case of multiple subdivided ethnic groups are developed. Two user-friendly computer programs are demonstrated to exhibit their wide applicability in tackling with complex kinship/paternity and mixture problems. The EasyDNA program can solve a complicated paternity case in 1 min.
A New Distribution Vector and Its Application in Genome Clustering.

Bo Zhao,Rong He,Stephen S.‐T. Yau

DOI: https://doi.org/10.1016/j.ympev.2011.02.020

IF: 5.019

2011-01-01

Molecular Phylogenetics and Evolution

Abstract:In this paper we report a novel mathematical method to transform the DNA sequences into the distribution vectors which correspond to points in the sixty dimensional space. Each component of the distribution vector represents the distribution of one kind of nucleotide in k segments of the DNA sequences. The mathematical and statistical properties of the distribution vectors are demonstrated and examined with huge datasets of human DNA sequences and random sequences. The determined expectation and standard deviation can make the mapping stable and practicable. Moreover, we apply the distribution vectors to the clustering of the Haemagglutinin (HA) gene of 60 H1N1 viruses from Human, Swine and Avian, the complete mitochondrial genomes from 80 placental mammals and the complete genomes from 50 bacteria. The 60 H1N1 viruses, 80 placental mammals and 50 bacteria are classified accurately and rapidly compared to the multiple sequence alignment methods. The results indicate that the distribution vectors can reveal the similarity and evolutionary relationship among homologous DNA sequences based on the distances between any two of these distribution vectors. The advantage of fast computation offers the distribution vectors the opportunity to deal with a huge amount of DNA sequences efficiently.
The Gene Chip Technology and the Prospects of Its Application in Hematology and Oncology

Xin Zhang,Ping Zhu

DOI: https://doi.org/10.3969/j.issn.1009-2137.2000.03.013

2000-01-01

Abstract:The basic principle, technological procedure and types of gene chips were introduced in the article. The probe choice in practical application and processing and hybridization of detected samples were described. Some researchers have used gene chips in hematology, oncology and cellular differentiation. Gene chips can be used to detect the expression of oncogenes, tumor suppression genes, and cell differentiation- and apoptosis-related genes. It can also help us to further study the association between the polymorphism of human genes and disease susceptibility. The aspects of development and problems in gene chips study were discussed.
Applications and Trends of Twins Study in Genetic Epidemiology

Ai-qun HUANG,Yong-hua HU

DOI: https://doi.org/10.3760/cma.j.issn.1673-4386.2006.05.006

2006-01-01

International Journal of Genetics

Abstract:Twin study is one of the most important methods to identify the genetic basis of complex diseases or traits. The classical twins study can be used to identify the relative contribution of genetic and environmental influences on the variation of phenotype. But with the development of genetic statistics and the techniques of computer and molecular biology, the classical twin study not only has been extended, but also developed some new methods and theories, such as applications based on structural equation modeling, multivariate design, co-twin case-control study and twins family study. In this article, the methods of classical twins study extended classical twins study, and the applications of twins study to linkage and association studies of complex diseases or traits were reviewed.
The Application of Gene Sequencing in Human Diseases

Liu Yanxia,Yang Xuesong

DOI: https://doi.org/10.3760/cma.j.issn.1673-4386.2015.04.004

2015-01-01

Abstract:Substantial genetic variation information in both patients and the general population can be generated by high-throughput sequencing approaches.Human health can be greatly promoted by reasonable utilization of genetic information.However,false evaluation of variants implication in disease can have severe consequences for patients.Given the potentially significant impact on medicine,this paper will make a brief overview on the application of gene sequencing in clinical medicine and the guideline for implicating sequence variants in human disease.
Some Statistical Parameters in Molecular Evolution and Their Application

CHEN Jian-qun,WANG You-hong,JIANG Ke,ZHANG Hui

DOI: https://doi.org/10.3969/j.issn.0254-0037.2005.02.018

2005-01-01

Abstract:The precise evolution relationship and phylogenetic tree can be obtained with the aid of analyses and calculations of certain gene data. Some statistical parameters of molecular evolution were introduced and their application was explained, including Ka (nonsynonymous nucleotide substitutions), Ks (synonymous nu-cleotide substitutions) and Ka/Ks. The other important parameters, for example DNA polymorphism and genetic distance, were also discussed. Based on these evolution statistical parameters, the positive selection and gene evolution speed could be determined. An example was used to reveal the evolution mechanism of plant resistant genes with analyses of three parameters.
High-throughput Sequencing Technology and Its Application

Yang Zhi-rong Wang Min Li Wei Li Sheng-cai Wang Xing-chun

2012-01-25

Abstract:As a milestone in the development of DNA sequencing,high-throughput sequencing technology provides an unprecedented opportunity for the modern life sciences.The recent progress on this technology,including the second generation sequencing technology(represented by 454,Solexa and SOLiD),the third generation sequencing technology(represented by HeliScope TIRM and Pacific Biosciences SMART)and the Ion Personal Genome Machine sequencing technology are summarized.Then,the application of the high-throughput sequencing technology in genome sequencing,transcriptome sequencing,gene expression regulation,detection of binding locations for transcription factors and methylation analysis are summarized.Finally,the disadvantages and the prospects of this technology were discussed.

Engineering,Biology,Computer Science

Applications of Statistics on Researches in Human Genome Project

A Brief Survey on the Probability and Statistics Method in Bioinformatics

The Application of Population Genomics in Crop Research

The Application of the High Throughput Sequencing Technology in the Transposable Elements

Summary of Genetic Algorithms Research

[Analysis and Application of SNP and Haplotype in the Human Genome].

Next Generation Sequencing Technology and Its Application in Detecting Gene Mutations

Statistics in the Genomic Era

[The Application of Human Mutation Databases].

Statistical Methods for the Analysis of Genomic Data

Research Progress on Application of Microhaplotype in Forensic Genetics.

Applying Genetic Analysis in Anthropological Studies——An Interview with Anthropologists (39)

Teaching Examples of Applied Bioinformatics Course

The application of recombinant DNA technology for genetic probing in epidemiology.

On Statistical Analysis of Forensic DNA: Theory, Methods and Computer Programs.

A New Distribution Vector and Its Application in Genome Clustering.

The Gene Chip Technology and the Prospects of Its Application in Hematology and Oncology

Applications and Trends of Twins Study in Genetic Epidemiology

The Application of Gene Sequencing in Human Diseases

Some Statistical Parameters in Molecular Evolution and Their Application

High-throughput Sequencing Technology and Its Application