A Genome-Based Resource for Molecular Cardiovascular Medicine

David Hwang, Adam A. Dempsey, Ruoxiang Wang, Mojgan Rezvani, J. David Barrans, Ken-Shwo Dai, Huiyuan Wang, Hong Mā, Eva Cukerman, Yuqing Liu, Gu Jianren, Jinghui Zhang, Stephen Kwok‐Wing Tsui, Mary Miu Yee Waye, Kwok‐Pui Fung, Cheuk-Yu Lee, Choong‐Chin Liew
DOI: https://doi.org/10.1161/01.cir.96.12.4146
IF: 37.8
1997-01-01
Circulation
Abstract: Large-scale partial sequencing of cDNA libraries to generate expressed sequence tags (ESTs) is an effective means of discovering novel genes and characterizing transcription patterns in different tissues. To catalogue the identities and expression levels of genes in the cardiovascular system, we initiated large-scale sequencing and analysis of human cardiac cDNA libraries. Using automated DNA sequencing, we generated 43,285 ESTs from human heart cDNA libraries. An additional 41,619 ESTs were retrieved from public databases, for a total of 84,904 ESTs representing more than 26 million nucleotides of raw cDNA sequence data from 13 independent cardiovascular system-based cDNA libraries. Of these, 55% matched to known genes in the GenBank/EMBL/DDBJ databases, 33% matched only to other ESTs, and 12% did not match to any known sequences (designated cardiovascular system-based ESTs, or CVbESTs). ESTs that matched to known genes were classified according to function, allowing for detection of differences in general transcription patterns between various tissues and developmental stages of the cardiovascular system. In silico Northern analysis of known gene matches identified widely expressed cardiovascular genes as well as genes putatively exhibiting greater tissue specificity or developmental stage specificity. More detailed analysis identified 48 genes potentially overexpressed in cardiac hypertrophy, at least 10 of which were previously documented as differentially expressed. Computer-based chromosomal localizations of 1048 cardiac ESTs were performed to further assist in the search for disease-related genes. These data represent the most extensive compilation of cardiovascular gene expression information to date. They further demonstrate the untapped potential of genome research for investigating questions related to cardiovascular biology and represent a first-generation genome-based resource for molecular cardiovascular medicine.
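
The in silico Northern analysis mentioned in the abstract compares how often ESTs for a given gene appear in different cDNA libraries. The abstract does not describe the authors' actual procedure, so the following Python sketch is only an illustration under stated assumptions: the function name, the library and gene labels, the toy data, and the normalization to ESTs per 10,000 are all hypothetical, and library size is taken here as the number of matched ESTs supplied for that library.

```python
from collections import Counter


def in_silico_northern(est_hits, per=10_000):
    """Estimate relative expression from EST counts.

    est_hits: iterable of (library, gene) pairs, one pair per EST that
              matched a known gene (hypothetical input format).
    Returns {gene: {library: EST count per `per` ESTs in that library}}.
    """
    est_hits = list(est_hits)
    # Assumption: library size = number of ESTs supplied for that library.
    library_sizes = Counter(library for library, _ in est_hits)
    pair_counts = Counter(est_hits)

    profile = {}
    for (library, gene), count in pair_counts.items():
        normalized = per * count / library_sizes[library]
        profile.setdefault(gene, {})[library] = normalized
    return profile


# Hypothetical toy data, not taken from the paper.
hits = (
    [("fetal_heart", "GENE_A")] * 3
    + [("fetal_heart", "GENE_B")] * 1
    + [("adult_heart", "GENE_A")] * 1
    + [("adult_heart", "GENE_B")] * 3
)
profile = in_silico_northern(hits)
for gene, by_library in sorted(profile.items()):
    print(gene, by_library)
```

A gene whose normalized count is much higher in one library than in the others would be a candidate for tissue-specific or stage-specific expression, subject to the sampling noise inherent in small EST counts.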