Abstract: Although ten years have passed since the draft sequence of the human genome was reported in 2001, the exact number of human protein-encoding genes is still uncertain. The latest release of H-InvDB (release 6.2) annotates 34,511 protein-encoding genes, most of which are of unknown function. Identifying all human genes and their functions remains a long-term task. In our previous genome-wide analysis of single-block EST sequences with polyadenylation sites, we found many ESTs that represented potentially novel, unidentified genes or splice variants. In the present study, we focused on cloning and identifying a novel cDNA sequence represented by the EST AV653338, whose predicted product, after in silico extension of the sequence by EST contig assembly, contained two incomplete colipase-like domains. Using mixed cDNAs from human cell lines as templates, we successfully cloned the sequence and identified a novel human colipase-like protein-encoding gene. We named it hCLPSL3 (human colipase-like 3) because two other genes whose protein products also contain colipase-like domains are already registered in the international nucleotide databases: C6ORF126 (referred to here as CLPSL1) and C6ORF127 (referred to here as CLPSL2). In total, two transcript variants were obtained, hCLPSL3-v1 and hCLPSL3-v2. Only hCLPSL3-v1 contained a complete open reading frame (ORF), encoding 159 amino acids, whereas the ORF of hCLPSL3-v2 was interrupted by a premature termination codon (PTC), making it a possible substrate of nonsense-mediated mRNA decay. Human CLPSL3-v1 contained a typical signal peptide (aa 1-22) predicted by the SignalP 4.0 Server, two highly homologous internal sequence repeat regions (aa 46-84 and aa 88-125), and two incomplete colipase-like domains (aa 25-68 and aa 113-159) predicted by InterProScan and Motif Scan, respectively. Using nested RT-PCR for expression analysis in several cell lines (293T, HeLa, U2OS, HepG2, HCT116, A549, H1299, Jurkat, H520 and THP-1), we found that hCLPSL3 was mainly expressed in 293T, U2OS, HCT116 and THP-1 cells. Because human tissue cDNAs were not available, expression in human tissues was not examined. The typical signal peptide in CLPSL3 suggests that it is probably a secreted protein. We therefore constructed a recombinant eukaryotic expression vector for hCLPSL3-v1 and overexpressed it in human 293T cells. As expected, Western blot analysis of the culture supernatant confirmed that human CLPSL3 is secreted. Mouse and rat CLPSL3 homologous cDNAs were also predicted by homology analysis. Using RT-PCR, three mouse CLPSL3 transcript variants (mCLPSL-v1, -v2 and -v3) were successfully cloned from kidney (-v1), colon (-v2) and spleen (-v3) tissues, respectively. Only mCLPSL-v1 contained a complete ORF, encoding 161 amino acids, whereas the ORFs of both mCLPSL-v2 and -v3 were interrupted by PTCs. Rat CLPSL3 (rCLPSL3) cDNAs, encoding a product of 162 amino acids, were successfully cloned from both colon and small intestine tissues. However, rCLPSL3 was undetectable in rat pancreas. CLPSL3 was widely expressed and highly conserved in mammals, including Pan troglodytes, Equus caballus, Cavia porcellus, Loxodonta africana, Mus musculus and Rattus norvegicus, and showed very similar gene structures across these species. Moreover, CLPSL3 contained 18 highly conserved cysteines in all species, suggesting that these residues may be involved in disulfide bond formation. Colipase is a cofactor required by pancreatic lipase for efficient hydrolysis of dietary lipids. Because expression of both mCLPSL and rCLPSL3 was detectable in the digestive tract, whether they play important roles in dietary lipid hydrolysis needs further investigation. Our studies lay a foundation for future functional study of CLPSL3. In addition, all of the novel nucleotide sequences have been submitted to the GenBank database under the accession numbers JQ012741 (hCLPSL3-v1), JQ012742 (hCLPSL3-v2), JQ258939 (mCLPSL-v1), JQ258940 (mCLPSL-v2), JQ258941 (mCLPSL-v3) and JQ258942 (rCLPSL3).

The Structure and Evolution of a 461 Amino Acid Human Protein C Precursor and Its Messenger RNA, Based Upon the DNA Sequence of Cloned Human Liver cDNAs

Cloning and Characterization of Human Liver cDNA Encoding a Protein S Precursor.

Evolution and Organization of the Human Protein C Gene.

Structure And Evolution Of The Human Genes Encoding Protein-C And Coagulation Factor-IX

Cloning of a new human cytochrome P450 2A6 cDNA

Cloning and Expression of Tachypleus Tridentatus Factor C

The human alpha 1(XV) collagen chain contains a large amino-terminal non-triple helical domain with a tandem repeat structure and homology to alpha 1(XVIII) collagen.

Cloning and Sequence Analysis of the Human Mitochondrial Translational Initiation Factor 2 cDNA

An Encoding cDNA Sequence and Its Protein Prospection from Human Fetal Brain

Cloning of Human Cytochrome P450 2A6 cDNA and Its Expression in Mammalian Cells

Cloning and Sequence Analysis of the Human cDNA Encoding the Synaptoporin (Δ), a Highly Conservative Synaptic Vesicle Protein

[Cloning and Tissue Expressive Pattern Analysis of the Human Ribosomal S6 Kinase-RPS6KA5 cDNA].

[cDNA Cloning of Human Leptin and Its Expression].

Molecular Cloning and Expression Analysis of a Novel Human cDNA Fragment Encoding a Putative Ser/Thr Protein Kinase

Molecular cloning and characterization of a novel cystatin-like molecule, CLM, from human bone marrow stromal cells.

Cloning and identification of human colipase-like 3 and its homologous cDNAs

The amino acid sequence of the α-chain of human fibrinogen

Cloning and sequencing of cDNAs encoding the human hepatocyte nuclear factor 4 indicate the presence of two isoforms in human liver

Molecular Cloning and Characterization of a Novel Human C4orf13 Gene, Tentatively a Member of the Sodium Bile Acid Cotransporter Family.

Cloning And Characterization Of A Novel Human cDNA Encoding A J-Domain Protein (DNAJA5) From The Fetal Brain

[Construction and characterization of a cDNA library from human liver tissue of cirrhosis]