A large family of Dscam genes with tandemly arrayed 5′ cassettes in Chelicerata
Yuan Yue,Hongru Ma,Shouqing Hou,Guozheng Cao,Weiling Hong,Yang Shi,Pengjuan Guo,Baoping Liu,Feng Shi,Yun Yang,Yongfeng Jin,Yijun Meng
DOI: https://doi.org/10.1038/ncomms11252
IF: 16.6
2016-01-01
Nature Communications
Abstract: Drosophila Dscam1 (Down Syndrome Cell Adhesion Molecules) and vertebrate clustered protocadherins (Pcdhs) are two classic examples of the extraordinary isoform diversity from a single genomic locus. Dscam1 encodes 38,016 distinct isoforms via mutually exclusive splicing in D. melanogaster, while the vertebrate clustered Pcdhs utilize alternative promoters to generate isoform diversity. Here we reveal a shortened Dscam gene family with tandemly arrayed 5′ cassettes in Chelicerata. These cassette repeats generally comprise two or four exons, corresponding to variable Immunoglobulin 7 (Ig7) or Ig7–8 domains of Drosophila Dscam1. Furthermore, extraordinary isoform diversity has been generated through a combination of alternative promoter use and alternative splicing. These sDscams have a high sequence similarity with Drosophila Dscam1, and share striking organizational resemblance to the 5′ variable regions of vertebrate clustered Pcdhs. Hence, our findings have important implications for understanding the functional similarities between Drosophila Dscam1 and vertebrate Pcdhs, and may provide further mechanistic insights into the regulation of isoform diversity.
Alternative transcription and alternative splicing are two major means to expand the transcriptomic and proteomic repertoire from a single gene1, 2. Drosophila Dscam1 (Down Syndrome Cell Adhesion Molecules) and vertebrate clustered protocadherins (Pcdhs) are two classic examples of the extraordinary protein isoform diversity that can arise from a single complex genomic locus in two phyla3, 4. The Dscam1 gene encodes 38,016 distinct isoforms via mutually exclusive alternative splicing of four arrays of tandemly duplicated exons in D. melanogaster3. These Dscam1 isoforms are expressed stochastically and combinatorially, and exhibit isoform-specific homophilic binding5, 6, 7, 8, 9, 10.
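The figure of 38,016 follows from choosing exactly one variant in each of the four exon arrays, so the array sizes multiply. A minimal sketch of that arithmetic, assuming the commonly cited array sizes for D. melanogaster Dscam1 (12, 48, 33 and 2 variants for exons 4, 6, 9 and 17); these per-array counts are not stated in the text above and the names are illustrative:

```python
from math import prod

# Assumed variant counts per mutually exclusive exon array of
# D. melanogaster Dscam1 (commonly cited values, not given in the text).
DSCAM1_ARRAYS = {"exon4": 12, "exon6": 48, "exon9": 33, "exon17": 2}


def isoform_count(arrays):
    """One variant is chosen per array, so the counts multiply."""
    return prod(arrays.values())


total_isoforms = isoform_count(DSCAM1_ARRAYS)
```

Under these assumed counts the product matches the 38,016 isoforms quoted above.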
These properties provide the molecular basis of Drosophila Dscam1 as a key molecule for self-avoidance, and genetic studies have indicated that thousands of Dscam1 isoforms are required for neuronal wiring and self-avoidance8, 9, 10, 11, 12, 13, 14. In contrast to insect Dscam1, vertebrate Dscam genes do not generate extraordinary protein diversity15. However, another set of genes, the clustered Pcdhs, might perform the analogous function in vertebrates16, 17, 18. Pcdhs are the largest subgroup of the cadherin superfamily of cell adhesion proteins and are abundantly expressed in the central nervous system. In the human, 52 Pcdh proteins are encoded by 3 tightly linked gene clusters called Pcdhα, Pcdhβ and Pcdhγ, which are organized in a tandem array and on a single chromosome4. In these genes, each variable exon is preceded by a promoter, and Pcdh diversity is produced via differential promoter choice and cis-alternative splicing19, 20. The Pcdh gene cluster encodes a large repertoire of cell surface recognition proteins, which can engage in specific homophilic interactions21. Functional experiments show that deletion of the mouse Pcdhγ gene cluster could cause defective dendritic self-avoidance in retinal starburst amacrine cells or in Purkinje cells22. This observation suggests that clustered Pcdhs, similar to Drosophila Dscam1, may also mediate neurite self-avoidance by specifying single-cell identity21, 22, 23, 24. Conversely, such vertebrate clustered Pcdh genes have not been identified in Drosophila16. Given the striking molecular parallels between and complementary phylogenetic distribution of Dscam diversity in Drosophila and the clustered Pcdh diversity in vertebrates, it is attractive to speculate that they may have similar roles. These two phyla appear to have evolved a common molecular strategy for self-avoidance by recruiting different molecules18. 
Nevertheless, since there is a big evolutionary gap between insects and vertebrates, who shared a common ancestor more than 500 million years ago, how the evolutionary transitions and complementarities occurred remains unclear. Moreover, Drosophila Dscam1 generally generates tens of thousands of isoforms, while only 58 isoforms exist for clustered Pcdh genes in mice. This discrepancy in isoform diversity by at least 2 orders of magnitude is unlikely to be explained by the much higher common isoform tolerance for Pcdhs than is assumed for Dscam1 (ref. 18). In this study, we identified a novel Dscam gene family (sDscam) in Chelicerata that contained tandemly arrayed 5′ cassettes. The encoded proteins had a striking similarity to Drosophila Dscam1, but all lacked the canonical Immunoglobulin 1 (Ig1)–6, 10 and Fibronectin III (FNIII) 3–4, 6 domains present in classical DSCAM. The N-terminal domains of each sDscam protein are generally encoded by only one of a cluster of tandemly arrayed 5′ cassettes. These 5′ cassettes are generally comprised of two or four exons (sDscamα and sDscamβ), which correspond to variable Ig7 or Ig7–8 domains of Drosophila Dscam1. There was also high splicing complexity across variable 5′ clusters, which expanded the isoform diversity via a combination of alternative promoter and splicing activities. Thus, Drosophila Dscam1 and Chelicerata sDscam represent examples of convergent evolution for isoform diversity. This genomic organization is remarkably similar to that of the clustered Pcdhs in vertebrates. Hence, our findings have important implications to aid in our understanding of the functional similarities between two structurally unrelated families of Drosophila Dscam and vertebrate Pcdhs, and may provide further insights into the regulatory mechanisms governing the selection of tandemly arrayed 5′ variable regions. 
To trace the origins of duplicated exons of the Dscam genes in Arthropoda, the exons encoding the Ig7 orthologues of Drosophila Dscam1 in the M. martensii genome were analysed. These Ig-coding exons were tandemly arrayed across the gene body, similar to Drosophila Dscam1. Nevertheless, RNA-seq analyses and sequencing of 5′ RACE (rapid-amplification of cDNA ends) products indicated that these transcripts shared no common upstream exons, and therefore, they might initiate immediately upstream of each variable exon (Fig. 1). Importantly, we believe this was located close to the transcription start sites for each variable exon, because a stop codon was generally located in the frame immediately upstream from the ATG initiation codon in each variable cassette (Supplementary Fig. 1). Last, computer-assisted and RNA-seq analyses revealed seven novel Dscam genes in M. martensii, which were characterized by tandemly arrayed 5′ cassettes (Fig. 1). Their encoding isoforms were similar to each other and to previously characterized Drosophila Dscam1, but all lacked the canonical Ig1–6,10 and FNIII 3–4, 6 domains present in classical DSCAM. We therefore designated these novel shortened Dscam genes as sDscam. Based on different units of tandemly arrayed 5′ cassettes, these sDscams could be subdivided into two closely related subfamilies, sDscamα and sDscamβ (Fig. 1a,b). The former (sDscamα) contained tandemly arrayed 5′ cassettes with 2 exons. This tandem cassette encoded a single Ig domain, which corresponded to the Ig7 of Drosophila Dscam1 (Fig. 1a). Genome-wide analyses revealed the presence of only one member of the sDscamα subfamily, which contained at least 40 tandem copies at the 5′ variable regions. The tandemly arrayed 5′ cassette of another gene cluster subfamily (sDscamβ) generally contained 4 exons (Fig. 1b). These tandem cassettes encoded 2 Ig repeats, which corresponded to the Ig7–8 domains of Drosophila Dscam1. 
This is similar to Ig7–8 arrays in Ixodes scapularis Dscam, albeit without the annotation of the first exons25. We identified up to 6 members (sDscamβ1–sDscamβ6) of the sDscamβ subfamily, which contained 13, 8, 13, 9, 10 and 2 tandemly arrayed cassettes, respectively. In some cases, tandem cassettes could be made by the combination of different duplication units. Taken together, this unusual organization of the sDscam family potentiates the capacity to expand the transcript isoforms. We examined whether this clustered organization of sDscam found in M. martensii was conserved at the 5′ variable regions throughout Arthropoda. This analysis was expanded to include the Araneae Stegodyphus mimosarum, 2 Ixodoidean species (I. scapularis and Tetranychus urticae) and Merostomatan Limulus polyphemus. Together, these organisms comprise some of the major taxonomic groups of the Chelicerata subphylum that last shared a common ancestor ~420 million years ago26. We identified the clustered organization at the 5′ regions of sDscam in all species of the Arachnida class investigated, although the members of the tandemly arrayed 5′ cassettes differed among species (Supplementary Fig. 2). This led us to believe that the 5′ clustered organization of the sDscam family was evolutionarily conserved in Arachnida. Moreover, the sequence comparison revealed the 5′ clustered organization of the sDscamα and sDscamβ subfamilies in Merostomatan L. polyphemus (Supplementary Fig. 2). However, a similar 5′ clustered organization was not identified in any of the Dscam genes from the Mandibulata species of insect, Crustacea or Myriapoda classes, suggesting that it arose after radiation of Mandibulata and Chelicerata during the evolution of Arthropoda. Thus, we concluded that the 5′ clustered organization of sDscam was Chelicerata-specific and conserved throughout Chelicerata evolution. How the 5′ clustered organization of the sDscam gene arose was investigated next. 
Following a comprehensive comparative analysis of Dscam sequences from arthropod species (Supplementary Fig. 3), it was speculated that the sDscam gene might have originated from the sequential shortening and expansion of the Ig and FNIII domains of canonical Dscam (Fig. 2, Supplementary Fig. 4a). First, the ancestral Dscam gene underwent the loss of FNIII3–4 and Ig10 domains before the divergence of Arachnida and Merostomata. This is supported by the fact that Dscam genes lacking the FNIII3–4 and Ig10 domains are present in all Chelicerata species investigated (Supplementary Fig. 3). The further loss of the FNIII domain proximal to the transmembrane domain was followed later by the loss of the coding region encoding the N-terminal Ig1–6 domains (Fig. 2; Supplementary Fig. 4a). Eventually, a shortened Dscam evolved in the ancestral gene. Second, this shortening was followed later by 5′ segmental duplication to create two or multiple tandemly arrayed cassettes. The duplication unit may include both exons 1–2 encoding an Ig domain or exons 1–4 encoding two Ig domains and their promoters (green or blue dashed box, Fig. 2; Supplementary Fig. 4a). Moreover, phylogenetic analysis indicated that these clustered cassettes were more similar to each other than to the variable cassettes from other species (Supplementary Figs 5 and 6), suggesting that the variable cassettes were expanded in a species-specific manner. Notably, the genome analysis indicated that most sDscam genes tended to be clustered in Chelicerata (Supplementary Fig. 4b). For example, three sDscam genes clustered in the T. urticae genome, of which sDscamβ2 and sDscamβ3 were only 4 kb apart and in the same orientation. These findings strongly suggest that sDscam gene clusters result from lineage-specific duplications. Together, these results demonstrate that 5′ cassette tandem duplication, combined with gene duplication, jointly shaped the large lineage-specific repertoire of sDscam isoforms in Chelicerata. 
To determine the expression profiles of the variable cassettes in M. martensii sDscams, paired-end sequencing of poly(A)-tailed transcripts was performed on five dissected adult tissue samples, including the cephalothorax, abdomen, muscles, haemocytes and poison glands. RNA-seq reads were mapped to the genome sequence of sDscams as described above. Based on the RNA-seq data of constitutive exons, the sDscamα and sDscamβ1–6 transcripts were differentially expressed (Fig. 3a). The sDscamα and sDscamβ1–6 transcripts were expressed at much higher levels in the cephalothorax than in the abdomen, muscles and haemocytes (Fig. 3a; Supplementary Fig. 7a). This is largely consistent with previous studies in which Dscams were highly expressed in neural tissues13, 27. Notably, sDscamβ3, sDscamβ5 and sDscamβ6 transcripts were expressed at maximum levels in the poison glands. It would be of interest to know whether the sDscam isoform diversity contributes to immune protection, as previously reported for Dscam1 isoforms in insects27. Transcriptional signals were detected for almost all of the 5′ variable exons of sDscamα and the six sDscamβ genes in at least one of the tissues of M. martensii (Fig. 3b,c; Supplementary Fig. 7b,c). For each sDscam gene, the relative abundance of isoforms differed markedly among the variable exons. For example, the most abundant 10 sDscamα isoforms accounted for 54.7% and 52.5% of all reads from the cephalothorax and abdomen, respectively (Fig. 3b,c). Interestingly, the variable cassettes most distal to the constitutive exons tended to occur less frequently in all tissues for all sDscams, except for sDscamβ4. In sDscamβ2–3 and sDscamβ5–6, the inclusion frequency of a variable exon largely correlated with its proximity to the first constitutive exon (Supplementary Fig. 8a–d). Several significant differences existed in the expression profiles of various sDscam variable cassettes in different tissues. 
The 5′ variable exon usage in sDscamβ1–5 showed moderate to dramatic changes in different tissues, whereas differences in the sDscamα cassettes were relatively modest (Fig. 3b,c). Most of the 5′ variable exons of sDscamα were expressed in the cephalothorax, abdomen, haemocytes and poison glands. Nonetheless, only a subset was lowly expressed in the muscles (Fig. 3b). Similarly, most of the 5′ variable exons of sDscamβ1–6 could be detected in the cephalothorax, abdomen and poison glands, while only a subset was expressed in the haemocytes and muscles. Variable cassette 4 of sDscamβ1 was abundantly expressed in the cephalothorax, but was barely detectable in the abdomen (Fig. 3c; Supplementary Fig. 7c,d). sDscamβ3 variable cassette 11 was abundantly expressed in the poison gland, but was barely detectable in other tissues (Fig. 3c; Supplementary Fig. 7c,d). These data indicate that the selection of 5′ variable exons of sDscamα and sDscamβ is differentially regulated in different tissues. To clarify the mechanisms by which isoforms were generated and regulated from a single sDscam gene locus, it was ascertained whether the sDscam genes applied a similar strategy to that in vertebrate Pcdhs, with the alternative use of a separate promoter upstream of each first exon of a variable region19, 20. In Pcdhs, each first exon is preceded by a promoter and produces a transcript in which the first exon is spliced to common exons. To determine whether each sDscam variable cassette has its own promoter, sequences immediately upstream of the transcription start site of each variable region in sDscamα and six sDscamβ genes were examined. A rich array of potential promoter elements (PPEs) was predicted to be located upstream of the 5′ end of each variable region (Fig. 4a; Supplementary Fig. 9). Therefore our data suggest that each variable cassette is generally preceded by a given promoter. 
We first validated the promoter activity of sDscamβ6, which contains only two tandemly arrayed variable cassettes. To this end, a ~1.0–2 kb DNA fragment preceding each of the variable V1 and V2 cassettes was fused to luciferase in an expression vector. As shown in Fig. 4b, both constructs displayed significant promoter activity in transient transfection reporter assays in Drosophila S2 cells. This indicates that these predicted promoter sequences are sufficient to direct reporter expression in heterologous cells. To determine the minimal DNA sequence requirements for promoter activity, a series of deletion constructs was tested. Promoter function was not significantly diminished by truncations to ~300 bp (Fig. 4b). Moreover, promoter activity was only partially reduced by disruption of a given PPE, suggesting that it resulted from the combinatorial interaction of multiple PPEs, including those beyond the prediction capabilities of the program, which was based on distantly related species. Together, these results indicate that the transcription of individual variable cassettes is under the control of a distinct promoter upstream of each variable exon. In contrast to the single large first exon found in the clustered Pcdh genes4, a cassette repeat composed of two or four exons was identified in the clustered sDscam genes. This raised the question of how these variable exons were combined into distinct mRNA isoforms, particularly because the exclusion or multiple inclusions of exons 2, 3 or 4 variants would not result in a frameshift. To explore this, we defined exon junctions based on a total of 0.7 billion RNA-seq reads from different tissues. At least 264 distinct exon junctions were detected, 249 of which joined neighbouring exons within single tandem cassettes. This suggests that most isoforms are produced by joining neighbouring exons within variable cassettes. 
Moreover, we detected a small fraction of isoforms from the same cassette with either exon 2, 3 and/or 4 skipped. In these cases, the variable exon skipping resulted in an incomplete Ig domain (that is, the sDscamβ6 variable exon 3.1) (Fig. 5a). This abnormal splicing is analogous to the skipping of Dscam exon 4 variants, which results in a partial Ig2 domain and is likely to be biologically relevant28. In addition, we detected other non-canonical splicing isoforms that contained variable exons from different tandem cassettes, as well as the isoforms containing within-cassette introns (Fig. 5a). Based on the exon junctions from the RNA-seq data, we estimated that ~10–40% of isoforms resulted from non-canonical splicing in most sDscam genes, which showed differential expression in various tissues (Fig. 5a,b; Supplementary Fig. 10a,b). Taken together, these data indicate that sDscams have potentially complex splicing patterns at the 5′ variable regions. Given the low expression of a considerable number of sDscam variable exons, we systematically examined the possible exon combinations derived from different tandem cassettes using a nested reverse transcription–PCR (RT–PCR) approach. Several unexpected types of splice isoforms were detected. One type of isoform was produced by combining exons from different tandem cassettes, which encoded 2 Ig domains identical to the canonical isoform from a single cassette. For example, sDscamβ1 exon 2.1 could be spliced with the downstream variable exon 3.2, while variable exon 3.13 could be spliced with the upstream variable exon 2.10 (Fig. 5c,d). Surprisingly, sDscamβ1 variable exon 3.13 could be spliced with the upstream variable exons 4.5 and 4.6, and the resulting variable region of the mRNA isoform encoded 3 Ig repeats (Fig. 5c,d). Moreover, sDscamβ3 variable exon 4.10 could be spliced with the downstream variable exon 2.11, and the resulting variable region of the mRNA isoform encoded 4 Ig repeats. 
Furthermore, other distinct types of variable 3′ isoforms were detected (Fig. 5e,f). Similar results were obtained for other sDscamα and sDscamβ genes (Supplementary Fig. 11). Together, these results show that the multi-exon repeat architecture of sDscams can increase not only Ig sequence diversity but also Ig number plasticity (Fig. 5g). Finally, we examined how variable exons were spliced after transcription by alternative promoters. Although previous studies suggested that only the cap-proximal variable exon was joined to the first constant exon in vertebrate Pcdhs4, 19, 20, this hypothesis had not been validated experimentally due to the large size (~200 kb in the variable regions) and complexity of the clustered Pcdhs. Surprisingly, we found that intron sequences immediately downstream of the last variable exon of each cassette were frequently retained in the RNA-seq data, while introns within cassettes were completely spliced out (that is, sDscamβ1 V5, Fig. 6a,b). Interestingly, the extent of this retention differed among tissues (Fig. 6b; Supplementary Fig. 10b). The frequent occurrence of this unusual intron retention might result from the splicing of variable exons immediately downstream of the cap-proximal cassette to the constant exon (type II; Fig. 6a). Taken together, we propose that not only the cap-proximal but also the downstream variable exons are spliced to the constant exon. Next, a more sensitive assay was designed that used primers in exons 1.5 and 4.6 to validate the findings above (Fig. 6a). It was hypothesized that if the downstream variable cassette 6 (V6) could be spliced to the constant exon when sDscamβ1 was transcribed under the control of the V5 promoter, then one mRNA isoform should be produced containing the two neighbouring variable cassettes (V5 and V6) without a within-cassette intron, but with the between-cassette sequence (type II, Fig. 6a). 
The presence of this mRNA isoform was confirmed by RT–PCR and sequencing (Fig. 6c). A similar mRNA isoform was detected in sDscamβ2, although the partial interval sequences between the two neighbouring variable cassettes had been spliced out (Fig. 6d,e). Similar mRNA isoforms were observed in other sDscamβ genes (Fig. 5e; Supplementary Fig. 11c–h). Taken together, these observations strongly support our hypothesis that not only the cap-proximal, but also the downstream variable cassettes could splice to the constant exon. This also suggests that the expression of 5′ variable cassettes is not only associated with specific promoter activity, but also with post-transcriptional alternative splicing. This study identified a novel shortened Dscam gene family with tandemly arrayed 5′ cassettes in Chelicerata. These sDscams had a high sequence similarity to the 3′ region of Drosophila Dscam1, but shared striking organizational resemblance to the 5′ variable region of vertebrate clustered Pcdhs. Moreover, sDscam gene family members tended to be arranged in tandem clusters, much like the vertebrate clustered Pcdh genes4. Finally, sDscams generally contained separate promoters upstream of each first exon of the variable cassette, as occurs in vertebrate Pcdhs19, 20. Hence, our findings have important implications for understanding the functional similarities between Drosophila Dscam1 and vertebrate Pcdhs. Compared with the large exons in clustered Pcdh genes, Chelicerata sDscam genes were composed of two to four exons. This tandem multi-exon organization not only expanded the diversity of amino acid sequences, but also enabled Ig structural plasticity. In Chelicerata sDscams, additional alternative splicing methods might be employed to expand isoform diversity (Fig. 5). For example, additional isoform diversity could be generated through mutually exclusive splicing of within-cassette duplicated exons (that is, sDscamβ1 V7; Fig. 5c). 
Notably, additional sequence and structural diversity could potentially be generated through combining exons from different tandem cassettes. Thus, clustered sDscams could potentially achieve much more isoform diversity than the clustered Pcdh gene. It is very likely that this more complex organization provides a genetic mechanism for generating higher numbers and additional types of isoforms required for the diverse functions and adaptations in Chelicerata. Phylogenetic analysis of Arthropoda Dscam genes revealed that Chelicerata sDscam and Drosophila Dscam1 were classified into different clades (Supplementary Fig. 3), suggesting that they may have converged on the common protein domain diversity from independent origins. Notably, duplication of the Ig7-encoding exon 9 or its orthologues occurred internally or 5′ terminally in all Arthropoda species investigated. This suggests that the diversity of Dscam1 Ig7 or its orthologues conferred intrinsic structural and regulatory benefits during Arthropoda evolution. Recent studies indicated that Ig7 domain diversity was crucial for the proper function of Dscam1 (refs 6, 8, 10, 12, 13, 14). Dscam1 generates functionally distinct isoforms through mutually exclusive splicing of internal exons in Drosophila (Fig. 7). However, no Chelicerata Dscam genes appeared to have a similar arrangement, although a random array of only two alternatives for the Dscam1 exon 9 orthologue are often observed in Chelicerata (that is, sDscamβ1 V7). In contrast, sDscam genes have evolved other mechanisms that serve this function in Chelicerata, through a combination of alternative promoter use and alternative splicing (Fig. 7). In this scenario, Drosophila Dscam1 and Chelicerata sDscam represent examples of convergent evolution for isoform diversity. 
It is noteworthy that, compared with Drosophila Dscam1 and other Dscam proteins from metazoans containing 10 Ig and 6 FNIII extracellular repeats, a single transmembrane segment and a cytoplasmic tail15, the Chelicerata sDscams reported in this study lacked the N-terminal Ig1–6,10 domains and FNIII3–4, 6 domains present in classical DSCAM. In fact, the Ig domains differed markedly across the immunoglobulin superfamily (IgSF) proteins, ranging from 2 to 10, but with mostly 4 to 5 repeats29. Hence, we speculate that such shortened isoforms have important functions. Because Chelicerata sDscams share a striking similarity with Drosophila Dscam1, and there was a remarkable organizational resemblance to the vertebrate clustered Pcdhs, with the latter two proteins both able to mediate self-recognition and self-avoidance, it is reasonable to speculate that Chelicerata sDscams have analogous roles in the nervous system. Our results indicated that not only the cap-proximal but also the downstream variable cassettes spliced to the constant exon. Based on this evidence, we propose a mechanistic framework for the selection of tandemly arrayed 5′ variable exons (Fig. 8). This extends and revises a previously proposed model for the mechanism governing the selection of tandemly arrayed 5′ variable regions4, 19, 20. Interestingly, intron sequences downstream of the variable region exons of Pcdhs were frequently contained in complementary DNA (cDNA) in independently derived cDNA libraries, which were previously assumed to be truncated mRNA isoforms or correspond to trans-splicing precursors4. Considering the similarity of the 5′ gene structure of Chelicerata sDscams and vertebrate Pcdhs, we speculate that these unusual intron-containing cDNAs might be a consequence of the variable exons downstream of the cap-proximal exons spliced to the constant exon in vertebrate Pcdh genes. 
Therefore, our mechanistic framework might be broadly applicable to tandemly arrayed 5′ variable exons in invertebrates and vertebrates. The selection of tandemly arrayed 5′ cassettes was highly regulated by a variety of mechanisms at both the transcriptional and post-transcriptional levels. First, previous studies indicated that expression of the corresponding Pcdh mRNA might correlate with specific promoter activity19, 20. Because sDscam was under the control of a distinct promoter upstream of each variable cassette, Chelicerata sDscams should be regulated by a similar mechanism. Second, the 5′ splice site strength might have an effect on the selection of the variable exon. In general, the variant inclusion largely correlated with the strength of the 5′ splice site, but decreased with distance from the 3′ splice site of the first constitutive exon30. Based on the correlation of the inclusion frequency of a variable exon with its proximity to the first constitutive exon in sDscamβ2–3 and sDscamβ5–6, it seems that distance had some effect on the inclusion, at least for some genes. This was possibly due to higher levels of pre-mRNA containing the exons proximal to the first constitutive exon, because these exons are present in transcripts initiated from multiple upstream promoters. Finally, the selection of variable cassettes could easily be overridden in a developmental- or tissue-specific manner by the expression of specific activator- and repressor-binding proteins. Thus, variable exon selection results from multiple mechanisms acting in an overlapping manner.
The sequences of the Dscam genes from the Scorpiones M. martensii, the Araneae S. mimosarum, the Ixodoidean I. scapularis and T. urticae, and the Merostomatan L. polyphemus were annotated through BLAST searches, using the annotated Dscam sequence of the most closely related organism, and confirmed by available genome annotation and phylogenetic analysis (http://blast.ncbi.nlm.nih.gov/Blast.cgi; http://flybase.org/blast/, Supplementary Table 1). 
Gaps in the Dscam sequences for M. martensii were closed by PCR and sequencing. Genomic DNA was isolated from M. martensii (a gift from Zhijian Cao) using a QIAamp DNA Kit (Qiagen, Hilden, Germany). PCR was performed using primers designed against genomic sequences. Amplification products were cloned into the pGEM-T Easy Vector (Promega, Madison, WI, USA) for sequencing. Primer sequences are available on request. All Dscam homologues were analysed by classifying into families and predicting domains with InterPro31 (http://www.ebi.ac.uk/interpro/). Five tissues (cephalothorax, abdomen, poison gland, haemocyte and muscle) from an M. martensii adult and the whole body of a L. polyphemus adult were collected for RNA preparation. RNA library construction and paired-end RNA-seq were performed by LC Sciences (Houston, TX, USA). Briefly, total RNA was extracted using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. The total RNA quantity and purity were analysed using a Bioanalyser 2100 and RNA 6000 Nano LabChip Kit (Agilent, Santa Clara, CA, USA) with RNA integrity number >7.0. For the RNA-seq experiment, ~10 μg of total RNA was subjected to enrichment of the poly(A)-tailed mRNAs with poly(T) oligo-attached magnetic beads (Thermo Fisher Scientific, Waltham, MA, USA). After purification, the mRNA was fragmented into small pieces using divalent cations under elevated temperature. Then the cleaved RNA fragments were reverse transcribed to produce the final cDNA library according to the instructions in the mRNA-seq sample preparation kit (Illumina, San Diego, CA, USA). The paired-end RNA-seq was performed on the Illumina Hiseq 2500 platform (Illumina) following the vendor’s recommended protocols. The RNA-seq reads were de novo assembled to obtain transcripts of M. martensii and L. polyphemus using Trinity32 (https://github.com/trinityrnaseq/trinityrnaseq/wiki) with the default parameters. 
Transcripts sharing high sequence similarity were assigned to a cluster based on the default parameter settings of Trinity. For a cluster, the longest transcript was designated as the unigene of the cluster. The unigenes were functionally annotated based on sequence similarity at the protein level. Specifically, by using BLASTX (E-value<0.00001), the protein sequences translated from the unigenes were searched against the protein databases, including the NCBI non-redundant protein database, SwissProt, Kyoto Encyclopaedia of Genes and Genomes (KEGG) and Clusters of Orthologous Groups (COG) of proteins. The ends most 5′ of the sDscam unigenes were analysed for their potential transcription start sites, some of which were further verified by 5′ RACE. Tophat33 (http://ccb.jhu.edu/software/tophat/index.shtml) was used for RNA-seq mapping, the results of which were visualized using integrative genomics viewer (IGV)34 (http://www.broadinstitute.org/igv/). Considering the similarity among exon duplicates, the RNA-seq reads were split into 25- and 50-nucleotide (nt) fragments, which were mapped to calculate the expression levels of variable exons. Furthermore, to eliminate influences on calculations of the expression levels from identical sequence regions among exon duplicates, the 25- and 50-nt fragments with multiple loci were correctly allocated by referring to the mapping results of the full-length RNA-seq data sets. The correlation coefficient was calculated between the 25- and 50-nt mapping results. Similarly, to analyse the intron retention rate, both the 25- and 50-nt fragmented RNA-seq data sets were utilized to calculate the expression levels of the exon and neighbouring intron. An in-house computational program was developed to search for sequencing evidence supporting the exon–exon junctions. First, exonic sequences covering all of the possible junctions between the variable exons were created. 
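The junction search can be illustrated with a minimal sketch. This is not the authors' in-house program: the function names, the flank and minimum-overhang parameters, and the restriction to exon pairs in 5′-to-3′ genomic order are all assumptions made for illustration. It builds one reference sequence per exon pair and accepts a read only if it matches perfectly and covers a minimum number of positions on each side of the junction.

```python
from itertools import combinations


def junction_references(exons, flank):
    """Build one reference per (upstream, downstream) exon pair.

    exons: list of (name, sequence) tuples in 5'-to-3' genomic order.
    flank: number of nt kept on each side of the junction (must be > 0).
    Returns {(up_name, down_name): (reference, junction_offset)}.
    """
    if flank <= 0:
        raise ValueError("flank must be positive")
    refs = {}
    # combinations() preserves input order, so the first exon of each
    # pair is always the upstream one.
    for (up_name, up_seq), (down_name, down_seq) in combinations(exons, 2):
        left = up_seq[-flank:]
        right = down_seq[:flank]
        refs[(up_name, down_name)] = (left + right, len(left))
    return refs


def supports_junction(read, reference, offset, min_overhang):
    """True if the read matches the reference exactly and covers at least
    min_overhang nt on both sides of the junction at position offset."""
    start = reference.find(read)
    while start != -1:
        end = start + len(read)
        if offset - start >= min_overhang and end - offset >= min_overhang:
            return True
        start = reference.find(read, start + 1)
    return False


# Toy example with hypothetical exon sequences.
toy_exons = [("e1", "AAAACCCC"), ("e2", "GGGGTTTT"), ("e3", "ACGTACGT")]
toy_refs = junction_references(toy_exons, flank=4)
ref, offset = toy_refs[("e1", "e2")]
spanning = supports_junction("CCGG", ref, offset, min_overhang=2)
too_short = supports_junction("CCCG", ref, offset, min_overhang=2)
```

Exact string matching stands in here for a real aligner run with zero mismatches allowed; a production analysis would map reads with a dedicated tool and filter the alignments afterwards.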
We used 10 positions from each exon in a pair to assign a given read to an exon–exon junction. For example, the 230-nt exonic sequences included 115-nt upstream and 115-nt downstream of the junction for 125-nt RNA-seq reads. Second, all of the RNA-seq reads were mapped onto the exonic sequences created above, and the perfectly mapped RNA-seq reads covering exon–exon junctions were retained. A similar analysis was performed for 25 positions from each exon in a pair to determine the correlation with the results based on 10 positions. In addition, a similar method was used to analyse the exon-intron junction of the isoforms containing within-cassette introns. Total RNA was isolated using an RNeasy Mini Kit (Qiagen). Total RNA was reverse transcribed using SuperScript III RT (Invitrogen) with oligo(dT)15 primer, and the resulting single-stranded cDNA product was treated with DNase I at 37 °C for 30 min. The PCR was implemented with an initial denaturing at 95 °C for 3 min, followed by 35 cycles of denaturing at 95 °C for 45 s, annealing at 55 °C for 50 s, and extension at 72 °C for 2 min and 10 s, followed by a final extension at 72 °C for 10 min. The products of the PCR or the RT–PCR were purified and cloned into the pGEM-T Easy Vector and transformed into JM109 competent cells. Sequencing of individual clones was carried out using an automatic DNA sequencer. In some cases, nested PCR was necessary to amplify the products. Primer sequences are listed in Supplementary Table 2. The alignment of specific regions between species was performed using the ClustalW2 program (http://www.ebi.ac.uk/clustalw/index.html). Full-length variable region coding sequences were translated, and the resulting polypeptides were aligned. The genetic distances for each gene were estimated with MEGA 6.0 (ref. 35). The 5′ RACE analysis was performed according to the 5′ RACE Kit (Invitrogen) protocol and using the reagents from the kit. Total RNA was extracted from adult M. 
martensii cephalothoraxes using TRIzol Reagent (Life Technologies, Carlsbad, CA, USA). The RNA was subjected to reverse transcription using SuperScript II at 42 °C for 50 min, and incubated at 70 °C for 15 min to terminate the reaction. RT–PCR was carried out under the following cycling conditions: an initial denaturation of 2 min at 94 °C followed by 30–35 cycles of denaturation at 94 °C for 30 s, annealing at 55–60 °C for 30 s and extension at 72 °C for 30 s, with a final extension at 72 °C for 10 min. The promoter distribution was predicted using the Berkeley Drosophila Neural Network Promoter program (http://www.fruitfly.org/seq_tools/promoter.html). To assay the promoter activity for M. martensii, the corresponding DNA sequences immediately preceding the translational start site of sDscamα and sDscamβ were cloned into a pGL4.20-Fluc reporter vector (Promega). For sDscamβ6 V1 and V2, site-directed mutagenesis was performed to disrupt the predicted core promoter elements based on the schematic diagrams of minigene constructs (Fig. 4). The intron sequence in the common region of sDscamβ6 was cloned as a negative control. The pGL4.20 vector was used as a blank control. The promoter DNA sequence immediately preceding the translational start site of D. melanogaster Dscam2 was cloned as a positive control. All constructs were confirmed by sequencing. Drosophila S2 cells were co-transfected with the pGL4.20-Fluc reporter plasmid and the tubulin promoter-Rluc reporter plasmid (a gift from Wanzhong Ge) with Lipofectin (Invitrogen) according to the manufacturer’s instructions. Cells were lysed 48 h post transfection to measure the activity of firefly and Renilla luciferase according to the Dual-Luciferase Reporter Assay System (Promega). The mean and s.d. values were determined for each construct based on three independent transfections. Error bars were calculated from three independent experiments. 
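A dual-luciferase readout of this kind is commonly summarized by normalizing each firefly reading to the co-transfected Renilla reading and then taking the mean and s.d. across replicates. The Python sketch below illustrates that arithmetic under stated assumptions: the readings are invented, the function names are hypothetical, and the fold-over-blank step is a common convention rather than something the Methods spell out.

```python
from statistics import mean, stdev


def relative_activity(firefly, renilla):
    """Normalize each firefly reading to its paired Renilla reading."""
    if len(firefly) != len(renilla):
        raise ValueError("need one Renilla reading per firefly reading")
    return [f / r for f, r in zip(firefly, renilla)]


def fold_over_blank(construct_ratios, blank_ratios):
    """Express each normalized reading relative to the mean blank-vector ratio."""
    blank_mean = mean(blank_ratios)
    return [ratio / blank_mean for ratio in construct_ratios]


def summarize(values):
    """Return (mean, sample s.d.) across independent transfections."""
    return mean(values), stdev(values)


# Hypothetical luminometer readings from three independent transfections.
promoter = relative_activity([1200.0, 1500.0, 1350.0], [100.0, 100.0, 100.0])
blank = relative_activity([100.0, 150.0, 125.0], [100.0, 100.0, 100.0])

fold = fold_over_blank(promoter, blank)
fold_mean, fold_sd = summarize(fold)
print(f"fold activity: {fold_mean:.2f} +/- {fold_sd:.2f}")
```

`statistics.stdev` returns the sample standard deviation (n - 1 denominator), which is the usual choice for three replicates.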
The significance of differences was determined by a two-tailed Student’s t-test; *P<0.05, **P<0.01 and ***P<0.001 were taken to indicate statistical significance.
Accession codes: The RNA-seq data were deposited into NCBI SRA (Sequence Read Archive; http://www.ncbi.nlm.nih.gov/sra/) (accession numbers: SRX1319503, SRX1319674, SRX1319813, SRX1319876, SRX1319877 and SRX1323743). The Dscam gene sequences were deposited into GenBank with accession numbers KT932388–KT932417 and KU378204–KU378205.
How to cite this article: Yue, Y. et al. A large family of Dscam genes with tandemly arrayed 5′ cassettes in Chelicerata. Nat. Commun. 7:11252 doi: 10.1038/ncomms11252 (2016).
This work was partly supported by research grants from the National Natural Science Foundation of China (31430050, 31125011, 31270844), the National Science and Technology Project (2012ZX09102301-009), the 973 Program (2014CB541700) and the Doctoral Foundation of Ministry of Education (20110101130012).
-
Human Down Syndrome Cell Adhesion Molecules (DSCAMs) Are Functionally Conserved with Drosophila Dscam[TM1] Isoforms in Controlling Neurodevelopment.
Jianhua Huang,Ying Wang,Sangeetha Raghavan,Siqian Feng,Kurtis Kiesewetter,Jian Wang
DOI: https://doi.org/10.1016/j.ibmb.2011.05.008
IF: 4.421
2011-01-01
Insect Biochemistry and Molecular Biology
Abstract:Drosophila Down syndrome cell adhesion molecule (Dscam) potentially produces more than 150,000 cell adhesion molecules that share two alternative transmembrane/juxtamembrane (TM) domains, which dictate the dendrite versus axon subcellular distribution and function of different Dscam isoforms. Vertebrate genomes contain two closely related genes, DSCAM and DSCAM-Like1 (DSCAML1), which do not have extensive alternative splicing. We investigated the functional conservation between invertebrate Dscams and vertebrate DSCAMs by cross-species rescue assays and found that human DSCAM and DSCAML1 partially, but substantially, rescued the larval lethality of Drosophila Dscam mutants. Interestingly, both human DSCAM and DSCAML1 were targeted to the dendrites in Drosophila neurons, had synergistic rescue effects with Drosophila Dscam[TM2], and preferentially rescued the dendrite defects of Drosophila Dscam mutant neurons. Therefore, human DSCAM and DSCAML1 are functionally conserved with Drosophila Dscam[TM1] isoforms.
-
A chelicerate-specific burst of nonclassical Dscam diversity
Guozheng Cao,Yang Shi,Jian Zhang,Hongru Ma,Shouqing Hou,Haiyang Dong,Weiling Hong,Shuo Chen,Hao Li,Yandan Wu,Pengjuan Guo,Xu Shao,Bingbing Xu,Feng Shi,Yijun Meng,Yongfeng Jin
DOI: https://doi.org/10.1186/s12864-017-4420-0
IF: 4.547
2018-01-19
BMC Genomics
Abstract:Background: The immunoglobulin (Ig) superfamily receptor Down syndrome cell adhesion molecule (Dscam) gene can generate tens of thousands of isoforms via alternative splicing, which is essential for both nervous and immune systems in insects. However, further information is required to develop a comprehensive view of Dscam diversification across the broad spectrum of Chelicerata clades, a basal branch of arthropods and the second largest group of terrestrial animals. Results: In this study, a genome-wide comprehensive analysis of Dscam genes across Chelicerata species revealed a burst of nonclassical Dscams, categorised into four types (mDscam, sDscamα, sDscamβ, and sDscamγ) based on their size and structure. Although the mDscam gene class includes the highest number of Dscam genes, the sDscam genes utilise alternative promoters to expand protein diversity. Furthermore, we indicated that the 5' cassette duplicate is inversely correlated with the sDscam gene duplicate. We showed differential and sDscam-biased expression of nonclassical Dscam isoforms. Thus, the Dscam isoform repertoire across Chelicerata is entirely dominated by the number and expression levels of nonclassical Dscams. Taken together, these data show that Chelicerata evolved a large conserved and lineage-specific repertoire of nonclassical Dscams. Conclusions: This study showed that arthropods have a large diversified Chelicerata-specific repertoire of nonclassical Dscam isoforms, which are structurally and mechanistically distinct from those of insects. These findings provide a global framework for the evolution of Dscam diversity in arthropods and offer mechanistic insights into the diversification of the clade-specific Ig superfamily repertoire.
-
Regulation of Dscam Exon 17 Alternative Splicing by Steric Hindrance in Combination with RNA Secondary Structures.
Yuan Yue,Guoli Li,Yun Yang,Wenjing Zhang,Huawei Pan,Ran Chen,Feng Shi,Yongfeng Jin
DOI: https://doi.org/10.4161/rna.27176
2013-01-01
RNA Biology
Abstract:The gene Down syndrome cell adhesion molecule (Dscam) potentially encodes 38 016 distinct isoforms in Drosophila melanogaster via mutually exclusive splicing. Here we reveal a combinatorial mechanism of regulation of Dscam exon 17 mutually exclusive splicing through steric hindrance in combination with RNA secondary structure. This mutually exclusive behavior is enforced by steric hindrance, due to the close proximity of the exon 17.2 branch point to exon 17.1 in Diptera, and the interval size constraint in non-Dipteran species. Moreover, intron-exon RNA structures are evolutionarily conserved in 36 non-Drosophila species of six distantly related orders (Diptera, Lepidoptera, Coleoptera, Hymenoptera, Hemiptera, and Phthiraptera), which regulates the selection of exon 17 variants via masking the splice site. By contrast, a previously uncharacterized RNA structure specifically activated exon 17.1 by bringing splice sites closer together in Drosophila, while the other moderately suppressed exon 17.1 selection by hindering the accessibility of polypyrimidine sequences. Taken together, these data suggest a phylogeny of increased complexity in regulating alternative splicing of Dscam exon 17 spanning more than 300 million years of insect evolution. These results also provide models of the regulation of alternative splicing through steric hindrance in combination with dynamic structural codes.
-
Dscam homophilic specificity is generated by high order cis-multimers coupled with trans self-binding of variable Ig1 in Chelicerata
Fengyan Zhou,Guozheng Cao,Songjun Dai,Guo Li,Hao Li,Zhu Ding,Shouqing Hou,Bingbing Xu,Wendong You,Feng Shi,Xiaofeng Yang,Yongfeng Jin
DOI: https://doi.org/10.1101/2019.12.15.877159
2019-01-01
Abstract:By alternative splicing, Drosophila Dscam1 encodes tens of thousands of proteins required for establishing neural circuits, while Chelicerata encodes a family of ∼100 shortened Dscam (sDscam) isoforms via alternative promoters. We report that Dscam isoforms interact promiscuously to generate a vast repertoire of combinatorial homophilic recognition specificities in Chelicerata. Specifically, sDscams formed high order cis-multimers without isoform specificity involving the membrane-proximal fibronectin type III (FNIII) 1-3 and transmembrane (TM) domains and associated specifically via antiparallel self-binding of the first variable immunoglobulin (Ig1) domain. We propose that such sDscam combinatorial homophilic specificity is sufficient to provide each neuron with a unique identity for self–non-self discrimination. In many respects, our results amazingly mirror those reported for the structurally unrelated vertebrate protocadherins (Pcdh) rather than for the closely related fly Dscam1. Thus, our findings blur the distinction between the neuronal self-avoidance of invertebrates and vertebrates and provide insight into the basic principles and evolution of metazoan self-avoidance and self–non-self discrimination.
-
RNA secondary structures in Dscam1 mutually exclusive splicing: unique evolutionary signature from the midge
Weiling Hong,Yang Shi,Bingbing Xu,Yongfeng Jin
DOI: https://doi.org/10.1261/rna.075259.120
2020-05-29
RNA
Abstract:The Drosophila melanogaster gene Dscam1 potentially generates 38,016 distinct isoforms via mutually exclusive splicing, which are required for both nervous and immune functions. However, the mechanism underlying splicing regulation remains obscure. Here we show apparent evolutionary signatures characteristic of competing RNA secondary structures in exon clusters 6 and 9 of Dscam1 in the two midge species (Belgica antarctica and Clunio marinus). Surprisingly, midge Dscam1 encodes only ∼6000 different isoforms through mutually exclusive splicing. Strikingly, the docking site of the exon 6 cluster is conserved in almost all insects and crustaceans but is specific in the midge; however, the docking site-selector base-pairings are conserved. Moreover, the docking site is complementary to all predicted selector sequences downstream from every variable exon 9 of the midge Dscam1, which is in accordance with the broad spectrum of their isoform expression. This suggests that these cis-elements mainly function through the formation of long-range base-pairings. This study provides a vital insight into the evolution and mechanism of Dscam1 alternative splicing.
biochemistry & molecular biology
-
Revisiting Dscam diversity: lessons from clustered protocadherins
Yongfeng Jin,Hao Li
DOI: https://doi.org/10.1007/s00018-018-2951-4
Abstract:The complexity of neuronal wiring relies on the extraordinary recognition diversity of cell surface molecules. Drosophila Dscam1 and vertebrate clustered protocadherins (Pcdhs) are two classic examples of the striking diversity from a complex genomic locus, wherein the former encodes more than 10,000 distinct isoforms via alternative splicing, while the latter employs alternative promoters to attain isoform diversity. These structurally unrelated families show remarkably striking molecular parallels and even similar functions. Recent studies revealed a novel Dscam gene family with tandemly arrayed 5' cassettes in Chelicerata (e.g., the scorpion Mesobuthus martensii and the tick Ixodes scapularis), similar to vertebrate clustered Pcdhs. Likewise, octopus shows a more remarkable expansion of the Pcdh isoform repertoire than human. These discoveries of Dscam and Pcdh diversification reshape the evolutionary landscape of recognition molecule diversity and provide a greater understanding of convergent molecular strategies for isoform diversity. This article reviews new insights into the evolution, regulatory mechanisms, and functions of Dscam and Pcdh isoform diversity. In particular, the convergence of clustered Dscams and Pcdhs is highlighted.
-
Trans-splicing facilitated by RNA pairing greatly expands sDscam isoform diversity but not homophilic binding specificity
Shouqing Hou,Guo Li,Bingbing Xu,Haiyang Dong,Shixin Zhang,Ying Fu,Jilong Shi,Lei Li,Jiayan Fu,Feng Shi,Yijun Meng,Yongfeng Jin
DOI: https://doi.org/10.1126/sciadv.abn9458
IF: 13.6
2022-07-08
Science Advances
Abstract:The Down syndrome cell adhesion molecule 1 (Dscam1) gene can generate tens of thousands of isoforms via alternative splicing, which is essential for nervous and immune functions. Chelicerates generate approximately 50 to 100 shortened Dscam (sDscam) isoforms by alternative promoters, similar to mammalian protocadherins. Here, we reveal that trans-splicing markedly increases the repository of sDscamβ isoforms in Tetranychus urticae. Unexpectedly, every variable exon cassette engages in trans-splicing with constant exons from another cluster. Moreover, we provide evidence that competing RNA pairing not only governs alternative cis-splicing but also facilitates trans-splicing. Trans-spliced sDscam isoforms mediate cell adhesion ability but exhibit the same homophilic binding specificity as their cis-spliced counterparts. Thus, we reveal a single sDscam locus that generates diverse adhesion molecules through cis- and trans-splicing coupled with alternative promoters. These findings expand understanding of the mechanism underlying molecular diversity and have implications for the molecular control of neuronal and/or immune specificity.
multidisciplinary sciences
-
Molecular Characterization of DSC1 Orthologs in Invertebrate Species
Ying-Jun Cui,Lin-Lin Yu,Hai-Jun Xu,Ke Dong,Chuan-Xi Zhang
DOI: https://doi.org/10.1016/j.ibmb.2012.01.005
IF: 4.421
2012-01-01
Insect Biochemistry and Molecular Biology
Abstract:DSC1 and BSC1 are two founding members of a novel family of invertebrate voltage-gated cation channels with close structural and evolutionary relationships to voltage-gated sodium and calcium channels. In this study, we searched the published genome sequences for DSC1 orthologs. DSC1 orthologs were found in all 48 insect species, and in other invertebrate species belonging to phyla Mollusca, Cnidaria, Hemichordata and Echinodermata. However, DSC1 orthologs were not found in four arachnid species, Ixodes scapularis, Rhipicephalus microplus, Tetranychus urticae and Varroa destructor, two species in Annelida or any vertebrate species. We then cloned and sequenced NlSC1 and BmSC1 full-length cDNAs from the brown planthopper (Nilaparvata lugens) and the silkworm (Bombyx mori), respectively. NlSC1 and BmSC1 share about 50% identity with DSC1, and the expression of NlSC1 and BmSC1 transcripts was most abundant in the head and antenna in adults. All DSC1 orthologs contain a unique and conserved DEEA motif, instead of the EEEE or EEDD motif in classical calcium channels or the DEKA motif in sodium channels. Phylogenetic analyses revealed that DSC1 and its orthologs form a separate group distinct from the classical voltage-gated sodium and calcium channels and constitute a unique family of cation channels. The DSC1/BSC1-family channels could be potential targets of new and safe insecticides for pest control.
-
Intron-targeted mutagenesis reveals roles for Dscam1 RNA pairing-mediated splicing bias in neuronal wiring
Weiling Hong,Haiyang Dong,Jian Zhang,Fengyan Zhou,Yandan Wu,Yang Shi,Shuo Chen,Bingbing Xu,Wendong You,Feng Shi,Xiaofeng Yang,Zhefeng Gong,Jianhua Huang,Yongfeng Jin
DOI: https://doi.org/10.1101/622217
2019-01-01
bioRxiv
Abstract:Drosophila melanogaster Down syndrome cell adhesion molecule (Dscam1) can potentially generate 38,016 different isoforms through stochastic, yet highly biased, alternative splicing. Genetic studies demonstrated that stochastic expression of multiple Dscam1 isoforms provides each neuron with a unique identity for self/non-self-discrimination. However, due to technical obstacles, the functional significance of the highly specific bias in isoform expression remains entirely unknown. Here, we provide conclusive evidence that Dscam1 splicing bias is required for precise mushroom body (MB) axonal wiring in flies in a variable exon-specific manner. We showed that targeted deletion of the intronic docking site perturbed base pairing-mediated regulation of inclusion of variable exons. Unexpectedly, we generated mutant flies with normal overall Dscam1 protein levels and an identical number but global changes in exon 4 and exon 9 isoform bias (DscamΔ4D and DscamΔ9D), respectively. DscamΔ9D mutant exhibited remarkable mushroom body defects, which were correlated with the extent of the disrupted isoform bias. By contrast, the DscamΔ4D animals exhibited a much less severe defective phenotype than DscamΔ9D animals, suggestive of a variable domain-specific requirement for isoform bias. Importantly, mosaic analysis revealed that changes in isoform bias caused axonal defects but did not influence the self-avoidance of axonal branches. We concluded that, in contrast to the Dscam1 isoform number that provides the molecular basis for neurite self-avoidance, isoform bias may play a non-repulsive role in mushroom body axonal wiring.
-
Complex RNA Secondary Structures Mediate Mutually Exclusive Splicing of Coleoptera Dscam1
Haiyang Dong,Lei Li,Xiaohua Zhu,Jilong Shi,Ying Fu,Shixin Zhang,Yang Shi,Bingbing Xu,Jian Zhang,Feng Shi,Yongfeng Jin
DOI: https://doi.org/10.3389/fgene.2021.644238
IF: 3.7
2021-01-01
Frontiers in Genetics
Abstract:Mutually exclusive splicing is an important mechanism for expanding protein diversity. An extreme example is the Down syndrome cell adhesion molecule (Dscam1) gene of insects, which contains four clusters of variable exons (exons 4, 6, 9, and 17) and potentially generates tens of thousands of protein isoforms through mutually exclusive splicing, the regulatory mechanisms of which remain elusive. Here, we systematically analyzed the variable exon 4, 6, and 9 clusters of Dscam1 in Coleoptera species. Through comparative genomics and RNA secondary structure prediction, we found apparent evidence that evolutionarily conserved RNA base pairing mediates mutually exclusive splicing in the Dscam1 exon 4 cluster. In contrast to the fly exon 6, most exon 6 selector sequences in Coleoptera species are partially located in the variable exon region. In addition, bidirectional RNA-RNA interactions are predicted to regulate the mutually exclusive splicing of variable exon 9 of Dscam1. Although the docking sites in the exon 4 and 9 clusters are clade specific, the docking site-selector base pairing is conserved at the secondary structure level. In short, our results provide a mechanistic framework for the application of long-range RNA base pairings in regulating the mutually exclusive splicing of Coleoptera Dscam1.
-
Specific Drosophila Dscam Juxtamembrane Variants Control Dendritic Elaboration and Axonal Arborization
Lei Shi,Hung-Hsiang Yu,Jacob S. Yang,Tzumin Lee
DOI: https://doi.org/10.1523/jneurosci.1517-07.2007
2007-01-01
Journal of Neuroscience
Abstract:Drosophila Dscam isoforms are derived from two alternative transmembrane/juxtamembrane domains (TMs) in addition to thousands of ectodomain variants. Using a microRNA-based RNA interference technology, we selectively knocked down different subsets of Dscams containing either the exon 17.1- or exon 17.2-encoding TM. Eliminating Dscam[TM1] reduced Dscam expression but minimally affected postembryonic axonal morphogenesis. In contrast, depleting Dscam[TM2] blocked axon arborization. Further removal of Dscam[TM1] enhanced the loss-of-Dscam[TM2] axonal phenotypes. However, Dscam[TM1] primarily regulates dendritic development, as evidenced by the observations that removing Dscam[TM1] alone impeded elaboration of dendrites and that transgenic Dscam[TM1], but not Dscam[TM2], effectively rescued Dscam mutant dendritic phenotypes in mosaic organisms. These distinct Dscam functions can be attributed to the juxtamembrane regions of TMs that govern dendritic versus axonal targeting of Dscam as well. Together, we suggest that specific Drosophila Dscam juxtamembrane variants control dendritic elaboration and axonal arborization.
-
Expression patterns of dscam and sdk gene paralogs in developing zebrafish retina
Carlos A Galicia,Joshua M Sukeena,Deborah L Stenkamp,Peter G Fuerst
2018-07-19
Abstract:Purpose: The differential adhesion hypothesis states that a cell adhesion code provides cues that direct the specificity of nervous system development. The Down syndrome cell adhesion molecule (DSCAM) and sidekick (SDK) proteins belong to the immunoglobulin superfamily of cell adhesion molecules (CAMs) and provide both attractive and repulsive cues that help to organize the nervous system during development, according to the differential adhesion hypothesis. The zebrafish genome is enriched in dscam and sdk genes, making the zebrafish an excellent model system to further test this hypothesis. The goal of this study is to describe the phylogenetic relationships of the paralogous CAM genes and their spatial expression and co-expression patterns in the embryonic zebrafish retina. Methods: Exon-intron structures, karyotypic locations, genomic context, and amino acid sequences of the zebrafish CAM genes (dscama, dscamb, dscaml1, sdk1a, sdk1b, sdk2a, and sdk2b) were obtained from the Ensembl genome database. The Prosite and SMART programs were used to determine the number and identity of protein domains for each CAM gene. The randomized axelerated maximum likelihood (RaxML) program was used to perform a phylogenetic analysis of the zebrafish CAM genes and orthologs in other vertebrates. A synteny analysis of regions surrounding zebrafish CAM paralogs was performed. Digoxigenin (dig)-labeled cRNA probes for each CAM gene were generated to perform in situ hybridization of retinal cryosections from zebrafish embryos and larvae. Dual in situ hybridization of retinal cryosections from zebrafish larvae was performed with dig- and fluorescein-labeled cRNA probes. Results: We found the studied zebrafish CAM genes encode similar protein domain structures as their corresponding orthologs in mammals and possess similar intron-exon organizations. CAM paralogs were located on different chromosomes. 
Phylogenetic and synteny analyses provided support for zebrafish dscam and sdk2 paralogs having originated during the teleost genome duplication. We found that dscama and dscamb are co-expressed in the ganglion cell layer (GCL) and the basal portion of the inner nuclear layer (INL), with weak expression in the photoreceptor-containing outer nuclear layer (ONL). Of the dscam genes, only dscamb was strongly expressed in ONL. Sdk1a and sdk1b were co-expressed in the GCL and the basal portion of the INL. Sdk2a and sdk2b also showed co-expression in the GCL and basal portion of the INL. All Sdk genes were expressed in the ciliary marginal zone (CMZ). Dual in situ hybridizations revealed alternating patterns of co-expression and exclusive expression for the dscam and sdk1 paralogs in cells of the GCL and the INL. The same alternating pattern was observed between dscam and sdk2 paralogs and between sdk1 and sdk2 paralogs. The expression of dscaml1 was observed in the INL and the GCL, with some cells in the basal portion of the INL showing co-expression of dscaml1 and dscama. Conclusions: These findings suggest that zebrafish dscam and sdk2 paralogs were likely the result of the teleost whole genome duplication and that all CAM duplicates show some differential expression patterns. We also demonstrate that the comparative expression patterns of CAM genes in the zebrafish are distinct from the exclusive expression patterns observed in chick retina, in which retinal ganglion cells express one of the four chick Dscam or Sdk genes only. The patterns in zebrafish are more similar to those of mice, in which co-expression of Dscam and Sdk genes is observed. These findings provide the groundwork for future functional analysis of the roles of the CAM paralogs in zebrafish.
-
A systematic CRISPR screen reveals redundant and specific roles for Dscam1 isoform diversity in neuronal wiring
Haiyang Dong,Xi Yang,Lili Wu,Shixin Zhang,Jian Zhang,Pengjuan Guo,Yiwen Du,Changkun Pan,Ying Fu,Lei Li,Jilong Shi,Yanda Zhu,Hongru Ma,Lina Bian,Bingbing Xu,Guo Li,Feng Shi,Jianhua Huang,Haihuai He,Yongfeng Jin
DOI: https://doi.org/10.1371/journal.pbio.3002197
IF: 9.8
2023-07-07
PLoS Biology
Abstract:Drosophila melanogaster Down syndrome cell adhesion molecule 1 (Dscam1) encodes 19,008 diverse ectodomain isoforms via the alternative splicing of exon 4, 6, and 9 clusters. However, whether individual isoforms or exon clusters have specific significance is unclear. Here, using phenotype–diversity correlation analysis, we reveal the redundant and specific roles of Dscam1 diversity in neuronal wiring. A series of deletion mutations were generated at the endogenous locus harboring the exon 4, 6, or 9 clusters, reducing the repertoire to between 396 and 18,612 potential ectodomain isoforms. Of the 3 types of neurons assessed, dendrite self/non-self discrimination required a minimum number of isoforms (approximately 2,000), independent of exon clusters or isoforms. In contrast, normal axon patterning in the mushroom body and mechanosensory neurons requires many more isoforms that tend to associate with specific exon clusters or isoforms. We conclude that the role of the Dscam1 diversity in dendrite self/non-self discrimination is nonspecifically mediated by its isoform diversity. In contrast, a separate role requires variable domain- or isoform-related functions and is essential for other neurodevelopmental contexts, such as axonal growth and branching. Our findings shed new light on a general principle for the role of Dscam1 diversity in neuronal wiring.
biochemistry & molecular biology,biology
-
Intron-targeted mutagenesis reveals roles for Dscam1 RNA pairing architecture-driven splicing bias in neuronal wiring
Weiling Hong,Jian Zhang,Haiyang Dong,Yang Shi,Hongru Ma,Fengyan Zhou,Bingbing Xu,Ying Fu,Shixin Zhang,Shouqing Hou,Guo Li,Yandan Wu,Shuo Chen,Xiaohua Zhu,Wendong You,Feng Shi,Xiaofeng Yang,Zhefeng Gong,Jianhua Huang,Yongfeng Jin
DOI: https://doi.org/10.1016/j.celrep.2021.109373
IF: 8.8
2021-01-01
Cell Reports
Abstract:Drosophila melanogaster Down syndrome cell adhesion molecule (Dscam1) can generate 38,016 different isoforms through largely stochastic, yet highly biased, alternative splicing. These isoforms are required for nervous functions. However, the functional significance of splicing bias remains unknown. Here, we provide evidence that Dscam1 splicing bias is required for mushroom body (MB) axonal wiring. We generate mutant flies with normal overall protein levels and an identical number of isoforms but global changes in exon 4 and exon 9 isoform bias (DscamΔ4D−/− and DscamΔ9D−/−, respectively). In contrast to DscamΔ4D−/−, DscamΔ9D−/− exhibits remarkable MB defects, suggesting a variable domain-specific requirement for isoform bias. Importantly, changes in isoform bias cause axonal defects but do not influence the self-avoidance of axonal branches. We conclude that, in contrast to the isoform number that provides the molecular basis for neurite self-avoidance, isoform bias may play a role in MB axonal wiring by influencing non-repulsive signaling.
-
Chelicerata sDscam isoforms combine homophilic specificities to define unique cell recognition
Fengyan Zhou,Guozheng Cao,Songjun Dai,Guo Li,Hao Li,Zhu Ding,Shouqing Hou,Bingbing Xu,Wendong You,Gil Wiseglass,Feng Shi,Xiaofeng Yang,Rotem Rubinstein,Yongfeng Jin
DOI: https://doi.org/10.1073/pnas.1921983117
IF: 11.1
2020-09-22
Proceedings of the National Academy of Sciences
Abstract:Significance: Neuronal self-avoidance is a conserved process in vertebrates and invertebrates. In Drosophila, self-avoidance is mediated by the Down syndrome cell adhesion molecule (Dscam1) gene that encodes tens of thousands of proteins through alternative splicing. In vertebrates, an analogous function is performed by ∼60 clustered protocadherins (cPcdh) through promoter choice. Here we use cell aggregation assays to study the binding preferences of ∼100 sDscam proteins in scorpion. We report that while related in sequence to the fly Dscam, the scorpion sDscam adopts a strategy that is similar to that of vertebrate cPcdhs, of combined specificity when coexpressed. Our findings identify sDscams as likely candidates to mediate neuronal self-avoidance in Chelicerata, as well as provide a remarkable example of convergent evolution.
multidisciplinary sciences
-
An RNA architectural locus control region involved in Dscam mutually exclusive splicing
Xuebin Wang,Guoli Li,Yun Yang,Wenfeng Wang,Wenjing Zhang,Huawei Pan,Peng Zhang,Yuan Yue,Hao Lin,Baoping Liu,Jingpei Bi,Feng Shi,Jinping Mao,Yijun Meng,Leilei Zhan,Yongfeng Jin
DOI: https://doi.org/10.1038/ncomms2269
IF: 16.6
2012-01-01
Nature Communications
Abstract:The most striking example of alternative splicing in a Drosophila melanogaster gene is observed in the Down syndrome cell adhesion molecule, which can generate 38,016 different isoforms. RNA secondary structures are thought to direct the mutually exclusive splicing of Down syndrome cell adhesion molecule, but the underlying mechanisms are poorly understood. Here we describe a locus control region that can activate the exon 6 cluster and specifically allow for the selection of only one exon variant in combination with docking site selector sequence interactions. Combining comparative genomic studies of 63 species with mutational analysis reveals that intricate, tandem multi-‘subunit’ RNA structures within the locus control region activate species-appropriate alternative variants. Importantly, strengthening the weak splice sites of the target exon can remove the locus control region dependence. Our findings not only provide a locus control region-dependent mechanism for mutually exclusive splicing, but also suggest a model for the evolution of increased complexity in a long-range RNA molecular machine.
-
Self-avoidance alone does not explain the function of Dscam1 in mushroom body axonal wiring
Haiyang Dong,Pengjuan Guo,Jian Zhang,Lili Wu,Ying Fu,Lei Li,Yanda Zhu,Yiwen Du,Jilong Shi,Shixin Zhang,Guo Li,Bingbing Xu,Lina Bian,Xiaohua Zhu,Wendong You,Feng Shi,Xiaofeng Yang,Jianhua Huang,Yongfeng Jin
DOI: https://doi.org/10.1016/j.cub.2022.05.030
IF: 9.2
2022-07-11
Current Biology
Abstract:Alternative splicing of Drosophila Dscam1 into 38,016 isoforms provides neurons with a unique molecular code for self-recognition and self-avoidance. A canonical model suggests that the homophilic binding of identical Dscam1 isoforms on the sister branches of mushroom body (MB) axons supports segregation with high fidelity, even when only a single isoform is expressed. Here, we generated a series of mutant flies with a single exon 4, 6, or 9 variant, encoding 1,584, 396, or 576 potential isoforms, respectively. Surprisingly, most of the mutants in the latter two groups exhibited obvious defects in the growth, branching, and segregation of MB axonal sister branches. This demonstrates that the repertoires of 396 and 576 Dscam1 isoforms were not sufficient for the normal patterning of axonal branches. Moreover, reducing Dscam1 levels largely reversed the defects caused by reduced isoform diversity, suggesting a functional link between Dscam1 expression levels and isoform diversity. Taken together, these results indicate that canonical self-avoidance alone does not explain the function of Dscam1 in MB axonal wiring.
cell biology,biochemistry & molecular biology,biology
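Note: the isoform counts quoted in the abstract above (1,584, 396, and 576 for single exon 4, 6, or 9 variants, out of 38,016 in total) follow from simple combinatorics. The sketch below reproduces that arithmetic. The per-cluster variant counts (12, 48, 33, plus 2 transmembrane variants) are not given in the abstract; they are assumed here from the commonly cited D. melanogaster Dscam1 annotation, and the function names are purely illustrative.

```python
from math import prod

# Assumed variant counts per alternatively spliced cluster (commonly cited
# D. melanogaster Dscam1 annotation; not stated in the abstract itself).
CLUSTER_VARIANTS = {"exon4": 12, "exon6": 48, "exon9": 33}
# Assumed number of transmembrane (exon 17) variants.
TM_VARIANTS = 2


def ectodomain_isoforms(variants):
    """Mutually exclusive splicing picks one variant per cluster,
    so the repertoire is the product of the cluster sizes."""
    return prod(variants.values())


def single_variant_repertoire(cluster):
    """Repertoire left when one cluster is reduced to a single variant."""
    reduced = dict(CLUSTER_VARIANTS)
    reduced[cluster] = 1
    return ectodomain_isoforms(reduced)


full_ectodomain = ectodomain_isoforms(CLUSTER_VARIANTS)
total_isoforms = full_ectodomain * TM_VARIANTS

print("ectodomain isoforms:", full_ectodomain)
print("total isoforms:", total_isoforms)
for cluster in CLUSTER_VARIANTS:
    print(cluster, "single-variant repertoire:", single_variant_repertoire(cluster))
```

Under these assumed cluster sizes, the products match the figures in the abstract, which is why fixing exon 6 to one variant (leaving 12 × 33 combinations) yields the smallest repertoire of the three mutant groups.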
-
Immune Functions of the Dscam Extracellular Variable Region in Chinese Mitten Crab
Xiao-Li Zhang,Guo-Qing Shen,Xiao-Na Zhang,Yue-Hong Zhao,Wei-Wei Li,Qun Wang
DOI: https://doi.org/10.1016/j.fsi.2023.108850
IF: 4.622
2023-01-01
Fish & Shellfish Immunology
Abstract:In arthropods, there is only a single copy of Down Syndrome Cell Adhesion Molecule (Dscam) in the genome, but it can exist as numerous splice variants. There are three hypervariable exons in the extracellular domain and one hypervariable exon in the transmembrane domain. In Chinese mitten crab (Eriocheir sinensis), exons 4, 6 and 14 can produce 25, 34 and 18 alternative splice variants, respectively. In this study, through Illumina sequencing, we identified additional splice variants for exons 6 and 14, hence there may be > 50,000 Dscam protein variants. Sequencing of exons 4, 6 and 14 showed that alternative splicing was altered after bacterial stimulation. Therefore, we expressed and purified the extracellular variable region of Dscam (EsDscam-Ig1-Ig7). Exons 4.3, 6.46 and 14.18, three variable exons of the recombinant protein, were randomly selected. The functions of EsDscam-Ig1-Ig7 in immune defences of E. sinensis were subsequently explored. EsDscam-Ig1-Ig7 was discovered to bind to both Gram-positive Staphylococcus aureus and Gram-negative Vibrio parahaemolyticus, but it did not exhibit antibacterial activity. By promoting hemocyte phagocytosis and bacterial removal, EsDscam-Ig1-Ig7 can also shield the host from bacterial infection. The findings highlight the immunological activities of Dscam alternative splicing and reveal the potential for many more Dscam isoforms than were previously predicted in E. sinensis.