A genome-wide association study of Chinese and English language phenotypes in Hong Kong Chinese children

Yu-Ping Lin, Yujia Shi, Ruoyu Zhang, Xiao Xue, Shitao Rao, Liangying Yin, Kelvin Fai Hong Lui, Dora Jue Pan, Urs Maurer, Kwong-Wai Choy, Silvia Paracchini, Catherine McBride, Hon-Cheong So
DOI: https://doi.org/10.1038/s41539-024-00229-7
2024-03-28
npj Science of Learning
Abstract: Dyslexia and developmental language disorders are important learning difficulties. However, their genetic basis remains poorly understood, and most genetic studies were performed on Europeans. There is a lack of genome-wide association studies (GWAS) on literacy phenotypes of Chinese as a native language and English as a second language (ESL) in a Chinese population. In this study, we conducted GWAS on 34 reading/language-related phenotypes in Hong Kong Chinese bilingual children (including both twins and singletons; total N = 1046). We performed association tests at the single-variant, gene, and pathway levels. In addition, we tested genetic overlap of these phenotypes with other neuropsychiatric disorders, as well as cognitive performance (CP) and educational attainment (EA), using polygenic risk score (PRS) analysis. In total, 5 independent loci (LD-clumped at r² = 0.01; MAF > 0.05) reached genome-wide significance (p < 5e-8; filtered by imputation quality metric Rsq > 0.3 and having at least 2 correlated SNPs (r² > 0.5) with p < 1e-3). The loci were associated with a range of language/literacy traits such as Chinese vocabulary, character and word reading, and rapid digit naming, as well as English lexical decision. Several SNPs from these loci mapped to genes that were reported to be associated with EA and other neuropsychiatric phenotypes, such as MANEA and PLXNC1. In PRS analysis, EA and CP showed the most consistent and significant polygenic overlap with a variety of language traits, especially English literacy skills. To summarize, this study revealed the genetic basis of Chinese and English abilities in a group of Chinese bilingual children. Further studies are warranted to replicate the findings.
Keywords: psychology, experimental; education & educational research; neurosciences
What problem does this paper attempt to address?
The paper investigates the genetic basis of Chinese and English language abilities in Hong Kong Chinese children. Specifically, the researchers use genome-wide association studies (GWAS) to identify genetic variants associated with multiple reading and language phenotypes in Chinese (the native language) and English (the second language). The study also tests the genetic overlap between these language phenotypes and other neuropsychiatric disorders, cognitive performance, and educational attainment.

### Research Background

1. **Importance of Learning Difficulties**: Dyslexia and developmental language disorders are important learning difficulties that affect children's academic performance, career development, and socioeconomic status.
2. **Influence of Genetic Factors**: Although these disorders are known to have a genetic basis, the specific genes or variants involved are not yet clear. Most genetic studies have focused on European populations, and research on Chinese populations is lacking.
3. **Complexity of Language and Literacy Skills**: Language and literacy skills involve multiple cognitive and linguistic abilities, such as working memory, rapid naming, and vocabulary knowledge. Both environmental and genetic factors can affect the development of these skills.

### Research Objectives

1. **Explore the Genetic Basis of Chinese and English Language Abilities**: Use GWAS to find genetic variants associated with Chinese and English language abilities.
2. **Test Genetic Overlap**: Use polygenic risk score (PRS) analysis to test the genetic overlap between these language phenotypes and other neuropsychiatric disorders, cognitive performance, and educational attainment.
3. **Provide Reference Data**: Provide a reference for future research, especially genetic research on language and literacy abilities in Chinese populations.

### Main Methods

1. **Sample Selection**: The participants were 1,046 Chinese bilingual children (including twins and singletons) from Hong Kong. Their native language was Cantonese, and their second language was English.
2. **GWAS Analysis**: Association tests at the single-variant, gene, and pathway levels were carried out for 34 reading- and language-related phenotypes.
3. **Polygenic Risk Score Analysis**: PRS analysis was used to test the genetic overlap between these language phenotypes and other neuropsychiatric disorders, cognitive performance, and educational attainment.

### Main Findings

1. **Five Independent Significant Loci Were Discovered**: These loci are associated with multiple language and literacy traits, such as Chinese vocabulary, Chinese character and word reading, and rapid digit naming, as well as English lexical decision.
2. **Association between Gene Expression and Phenotype**: S-PrediXcan and S-MulTiXcan analyses found associations between genetically predicted gene expression in multiple brain regions and specific language phenotypes.
3. **Genetic Overlap**: Educational attainment (EA) and cognitive performance (CP) showed the most consistent and significant polygenic overlap with multiple language traits, especially English literacy skills.

### Conclusion

This study reveals the genetic basis of Chinese and English language abilities in bilingual children in Hong Kong and provides valuable reference data, laying the foundation for further research.
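
### Illustrative Sketch: Polygenic Risk Score

To make the PRS analysis described above more concrete, the following is a minimal sketch of the general idea, not the authors' pipeline. A polygenic score is typically a weighted sum of a person's effect-allele dosages (0, 1, or 2 per SNP), with weights taken from an external GWAS of the base trait (for example EA or CP). The score is then tested for association with the target phenotype. All names (`polygenic_score`, `pearson_r`), the weights, the dosages, and the phenotype values below are hypothetical toy data chosen for illustration only.

```python
from statistics import mean, pstdev


def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages for one individual."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


def standardize(values):
    """Convert values to z-scores using the population standard deviation."""
    mu = mean(values)
    sd = pstdev(values)
    if sd == 0:
        raise ValueError("cannot standardize a constant list")
    return [(v - mu) / sd for v in values]


def pearson_r(xs, ys):
    """Pearson correlation, computed as the mean product of z-scores."""
    zx, zy = standardize(xs), standardize(ys)
    return mean(a * b for a, b in zip(zx, zy))


# Hypothetical per-SNP effect sizes from a base GWAS (toy values).
weights = [0.10, -0.05, 0.20]

# Hypothetical effect-allele dosages for four children (rows) at three SNPs.
genotypes = [
    [0, 1, 2],
    [1, 1, 1],
    [2, 0, 1],
    [2, 2, 2],
]

# Hypothetical target phenotype, e.g. a reading test score (toy values).
phenotype = [10.0, 9.0, 11.0, 13.0]

scores = [polygenic_score(g, weights) for g in genotypes]
r = pearson_r(scores, phenotype)

print("PRS per child:", [round(s, 3) for s in scores])
print("Correlation between PRS and phenotype:", round(r, 3))
```

In practice, a PRS analysis also involves steps this sketch leaves out, such as selecting SNPs by p-value threshold, accounting for linkage disequilibrium, adjusting for covariates, and handling relatedness among twins.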