Abstract:

OBJECTIVE: Genotype imputation is a commonly used technique that infers untyped variants in a study's genotype data, allowing better identification of causal variants in disease studies. However, because studies of Caucasian populations are overrepresented, the genetic basis of health outcomes in other ethnic populations is poorly understood. Facilitating imputation of missing key predictor variants that can potentially improve health-outcome risk prediction models, specifically for Asian ancestry, is therefore highly relevant.

METHODS: We aimed to construct an imputation and analysis web platform that primarily facilitates, but is not limited to, genotype imputation in East Asians. The goal is to provide a collaborative, publicly accessible imputation platform on which researchers can conduct accurate genotype imputation rapidly and efficiently.

RESULTS: We present an online genotype imputation platform, the Multi-ethnic Imputation System (MI-System) (https://misystem.cgm.ntu.edu.tw/), which offers users 3 established pipelines for imputation analyses: SHAPEIT2-IMPUTE2, SHAPEIT4-IMPUTE5, and Beagle5.1. In addition to the 1000 Genomes and HapMap3 panels, a new customized Taiwan Biobank (TWB) reference panel, created specifically for Taiwanese-Chinese ancestry, is provided. MI-System further offers functions to create customized reference panels for imputation, conduct quality control, split whole-genome data into chromosomes, and convert genome builds.

CONCLUSION: Users can upload their genotype data and perform imputation with minimal effort and resources. The utility functions can also be used to preprocess user-uploaded data with a few clicks. MI-System potentially contributes to Asian-population genetics research while eliminating the need for high-performance computational resources and bioinformatics expertise. It will enable an increased pace of research and provide a knowledge base on genetic carriers of complex diseases, thereby greatly enhancing patient-driven research.

STATEMENT OF SIGNIFICANCE: The Multi-ethnic Imputation System (MI-System) primarily facilitates, but is not limited to, imputation in East Asians through 3 established prephasing-imputation pipelines: SHAPEIT2-IMPUTE2, SHAPEIT4-IMPUTE5, and Beagle5.1. Users can upload their genotype data and perform imputation and other utility functions with minimal effort and resources. A new customized Taiwan Biobank (TWB) reference panel, created specifically for Taiwanese-Chinese ancestry, is provided. Utility functions allow users to (a) create customized reference panels, (b) conduct quality control, (c) split whole-genome data into chromosomes, and (d) convert genome builds. Users can also combine 2 reference panels within the system and use the combined panel as the reference for imputation in MI-System.
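To make one of the utility functions concrete, the following is a minimal sketch of splitting genome-wide variant data into per-chromosome chunks, which is typically needed because imputation pipelines are run one chromosome at a time. It is not MI-System's implementation, which the abstract does not describe. It assumes tab-delimited VCF-style text input, and the function name `split_vcf_by_chromosome` and the example records are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def split_vcf_by_chromosome(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Group VCF-style text lines by chromosome (hypothetical helper).

    Header lines (starting with '#') are copied into every chromosome chunk
    so that each chunk is a self-describing file on its own.
    """
    header: List[str] = []
    records: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        if line.startswith("#"):
            header.append(line)
            continue
        # The first tab-delimited column of a VCF record is the chromosome.
        chrom = line.split("\t", 1)[0]
        records[chrom].append(line)
    return {chrom: header + recs for chrom, recs in records.items()}


# Made-up example records, for illustration only.
example = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1",
    "1\t1000\trs1\tA\tG\t.\tPASS\t.\tGT\t0/1",
    "2\t2000\trs2\tC\tT\t.\tPASS\t.\tGT\t1/1",
    "1\t1500\trs3\tG\tA\t.\tPASS\t.\tGT\t0/0",
]
per_chrom = split_vcf_by_chromosome(example)
for chrom, chunk in per_chrom.items():
    # Subtract the two header lines to count variant records only.
    print(chrom, len(chunk) - len([l for l in chunk if l.startswith("#")]))
```

Each resulting chunk could then be written to its own file and submitted to a per-chromosome imputation run. Real data would normally be compressed and indexed, which this sketch deliberately ignores.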

Genotype Imputation and Reference Panel: A Systematic Evaluation on Haplotype Size and Diversity

A Combined Reference Panel from the 1000 Genomes and UK10K Projects Improved Rare Variant Imputation in European and Chinese Samples

A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study

Genotype imputation accuracy and the quality metrics of the minor ancestry in multi-ancestry reference panels

Performance of Genotype Imputation for Low Frequency and Rare Variants from the 1000 Genomes

Genotype Imputation of Metabochip SNPs Using a Study‐Specific Reference Panel of ∼4,000 Haplotypes in African Americans from the Women's Health Initiative

Imputation accuracy across global human populations

MaCH-admix: Genotype Imputation for Admixed Populations

A Panel of Ancestry Informative Markers to Estimate and Correct Potential Effects of Population Stratification in Han Chinese

Ancestry informative SNP panels for discriminating the major East Asian populations: Han Chinese, Japanese and Korean

Comprehensive Structural Variant Haplotype Panel of 943 Han Chinese from Long-Read Whole-Genome Sequencing

Comparing the effect of imputation reference panel composition in four distinct Latin American cohorts

Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel

NyuWa Genome resource: A deep whole-genome sequencing-based variation profile and reference panel for the Chinese population

The ChinaMAP reference panel for the accurate genotype imputation in Chinese populations

A resampling-based approach to share reference panels

A multi‐ethnic reference panel to impute HLA classical and non‐classical class I alleles in admixed samples: Testing imputation accuracy in an admixed sample from Brazil

Multi-ethnic Imputation System (MI-System): A genotype imputation server for high-dimensional data

A genotype imputation reference panel specific for native Southeast Asian populations