A Machine Learning Enhanced EMS Mutagenesis Probability Map for Efficient Identification of Causal Mutations in

Zhengyang Guo,Shimin Wang,Yang Wang,Zi Wang,Guangshuo Ou
DOI: https://doi.org/10.1101/2024.02.15.580605
2024-02-20
Abstract:Chemical mutagenesis-driven forward genetic screens are pivotal in unveiling gene functions, yet identifying causal mutations behind phenotypes remains laborious, hindering their high-throughput application. Here, we reveal a non-uniform mutation rate caused by Ethyl Methane Sulfonate (EMS) mutagenesis in the genome, indicating that mutation frequency is influenced by proximate sequence context and chromatin status. Leveraging these factors, we developed a Machine Learning enhanced pipeline to create a comprehensive EMS mutagenesis probability map for the genome. This map operates on the principle that causative mutations are enriched in genetic screens targeting specific phenotypes among random mutations. Applying this map to Whole Genome Sequencing (WGS) data of genetic suppressors that rescue a ciliary kinesin mutant, we successfully pinpointed causal mutations without generating recombinant inbred lines. This methodology can be adapted in other species, offering a scalable approach for identifying causal genes and revitalizing the effectiveness of forward genetic screens.
Genetics
What problem does this paper attempt to address?