The DoGA consortium expression atlas of promoters and genes in 100 canine tissues
Matthias Hörtenhuber,Marjo K. Hytönen,Abdul Kadir Mukarram,Meharji Arumilli,César L. Araujo,Ileana Quintero,Pernilla Syrjä,Niina Airas,Maria Kaukonen,Kaisa Kyöstilä,Julia Niskanen,Tarja S. Jokinen,Faezeh Mottaghitalab,Işıl Takan,Noora Salokorpi,Amitha Raman,Irene Stevens,Antti Iivanainen,Masahito Yoshihara,Oleg Gusev,Danika Bannasch,Antti Sukura,Jeffrey J. Schoenebeck,DoGA Consortium,Carsten Daub,César L. Araujo,Ileana B. Quintero,Milla Salonen,Riika Sarviaho,Sruthi Hundi,Jenni Puurunen,Sini Sulkama,Sini Karjalainen,Henna Pekkarinen,Ilona Kareinen,Anna Knuuttila,Hanna-Maaria Javela,Laura Tuomisto,Heli Nordgren,Karoliina Hagner,Tarja Jokinen,Kaarel Krjutskov,Auli Saarinen,Rasha Fahad Aljelaify,Fiona Ross,Irene Stevens,Jeffrey J. Schoenebeck,Heini Niinimäki,Marko Haapakoski,Sini Ezer,Shintaro Katayama,Carsten O. Daub,Juha Kere,Hannes Lohi
DOI: https://doi.org/10.1038/s41467-024-52798-1
IF: 16.6
2024-10-22
Nature Communications
Abstract:The dog, Canis lupus familiaris , is an important model for studying human diseases. Unlike many model organisms, the dog genome has a comparatively poor functional annotation, which hampers gene discovery for development, morphology, disease, and behavior. To fill this gap, we established a comprehensive tissue biobank for both the dog and wolf samples. The biobank consists of 5485 samples representing 132 tissues from 13 dogs, 12 dog embryos, and 24 wolves. In a subset of 100 tissues from nine dogs and 12 embryos, we characterized gene expression activity for each promoter, including alternative and novel, i.e., previously not annotated, promoter regions, using the 5' targeting RNA sequencing technology STRT2-seq. We identified over 100,000 promoter region candidates in the recent canine genome assembly, CanFam4, including over 45,000 highly reproducible sites with gene expression and respective tissue enrichment levels. We provide a promoter and gene expression atlas with interactive, open data resources, including a data coordination center and genome browser track hubs. We demonstrated the applicability of Dog Genome Annotation (DoGA) data and resources using multiple examples spanning canine embryonic development, morphology and behavior, and diseases across species.
multidisciplinary sciences