Novel Megaptera novaeangliae (Humpback whale) haplotype chromosome-level reference genome

Maria-Vittoria Carminati, Vlonjat Lonnie Gashi, Ruiqi Li, Daniel Jacob Klee, Sara Rose Padula, Ajay Manish Patel, Andy Dick Yee Tan, Jacqueline Mattos, Nolan Kane
DOI: https://doi.org/10.1038/s41597-024-03922-9
2024-10-11
Scientific Data
Abstract: Sequencing of a kidney sample (KW2013002) from a stranded Megaptera novaeangliae (Humpback whale) calf produced the first chromosome-level reference genome for this species [1]. The calf, a 457 cm, 2,500 lb male, was found stranded in Hawai'i Kai, HI, in 2013 and was recorded as abandoned/orphaned. In 2023, 1 g of kidney tissue was analyzed using PacBio long-read DNA sequencing, chromatin conformation capture (Hi-C), RNA sequencing, and mitochondrial sequencing to comprehensively characterize the genome and transcriptome of M. novaeangliae. Data validation includes a synteny analysis, mitochondrial annotation, and a comparison of BUSCO scores (scaffold vs. reference genome, and Balaenoptera musculus (Blue whale) vs. M. novaeangliae). BUSCO analysis was also performed on an M. novaeangliae scaffold-level assembly to assess the genomic completeness of the reference genome: the scaffold-level assembly scored 91.2%, versus 95.4% for the reference genome. Synteny analysis was performed against the B. musculus genome to determine chromosome-level coverage and structure. Further, a time-based phylogenetic tree was constructed using the sequenced data and publicly available genomes.
multidisciplinary sciences