MetamORF: a Repository of Unique Short Open Reading Frames Identified by Both Experimental and Computational Approaches for Gene and Metagene Analyses

Sebastien A. Choteau,Audrey Wagner,Philippe Pierre,Lionel Spinelli,Christine Brun
DOI: https://doi.org/10.1093/database/baab032
2021-01-01
Database
Abstract:ABSTRACT The development of high-throughput technologies revealed the existence of non-canonical short open reading frames (sORFs) on most eukaryotic RNAs. They are ubiquitous genetic elements highly conserved across species and suspected to be involved in numerous cellular processes. MetamORF ( http://metamorf.hb.univ-amu.fr/ ) aims to provide a repository of unique sORFs identified in the human and mouse genomes with both experimental and computational approaches. By gathering publicly available sORF data, normalizing it and summarizing redundant information, we were able to identify a total of 1,162,675 unique sORFs. Despite the usual characterization of ORFs as short, upstream or downstream, there is currently no clear consensus regarding the definition of these categories. Thus, the data has been reprocessed using a normalized nomenclature. MetamORF enables new analyses at loci, gene, transcript and ORF levels, that should offer the possibility to address new questions regarding sORF functions in the future. The repository is available through an user-friendly web interface, allowing easy browsing, visualization, filtering over multiple criteria and export possibilities. sORFs could be searched starting from a gene, a transcript, an ORF ID, or looking in a genome area. The database content has also been made available through track hubs at UCSC Genome Browser.
What problem does this paper attempt to address?