UniProt: the Universal Protein Knowledgebase in 2025

The UniProt Consortium,Alex Bateman,Maria-Jesus Martin,Sandra Orchard,Michele Magrane,Aduragbemi Adesina,Shadab Ahmad,Emily H Bowler-Barnett,Hema Bye-A-Jee,David Carpentier,Paul Denny,Jun Fan,Penelope Garmiri,Leonardo Jose da Costa Gonzales,Abdulrahman Hussein,Alexandr Ignatchenko,Giuseppe Insana,Rizwan Ishtiaq,Vishal Joshi,Dushyanth Jyothi,Swaathi Kandasaamy,Antonia Lock,Aurelien Luciani,Jie Luo,Yvonne Lussi,Juan Sebastian Martinez Marin,Pedro Raposo,Daniel L Rice,Rafael Santos,Elena Speretta,James Stephenson,Prabhat Totoo,Nidhi Tyagi,Nadya Urakova,Preethi Vasudev,Kate Warner,Supun Wijerathne,Conny Wing-Heng Yu,Rossana Zaru,Alan J Bridge,Lucila Aimo,Ghislaine Argoud-Puy,Andrea H Auchincloss,Kristian B Axelsen,Parit Bansal,Delphine Baratin,Teresa M Batista Neto,Marie-Claude Blatter,Jerven T Bolleman,Emmanuel Boutet,Lionel Breuza,Blanca Cabrera Gil,Cristina Casals-Casas,Kamal Chikh Echioukh,Elisabeth Coudert,Beatrice Cuche,Edouard de Castro,Anne Estreicher,Maria L Famiglietti,Marc Feuermann,Elisabeth Gasteiger,Pascale Gaudet,Sebastien Gehant,Vivienne Gerritsen,Arnaud Gos,Nadine Gruaz,Chantal Hulo,Nevila Hyka-Nouspikel,Florence Jungo,Arnaud Kerhornou,Philippe Le Mercier,Damien Lieberherr,Patrick Masson,Anne Morgat,Salvo Paesano,Ivo Pedruzzi,Sandrine Pilbout,Lucille Pourcel,Sylvain Poux,Monica Pozzato,Manuela Pruess,Nicole Redaschi,Catherine Rivoire,Christian J A Sigrist,Karin Sonesson,Shyamala Sundaram,Anastasia Sveshnikova,Cathy H Wu,Cecilia N Arighi,Chuming Chen,Yongxing Chen,Hongzhan Huang,Kati Laiho,Minna Lehvaslaiho,Peter McGarvey,Darren A Natale,Karen Ross,C R Vinayaka,Yuqi Wang,Jian Zhang
DOI: https://doi.org/10.1093/nar/gkae1010
IF: 14.9
2024-11-24
Nucleic Acids Research
Abstract:The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication, we describe ongoing changes to our production pipeline to limit the sequences available in UniProtKB to high-quality, non-redundant reference proteomes. We continue to manually curate the scientific literature to add the latest functional data and use machine learning techniques. We also encourage community curation to ensure key publications are not missed. We provide an update on the automatic annotation methods used by UniProtKB to predict information for unreviewed entries describing unstudied proteins. Finally, updates to the UniProt website are described, including a new tab linking protein to genomic information. In recognition of its value to the scientific community, the UniProt database has been awarded Global Core Biodata Resource status.
biochemistry & molecular biology
What problem does this paper attempt to address?