UniProt: the universal protein knowledgebase in 2021
The UniProt Consortium,Alex Bateman,Maria-Jesus Martin,Sandra Orchard,Michele Magrane,Rahat Agivetova,Shadab Ahmad,Emanuele Alpi,Emily H Bowler-Barnett,Ramona Britto,Borisas Bursteinas,Hema Bye-A-Jee,Ray Coetzee,Austra Cukura,Alan Da Silva,Paul Denny,Tunca Dogan,ThankGod Ebenezer,Jun Fan,Leyla Garcia Castro,Penelope Garmiri,George Georghiou,Leonardo Gonzales,Emma Hatton-Ellis,Abdulrahman Hussein,Alexandr Ignatchenko,Giuseppe Insana,Rizwan Ishtiaq,Petteri Jokinen,Vishal Joshi,Dushyanth Jyothi,Antonia Lock,Rodrigo Lopez,Aurelien Luciani,Jie Luo,Yvonne Lussi,Alistair MacDougall,Fabio Madeira,Mahdi Mahmoudy,Manuela Menchi,Alok Mishra,Katie Moulang,Andrew Nightingale,Carla Susana Oliveira,Sangya Pundir,Guoying Qi,Shriya Raj,Daniel Rice,Milagros Rodriguez Lopez,Rabie Saidi,Joseph Sampson,Tony Sawford,Elena Speretta,Edward Turner,Nidhi Tyagi,Preethi Vasudev,Vladimir Volynkin,Kate Warner,Xavier Watkins,Rossana Zaru,Hermann Zellner,Alan Bridge,Sylvain Poux,Nicole Redaschi,Lucila Aimo,Ghislaine Argoud-Puy,Andrea Auchincloss,Kristian Axelsen,Parit Bansal,Delphine Baratin,Marie-Claude Blatter,Jerven Bolleman,Emmanuel Boutet,Lionel Breuza,Cristina Casals-Casas,Edouard de Castro,Kamal Chikh Echioukh,Elisabeth Coudert,Beatrice Cuche,Mikael Doche,Dolnide Dornevil,Anne Estreicher,Maria Livia Famiglietti,Marc Feuermann,Elisabeth Gasteiger,Sebastien Gehant,Vivienne Gerritsen,Arnaud Gos,Nadine Gruaz-Gumowski,Ursula Hinz,Chantal Hulo,Nevila Hyka-Nouspikel,Florence Jungo,Guillaume Keller,Arnaud Kerhornou,Vicente Lara,Philippe Le Mercier,Damien Lieberherr,Thierry Lombardot,Xavier Martin,Patrick Masson,Anne Morgat,Teresa Batista Neto,Salvo Paesano,Ivo Pedruzzi,Sandrine Pilbout,Lucille Pourcel,Monica Pozzato,Manuela Pruess,Catherine Rivoire,Christian Sigrist,Karin Sonesson,Andre Stutz,Shyamala Sundaram,Michael Tognolli,Laure Verbregue,Cathy H Wu,Cecilia N Arighi,Leslie Arminski,Chuming Chen,Yongxing Chen,John S Garavelli,Hongzhan Huang,Kati Laiho,Peter McGarvey,Darren A Natale,Karen Ross,C R Vinayaka,Qinghua Wang,Yuqi Wang,Lai-Su Yeh,Jian Zhang,Patrick Ruch,Douglas Teodoro
DOI: https://doi.org/10.1093/nar/gkaa1100
IF: 14.9
2020-11-25
Nucleic Acids Research
Abstract: The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this article, we describe significant updates that we have made over the last two years to the resource. The number of sequences in UniProtKB has risen to approximately 190 million, despite continued work to reduce sequence redundancy at the proteome level. We have adopted new methods of assessing proteome completeness and quality. We continue to extract detailed annotations from the literature to add to reviewed entries and supplement these in unreviewed entries with annotations provided by automated systems such as the newly implemented Association-Rule-Based Annotator (ARBA). We have developed a credit-based publication submission interface to allow the community to contribute publications and annotations to UniProt entries. We describe how UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.
biochemistry & molecular biology