Regularly updated benchmark sets for statistically correct evaluations of AlphaFold applications

Laszlo Dobson,Gabor E. Tusnady,Peter Tompa
DOI: https://doi.org/10.1101/2024.08.02.606297
2024-12-09
Abstract:AlphaFold2 changed structural biology by providing high-quality structure predictions for all possible proteins. Since its inception, a plethora of applications were built on AlphaFold2, expediting discoveries in virtually all areas related to protein science. In many cases, however, optimism seems to have made scientists forget about data leakage, a serious issue that needs to be addressed when evaluating machine learning methods. Here we provide a rigorous benchmark set that can be used in a broad range of applications built around AlphaFold2/3.
Bioinformatics
What problem does this paper attempt to address?