Integrated Database of Force-Field Parameters, Experimental Measurements and Molecular Dynamics Simulations

Pavel Banas, Vojtech Mlynsky, David Ciz, Radek Furmanek, Nestor Pilat, Viktoria Pauw, Stephan Hachinger, Jiri Sponer, Jan Martinovic, Michal Otyepka
DOI: https://doi.org/10.1101/2024.12.03.626554
2024-12-04
Abstract: Molecular dynamics (MD) simulation is a vital theoretical tool for exploring nucleic acids (RNA, DNA), proteins, and other (bio)molecular systems, and it generates vast amounts of data daily. Efficient storage and possible reuse of this data remain persistent challenges. Here, we introduce IDA (Integrated DAtabase of force fields and datasets from experiments and MD simulations), an innovative database scheme for datasets from various types of MD simulations. IDA supports outputs from different MD approaches, i.e., standard MD simulations, importance sampling techniques, simulated annealing, and other enhanced sampling methods including replica-exchange simulations. IDA also houses a collection of molecule-specific force fields (FFs) and experimental datasets. Uploaded MD outputs, FFs, and experimental data are integrated in a standardized format, allowing efficient data mining and extraction of valuable insights from the extensive data generated by diverse MD simulations. With the data and metadata holdings of IDA, and the prospective assignment of persistent identifiers, our work aims to take key steps towards making MD data FAIR (findable, accessible, interoperable, reusable).
Bioinformatics