BOD: a Customizable Bioinformatics on Demand System Accommodating Multiple Steps and Parallel Tasks.

LA Qiao,J Zhu,QY Liu,T Zhu,C Song,W Lin,GZ Wei,L Mu,J Tao,NM Zhao,GW Yang,XJ Liu
DOI: https://doi.org/10.1093/nar/gkh756
IF: 14.9
2004-01-01
Nucleic Acids Research
Abstract:The integration of bioinformatics resources worldwide is one of the major concerns of the biological community. We herein established the BOD (Bioinformatics on demand) system to use Grid computing technology to set up a virtual workbench via a web-based platform, to assist researchers performing customized comprehensive bioinformatics work. Users will be able to submit entire search queries and computation requests, e.g. from DNA assembly to gene prediction and finally protein folding, from their own office using the BOD end-user web interface. The BOD web portal parses the user's job requests into steps, each of which may contain multiple tasks in parallel. The BOD task scheduler takes an entire task, or splits it into multiple subtasks, and dispatches the task or subtasks proportionally to computation node(s) associated with the BOD portal server. A node may further split and distribute an assigned task to its sub-nodes using a similar strategy. In the end, the BOD portal server receives and collates all results and returns them to the user. BOD uses a pipeline model to describe the user's submitted data and stores the job requests/status/results in a relational database. In addition, an XML criterion is established to capture task computation program details.
What problem does this paper attempt to address?