Topic-based crossing-workflow fragment discovery

Zhangbing Zhou, Jinfeng Wen, Yasha Wang, Xiao Xue, Patrick C.K. Hung, Long D. Nguyen
DOI: https://doi.org/10.1016/j.future.2020.05.029
IF: 7.307
2020-01-01
Future Generation Computer Systems
Abstract: With a large and increasing number of scientific workflows publicly accessible in Web repositories, discovering workflow fragments is important for promoting the reuse or repurposing of best practices evidenced in legacy workflows when novel scientific experiments are to be conducted. This paper proposes a novel crossing-workflow fragment discovery mechanism, in which an activity knowledge graph is constructed to capture flat invocation relations between activities and hierarchical parent–child relations between sub-workflows and their corresponding activities. The semantic relevance of activities and sub-workflows is calculated from their representative topics, which are generated by applying the biterm topic model. Given a requirement specified as a workflow fragment template, individual candidate activities or sub-workflows are discovered by considering their semantic relevance and short-document descriptions. Candidate fragments are then constructed by discovering the relations that the activity knowledge graph specifies among candidate activities or sub-workflows. These fragments are evaluated by balancing their structural and semantic similarities. Evaluation results show that the approach discovers appropriate crossing-workflow fragments more accurately than state-of-the-art techniques.
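The ranking step described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the activity names, the topic vectors (which in the paper would come from the biterm topic model), the use of cosine similarity for semantic relevance, the edge-preservation ratio used as structural similarity, and the weight `alpha` that balances the two.

```python
from math import sqrt

# Hypothetical activity knowledge graph with the two relation kinds the
# abstract mentions: flat invocation edges and parent-child edges.
INVOKES = {
    ("fetch_sequences", "align_sequences"),
    ("align_sequences", "build_tree"),
}
PARENT_OF = {
    ("phylogeny", "align_sequences"),
    ("phylogeny", "build_tree"),
}

# Hypothetical topic distributions per activity (illustrative values only).
TOPICS = {
    "fetch_sequences": [0.8, 0.1, 0.1],
    "align_sequences": [0.1, 0.8, 0.1],
    "build_tree": [0.1, 0.1, 0.8],
}


def expand(node):
    """Return the child activities of a sub-workflow, or the node itself."""
    children = sorted(child for parent, child in PARENT_OF if parent == node)
    return children or [node]


def cosine(p, q):
    """Cosine similarity between two topic vectors (assumed relevance measure)."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = sqrt(sum(a * a for a in p)) * sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def semantic_similarity(template_topics, mapping):
    """Mean topic similarity between template slots and mapped activities."""
    sims = [cosine(template_topics[slot], TOPICS[act]) for slot, act in mapping.items()]
    return sum(sims) / len(sims) if sims else 0.0


def structural_similarity(template_edges, mapping):
    """Fraction of template edges that exist as invocation edges in the graph."""
    if not template_edges:
        return 1.0
    kept = sum((mapping[a], mapping[b]) in INVOKES for a, b in template_edges)
    return kept / len(template_edges)


def fragment_score(template_topics, template_edges, mapping, alpha=0.5):
    """Balance structural and semantic similarity with an assumed weight alpha."""
    structural = structural_similarity(template_edges, mapping)
    semantic = semantic_similarity(template_topics, mapping)
    return alpha * structural + (1 - alpha) * semantic


# A two-slot fragment template and two candidate mappings to compare.
template_topics = {"s1": [0.8, 0.1, 0.1], "s2": [0.1, 0.8, 0.1]}
template_edges = [("s1", "s2")]
good = {"s1": "fetch_sequences", "s2": "align_sequences"}
bad = {"s1": "build_tree", "s2": "fetch_sequences"}
good_score = fragment_score(template_topics, template_edges, good)
bad_score = fragment_score(template_topics, template_edges, bad)
```

In this toy setting the mapping that preserves the invocation edge and matches the slot topics scores higher than the one that does neither. The `expand` helper only hints at how a sub-workflow node could be unfolded into its activities before matching.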