Bayesian Networks based Hybrid Quantum-Classical Machine Learning Approach to Elucidate Gene Regulatory Pathways

Radhakrishnan Balu,Ajinkya Borle
DOI: https://doi.org/10.48550/arXiv.1901.10557
2019-01-23
Abstract:We report a scalable hybrid quantum-classical machine learning framework to build Bayesian networks (BN) that captures the conditional dependence and causal relationships of random variables. The generation of a BN consists of finding a directed acyclic graph (DAG) and the associated joint probability distribution of the nodes consistent with a given dataset. This is a combinatorial problem of structural learning of the underlying graph, starting from a single node and building one arc at a time, that fits a given ensemble using maximum likelihood estimators (MLE). It is cast as an optimization problem that consists of a scoring step performed on a classical computer, penalties for acyclicity and number of parents allowed constraints, and a search step implemented using a quantum annealer. We have assumed uniform priors in deriving the Bayesian network that can be relaxed by formulating the problem as an estimation Dirichlet parameters. We demonstrate the utility of the framework by applying to the problem of elucidating the gene regulatory network for the MAPK/Raf pathway in human T-cells using proteomics data where the concentration of proteins, nodes of the BN, are interpreted as probabilities.
Machine Learning,Quantum Physics
What problem does this paper attempt to address?
This paper aims to solve how to use a hybrid quantum - classical machine - learning framework to construct Bayesian Networks (BNs) in order to reveal conditional dependencies and causal relationships in gene - regulation pathways. Specifically, the paper focuses on using hybrid quantum - classical methods to optimize the learning process of Bayesian network structures, especially for modeling protein - interaction networks in biochemical pathways. This involves finding a Directed Acyclic Graph (DAG) and the joint probability distribution of its nodes from a given dataset, so that the graph can best represent the statistical characteristics in the dataset. ### Main Objectives of the Paper 1. **Develop a Hybrid Quantum - Classical Machine - Learning Framework**: This framework is used to construct Bayesian networks and is especially suitable for modeling protein - activation causal relationships in biochemical pathways. 2. **Optimize the Structure Learning of Bayesian Networks**: By transforming the problem into an optimization problem, use quantum annealing techniques to achieve efficient search and scoring steps on a quantum computer. 3. **Apply to Real - World Biological Data**: The paper demonstrates the application of this framework to the MAPK/Raf signaling pathway in human T - cells, using proteomics data to infer gene - regulation networks. ### Key Technologies and Methods - **Bayesian Networks**: Used to capture conditional dependencies and causal relationships between random variables. - **Quantum Annealing**: As an optimization technique, used to solve combinatorial optimization problems in Bayesian network - structure learning. - **Hybrid Quantum - Classical Methods**: Combine the scoring steps of classical computers and the search steps of quantum computers to improve overall efficiency and accuracy. - **Dirichlet Priors**: Used to estimate parameters in Bayesian networks, assuming a uniform distribution, and plans to relax this assumption in future work. ### Experimental Results - **Data Processing**: Use the R statistical package to discretize protein - concentration data in order to set the Hamiltonian of the annealer. - **Model Construction**: Successfully construct a Bayesian network of the MAPK/Raf signaling pathway containing 8 proteins. - **Performance Evaluation**: Through 30 annealing schedules, with each schedule running 9,000 times, the generated results are basically consistent with the original biological pathway, but there is a relatively high false - positive rate, mainly due to the noise problem of qubits. ### Conclusions The paper demonstrates the feasibility and potential of using the D - Wave quantum annealer in real - world biological problems, especially in the modeling of signaling pathways. Although the current method still has some challenges, such as qubit noise and limited connectivity, with the progress of quantum - computing technology, more complex biological networks can be studied in the future. In addition, the paper also plans to expand to the research of dynamic Bayesian networks and Bayesian neural networks.