Deep Bidirectional Intelligence: AlphaZero, Deep IA-search, Deep IA-infer, and TPC Causal Learning

Lei Xu
DOI: https://doi.org/10.1186/s40535-018-0052-y
2018-01-01
Applied Informatics
Abstract:This paper starts at a brief review on AlphaGoZero, Q learning, and Monte-Carlo tree search (MCTS), in a comparison with decades ago studies on A* search and CNneim-A that was proposed in 1986 and shares a scouting technique similar to one used in MCTS. Then, we combine the strengths of AlphaGoZero and CNneim-A, resulting in a family named deep IA-search that consists of Deep Scout A*, Deep CNneim-A, Deep Bi-Scout A*, and V-AlphaGoZero, as well as their extensions. Moreover, relation between search and reasoning motivates to extend deep IA-search to Deep IA-Infer for implementing reasoning. Especially, another early study (Xu and Pearl, Structuring causal tree models with continuous variables. In: Proceedings of the 3rd annual conference on uncertainty in artificial intelligence, pp 170–179 1987) on structuring causal tree is developed into a three phase causal learning approach, namely topology identification, parameter reestimation, and causal \(\rho\)-tree search on a casual \(\rho\)-diagram that is defined by a set of pairwise correlation coefficient \(\rho\). Algorithms are sketched for discovering casual topologies of triplets, stars, and trees, as well as some topologies of casual \(\rho\)-Directed Acyclic Graph (DAG), e.g. ones for Yule–Simpson’s paradox, Pearl’s Sprinkler DAG, and Back door DAG. Furthermore, the classic Boolean SAT problem is extended into one \(\rho\)-SAT problem, and the roles of four fundamental mechanisms in an intelligent system are elaborated, with insights on integrating these mechanisms to encode not only variables but also how they are organised, as well as on why deep networks are preferred while extra depth is unnecessary.
What problem does this paper attempt to address?