An introduction to probabilistic graphical models

Michael I Jordan
2003-06-30
Abstract:A graphical model can be thought of as a probabilistic database, a machine that can answer" queries" regarding the values of sets of random variables. We build up the database in pieces, using probability theory to ensure that the pieces have a consistent overall interpretation. Probability theory also justifies the inferential machinery that allows the pieces to be put together" on the fly" to answer queries.Consider a set of random variables {1 2 n} and let Üi represent the realization of random variable i. Each random variable may be scalar-valued or vector-valued. Thus Üi is in general a vector in a vector space. In this section, for concreteness, we assume that the random variables are discrete; in general, however, we make no such restriction. There are several kinds of query that we might be interested in making regarding such an ensemble. We might, for example, be interested in knowing whether one subset of variables is independent of another, or whether one subset of variables is conditionally independent of another subset of variables given a third subset. Or we might be interested in calculating conditional probabilities-the probabilities of one subset of variables given the values of another subset of variables. Still other kinds of queries will be described in later chapters. In principle all such queries can be answered if we have in hand the joint probability distribution, written Č (1= Ü1 2= Ü2 n= Ün). Questions regarding independence can be answered by factoring the joint probability distribution, and questions regarding conditional probabilities can be answered by appropriate marginalization and normalization operations. To simplify our …
What problem does this paper attempt to address?