Data-driven perception of neuron point process with unknown unknowns

Ruochen Yang,Gaurav Gupta,Paul Bogdan
DOI: https://doi.org/10.1145/3302509.3311056
2019-04-16
Abstract:Identification of patterns from discrete data time-series for statistical inference, threat detection, social opinion dynamics, brain activity prediction has received recent momentum. In addition to the huge data size, the associated challenges are, for example, (i) missing data to construct a closed time-varying complex network, and (ii) contribution of unknown sources which are not probed. Towards this end, the current work focuses on statistical neuron system model with multi-covariates and unknown inputs. Previous research of neuron activity analysis is mainly concerned with effects from spiking history of the target neuron and the interaction with other neurons in the system while ignoring the influence of unknown stimuli. We propose to use unknown unknowns, which describes the effect of unknown stimuli, undetected neuron activities and all other hidden sources of error. The generalized linear model links neuron spiking behavior with past activities in the ensemble neuron system, as well as the unknown influence. We develop a maximum likelihood estimation method based on fixed-point iteration. The fixed-point iterations converge fast, and besides, the proposed methods can be efficiently parallelized to offer computational advantage especially when the input spiking trains are over long time-horizon. The developed framework provides an intuition into the meaning of having extra degrees-of-freedom in the data to support the need for unknowns. The proposed algorithm is applied to simulated spike trains and on real-world experimental data of mouse somatosensory, mouse retina and cat retina. The implementation shows a successful increase of the model likelihood with respect to the conditional intensity function, and it also reveals the convergence with iterations. Results suggest that the neural connection model with unknown unknowns can efficiently estimate the statistical properties of the process by increasing the network likelihood.
What problem does this paper attempt to address?