P 56 Deep learning derived quantitative Video-NystagmoGraphy using smartphone cameras: DeepVNG
M. Friedrich,J. Taeger,M. Bürklein,J. Hartig,J. Volkmann,C.W. Ip,R. Peach,D. Zeller
DOI: https://doi.org/10.1016/j.clinph.2022.01.087
IF: 4.861
2022-05-01
Clinical Neurophysiology
Abstract:Background: The assessment of eye movements is commonly considered a “window into the brain”. However, sole clinical observation is limited to semiquantitative findings. Recent advances in quantitative oculomotor phenotyping using video-oculography (VOG) promise additional diagnostic and prognostic value. However, methodological complexity and limited availability have hampered broad use of VOG in clinical and outpatient settings. Here, a novel approach for binocular video-nystagmography (VNG), capitalizing on recently deployed artificial intelligence frameworks for markerless pose estimation, is validated. Methods: Standardized 2D-optokinetic stimuli were presented on a smartphone screen and elicited nystagmus was recorded with two methods: first, using a conventional smartphone camera (1920x1080 px, 30 frames per second) mounted frontoparallely on a tripod and second, using gold standard infrared VOG goggles (EyeSeeCam, 188x120 px, 220 frames per second). Using the open-source software DeepLabCut, a recurrent neural network (RCNN) architecture was fine-tuned for tracking of pupils and facial anatomical landmarks. For subsequent comparisons, kinematic parameters (e. g. slow phase velocity (SPV)) of nystagmus were calculated from RCNN- and VOG-derived time series datasets using an instantaneous gradient algorithm. Precision of methods was compared using ANOVA and modified T-statistics, namely two one-sided T-testing (TOST) and one-sample T-tests. Results: In monocular analyses of RCNN-derived data, neither eye nor stimulus direction exerted significant influence on SPV (F (1, 5) = 1.33, p= [DZ1].30 and F (3, 15) = 0.66, p = .59). Furthermore, method did not significantly influence omnidirectional SPV measurements (F (1, 5) = 0.009, p = .78). Finally, TOST results demonstrated equivalence in both horizontal (upper bound T(5) = -3.45, p = .009; lower bound (T(5) = 5.0, p= .002) and vertical (T(5) = -3.06, p = .01; T(5) = 3.35, p = .01) stimulus planes. Conclusions: This proof-of-concept study shows that a deep learning based tool (DeepVNG) can extract characteristics of nystagmus from smartphone video recordings of eye movements elicited by standardized optokinetic stimuli with comparable precision to gold standard VOG. These results may lay the groundwork for broadly available “vestibular event monitors” assisting both caregivers and patients in detection and quantification of nystagmus and may open an avenue for more granular analyses of nystagmus expected to improve its diagnostic value.
neurosciences,clinical neurology