Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach

Benedikt W. Hosp,Björn Severitt,Rajat Agarwala,Evgenia Rusak,Yannick Sauer,Siegfried Wahl
2024-08-07
Abstract:In an era where personalized technology is increasingly intertwined with daily life, traditional eye-tracking systems and autofocal glasses face a significant challenge: the need for frequent, user-specific calibration, which impedes their practicality. This study introduces a groundbreaking calibration-free method for estimating focal depth, leveraging machine learning techniques to analyze eye movement features within short sequences. Our approach, distinguished by its innovative use of LSTM networks and domain-specific feature engineering, achieves a mean absolute error (MAE) of less than 10 cm, setting a new focal depth estimation accuracy standard. This advancement promises to enhance the usability of autofocal glasses and pave the way for their seamless integration into extended reality environments, marking a significant leap forward in personalized visual technology.
Computer Vision and Pattern Recognition,Artificial Intelligence,Human-Computer Interaction,Machine Learning,Signal Processing
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the frequent need for user - specific calibration faced by existing autofocal glasses in practical applications. This problem greatly limits the practicality and user experience of these devices. Specifically, traditional eye - tracking systems and autofocal glasses require frequent user - specific calibration, which is not only time - consuming but also affects the convenience of users' daily use. To solve this problem, the author proposes a new machine - learning - based method - Foveal Attention Long Short - Term Memory Network (FOV AL), which can accurately estimate the depth of focus without calibration. By analyzing eye - movement features (such as eye vergence angle data) and combining the LSTM network in deep learning with domain - specific feature engineering, this model achieves a depth - of - focus estimation accuracy with a mean absolute error (MAE) of less than 10 centimeters. This method not only improves the accuracy of autofocal glasses but also significantly enhances their practicality in daily life, enabling them to be seamlessly integrated into the Extended Reality (XR) environment. In addition, this research also aims to overcome the limitations of existing depth - of - focus estimation methods, such as relying on a large amount of calibration, lacking generalization ability, and requiring a large amount of computing resources. By eliminating the need for frequent calibration, the FOV AL model provides new possibilities for the development of personalized vision technology and is expected to significantly improve the vision - correction experience and quality of life of presbyopia patients. ### Formula Summary - **Window Mean Calculation in Anomaly Detection**: \[ W_i=\{\text{GTdepth}_{\text{start}}, \ldots, \text{GTdepth}_{\text{end}}\} \] where: \[ \text{start}=\max\left(i-\frac{\text{window size}}{2}, 0\right) \] \[ \text{end}=\min\left(i+\frac{\text{window size}}{2}+1, N\right) \] \[ \text{mean}(W_i)=\frac{1}{|W_i|} \sum_{j = \text{start}}^{\text{end}}\text{GTdepth}_j \] - **Interquartile Range (IQR) Method**: \[ \text{IQR}=Q3 - Q1 \] \[ Q3 + 1.5\times\text{IQR}<\text{dfcolumn}<Q1 - 1.5\times\text{IQR} \] - **Global Normalization**: \[ X'=\frac{X-\text{Median}(X)}{\text{IQR}(X)} \] - **Individual Normalization**: \[ X'_{s,\text{normalized}}=\frac{X'_s-\text{Median}(X'_s)}{\text{IQR}(X'_s)} \] Through these methods and techniques, this research significantly improves the accuracy and robustness of depth - of - focus estimation, paving the way for the wide application of personalized vision technology.