Abstract:In an era where personalized technology is increasingly intertwined with daily life, traditional eye-tracking systems and autofocal glasses face a significant challenge: the need for frequent, user-specific calibration, which impedes their practicality. This study introduces a groundbreaking calibration-free method for estimating focal depth, leveraging machine learning techniques to analyze eye movement features within short sequences. Our approach, distinguished by its innovative use of LSTM networks and domain-specific feature engineering, achieves a mean absolute error (MAE) of less than 10 cm, setting a new focal depth estimation accuracy standard. This advancement promises to enhance the usability of autofocal glasses and pave the way for their seamless integration into extended reality environments, marking a significant leap forward in personalized visual technology.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is the frequent need for user - specific calibration faced by existing autofocal glasses in practical applications. This problem greatly limits the practicality and user experience of these devices. Specifically, traditional eye - tracking systems and autofocal glasses require frequent user - specific calibration, which is not only time - consuming but also affects the convenience of users' daily use. To solve this problem, the author proposes a new machine - learning - based method - Foveal Attention Long Short - Term Memory Network (FOV AL), which can accurately estimate the depth of focus without calibration. By analyzing eye - movement features (such as eye vergence angle data) and combining the LSTM network in deep learning with domain - specific feature engineering, this model achieves a depth - of - focus estimation accuracy with a mean absolute error (MAE) of less than 10 centimeters. This method not only improves the accuracy of autofocal glasses but also significantly enhances their practicality in daily life, enabling them to be seamlessly integrated into the Extended Reality (XR) environment. In addition, this research also aims to overcome the limitations of existing depth - of - focus estimation methods, such as relying on a large amount of calibration, lacking generalization ability, and requiring a large amount of computing resources. By eliminating the need for frequent calibration, the FOV AL model provides new possibilities for the development of personalized vision technology and is expected to significantly improve the vision - correction experience and quality of life of presbyopia patients. ### Formula Summary - **Window Mean Calculation in Anomaly Detection**: \[ W_i=\{\text{GTdepth}_{\text{start}}, \ldots, \text{GTdepth}_{\text{end}}\} \] where: \[ \text{start}=\max\left(i-\frac{\text{window size}}{2}, 0\right) \] \[ \text{end}=\min\left(i+\frac{\text{window size}}{2}+1, N\right) \] \[ \text{mean}(W_i)=\frac{1}{|W_i|} \sum_{j = \text{start}}^{\text{end}}\text{GTdepth}_j \] - **Interquartile Range (IQR) Method**: \[ \text{IQR}=Q3 - Q1 \] \[ Q3 + 1.5\times\text{IQR}<\text{dfcolumn}<Q1 - 1.5\times\text{IQR} \] - **Global Normalization**: \[ X'=\frac{X-\text{Median}(X)}{\text{IQR}(X)} \] - **Individual Normalization**: \[ X'_{s,\text{normalized}}=\frac{X'_s-\text{Median}(X'_s)}{\text{IQR}(X'_s)} \] Through these methods and techniques, this research significantly improves the accuracy and robustness of depth - of - focus estimation, paving the way for the wide application of personalized vision technology.

Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach

Calibration-free Deep Optics for Depth Estimation with Precise Simulation

A Robust Monocular Depth Estimation Framework Based on Light-Weight ERF-Pspnet for Day-Night Driving Scenes

A Learning-Based Method Using Epipolar Geometry for Light Field Depth Estimation

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

Calibration-Free Mobile Eye-Tracking Using Corneal Imaging

Real-Time Gaze Estimation with Online Calibration

Multi-channel Residual Network Model for Accurate Estimation of Spatially-Varying and Depth-Dependent Defocus Kernels.

Depth error correction for plenoptic cameras based on an innovative AR system combining geometrical waveguide and micro-lens array

A Deep Perceptual Measure for Lens and Camera Calibration

3D Gaze Estimation Using RGB-IR Cameras

FloatingFusion: Depth from ToF and Image-stabilized Stereo Cameras

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

DeepMetricEye: Metric Depth Estimation in Periocular VR Imagery

MobiDepth: Real-Time Depth Estimation Using On-Device Dual Cameras.

Aberration-Aware Depth-from-Focus

Autostereoscopic 3D Measurement Based on Adaptive Focus Volume Aggregation

Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models

Accurate Real‐time 3D Gaze Tracking Using a Lightweight Eyeball Calibration

Deep eyes: Joint depth inference using monocular and binocular cues