Multi-task Video Enhancement for Dental Interventions

Efklidis Katsaros,Piotr K. Ostrowski,Krzysztof Włódarczak,Emilia Lewandowska,Jacek Ruminski,Damian Siupka-Mróz,Łukasz Lassmann,Anna Jezierska,Daniel Węsierski
DOI: https://doi.org/10.1007/978-3-031-16449-1_18
2022-10-25
Abstract:A microcamera firmly attached to a dental handpiece allows dentists to continuously monitor the progress of conservative dental procedures. Video enhancement in video-assisted dental interventions alleviates low-light, noise, blur, and camera handshakes that collectively degrade visual comfort. To this end, we introduce a novel deep network for multi-task video enhancement that enables macro-visualization of dental scenes. In particular, the proposed network jointly leverages video restoration and temporal alignment in a multi-scale manner for effective video enhancement. Our experiments on videos of natural teeth in phantom scenes demonstrate that the proposed network achieves state-of-the-art results in multiple tasks with near real-time processing. We release Vident-lab at <a class="link-external link-https" href="https://doi.org/10.34808/1jby-ay90" rel="external noopener nofollow">this https URL</a>, the first dataset of dental videos with multi-task labels to facilitate further research in relevant video processing applications.
Computer Vision and Pattern Recognition,Machine Learning,Image and Video Processing
What problem does this paper attempt to address?