Relevance Detection of Ophthalmic Surgery Videos
Relevance Detection of Ophthalmic Surgery Videos
Disciplines
Computer Sciences (100%)
Keywords
-
Multimedia,
Biomedical Engineering,
Surgery Videos,
Computer Vision,
Video Content Analysis,
Machine Learning
Research Proposal: In an interdisciplinary research project, computer scientists and physicians collaborate to develop and evaluate methods for automatic detection of relevant temporal segments in ophthalmic surgery1 videos (OSVs). The main objective is the creation of relevance models allowing to detect video segments that are relevant for educational, scientific, or documentary purposes in medicine. Relevance models are produced by machine learning algorithms, which are trained using OSVs that were annotated by surgeons. Important instances of relevant OSV segments are irregular operation (OP) phases, which deviate from the usual procedure used in quasi-standardized ophthalmic surgeries. The automatic detection and classification of irregularities that occur more frequently and hence allow training of machine learning algorithms, provide an additional benefit for the creation of OSV datasets targeted at medical education or research. The development and evaluation of automatic classifiers of irregularities therefore represents another research objective of this project. Relevance models can be used to compress and store OSVs efficiently. This project will develop and evaluate appropriate methods and algorithms to achieve this goal. Finally, we want to demonstrate that relevant OSV segments are useful for medical research by addressing three specific medical research questions related to a certain type of ophthalmic surgery (cataract) using video analysis. Hypotheses: (1) Regular and irregular OP phases as well as common irregular events in OSVs can be detected and classified automatically by machine learning algorithms. (2) The automatic relevance prediction of video segments enables a significant increase in efficiency for storing and archiving OSVs. (3) Automatic analysis of OSVs facilitates the investigation of quantitative medical research questions related to complication rate and various OP conditions. Methods: Classifiers and compression algorithms are developed using an iterative process comprising design, evaluation and improvement. Quantitative medical research questions are addressed by statistical correlation and variance analysis. Innovation: Currently, automatic analysis of OSVs is a young research field with a small number of publications (mainly concerning phase detection in cataract surgeries). Relevance modelling of OSV segments is a new idea, as is relevance-driven compression of OSVs. Moreover, the use of video analysis to investigate quantitative research questions related to ophthalmic surgeries represents a new approach in medical research. 1 Ophthalmic surgery is concerned with the human eye.
Computer scientists and physicians have collaborated in a multidisciplinary research project with a focus on computer science to develop and evaluate methods for the automatic detection of relevant temporal and spatial content in ophthalmic surgery videos. The main goal of the project OVID ("Relevance Detection in Ophthalmic Surgery Videos") was to model relevance with respect to the use of video segments for medical teaching, research and documentation. For this purpose, mainly artificial intelligence (AI) methods were used and various neural network models were developed and applied for content recognition. This allowed relevance models to be learned automatically by machine learning methods, using surgical videos annotated by surgeons as training data. On the one hand, these relevance models can be used for efficient compression and storage of surgical videos, and on the other hand, they can also be used for automatic recognition of a wide variety of relevant content (e.g. surgical phases, instruments, iris, pupil, the lens used in cataracts, etc.). This makes it possible to capture important sections in an eye operation fully automatically, index them and make them usable for subsequent content-based searches. These models also allow large forensic studies of specific complications over a large number of operations or patients. In the OVID project, it was successfully shown that specific problem situations or complications in videos of cataract operations can be found through the targeted use of artificial intelligence and algorithmic methods of computer science. For example, the occurrence of pupil reactions (unpredictable enlargement/reduction of the pupil, which is accompanied by a dangerous change in the anterior chamber depth) was automatically detected. Furthermore, object detection and tracking in the video made it possible to measure the unfolding time of artificial lenses as well as their stability. The results of the research project have been published in several papers at high-level conferences on medical image analysis, pattern recognition, and multimedia retrieval.
- Klinikum Klagenfurt - 30%
- Universität Klagenfurt - 70%
- Yosuf El-Shabrawi, Klinikum Klagenfurt , associated research partner
Research Output
- 164 Citations
- 23 Publications
-
2021
Title LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos DOI 10.48550/arxiv.2107.00875 Type Preprint Author Ghamsarian N -
2021
Title Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization DOI 10.48550/arxiv.2104.14280 Type Preprint Author Ghamsarian N -
2021
Title Relevance Detection in Cataract Surgery Videos by Spatio- Temporal Action Localization DOI 10.1109/icpr48806.2021.9412525 Type Conference Proceeding Abstract Author Ghamsarian N Pages 10720-10727 Link Publication -
2020
Title Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Neural Networks DOI 10.1145/3394171.3413658 Type Conference Proceeding Abstract Author Ghamsarian N Pages 3577-3585 -
2020
Title Deblurring Cataract Surgery Videos Using a Multi-Scale Deconvolutional Neural Network DOI 10.1109/isbi45749.2020.9098318 Type Conference Proceeding Abstract Author Ghamsarian N Pages 872-876 -
2020
Title Enabling Relevance-Based Exploration of Cataract Videos DOI 10.1145/3372278.3391937 Type Conference Proceeding Abstract Author Ghamsarian N Pages 378-382 -
2021
Title DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos DOI 10.48550/arxiv.2109.05352 Type Preprint Author Ghamsarian N -
2021
Title Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization DOI 10.13140/rg.2.2.14499.58402 Type Other Author Negin Ghamsarian Link Publication -
2023
Title Relevance-Based Compression of Cataract Surgery Videos DOI 10.48550/arxiv.2306.12829 Type Preprint Author Mathá N -
2020
Title Pixel-Based Iris and Pupil Segmentation in Cataract Surgery Videos Using Mask R-CNN DOI 10.1109/isbiworkshops50223.2020.9153367 Type Conference Proceeding Abstract Author Sokolova N Pages 1-4 -
2020
Title Pixel-Based Tool Segmentation in Cataract Surgery Videos with Mask R-CNN DOI 10.1109/cbms49503.2020.00112 Type Conference Proceeding Abstract Author Fox M Pages 565-568 -
2024
Title Predicting Postoperative Intraocular Lens Dislocation in Cataract Surgery via Deep Learning DOI 10.1109/access.2024.3361042 Type Journal Article Author Ghamsarian N Journal IEEE Access Pages 21012-21025 Link Publication -
2021
Title Automatic detection of pupil reactions in cataract surgery videos DOI 10.1371/journal.pone.0258390 Type Journal Article Author Sokolova N Journal PLOS ONE Link Publication -
2021
Title LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos DOI 10.1007/978-3-030-87237-3_8 Type Book Chapter Author Ghamsarian N Publisher Springer Nature Pages 76-86 -
2021
Title ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos DOI 10.48550/arxiv.2109.12448 Type Preprint Author Ghamsarian N -
2021
Title ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos DOI 10.1007/978-3-030-92238-2_33 Type Book Chapter Author Ghamsarian N Publisher Springer Nature Pages 391-402 -
2023
Title Predicting Postoperative Intraocular Lens Dislocation in Cataract Surgery via Deep Learning DOI 10.48550/arxiv.2312.03401 Type Preprint Author Ghamsarian N -
2022
Title Evaluation of Relevance-Driven Compression of Regular Cataract Surgery Videos DOI 10.1109/cbms55023.2022.00083 Type Conference Proceeding Abstract Author Mathá N Pages 429-434 -
2022
Title DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos DOI 10.1007/978-3-031-16443-9_27 Type Book Chapter Author Ghamsarian N Publisher Springer Nature Pages 276-286 -
2022
Title DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos DOI 10.48550/arxiv.2207.01453 Type Preprint Author Ghamsarian N -
2024
Title Predicting postoperative intraocular lens dislocation in cataract surgery via deep learning DOI 10.48350/199061 Type Journal Article Author Ghamsarian Link Publication -
2024
Title DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos DOI 10.48350/189044 Type Conference Proceeding Abstract Author Ghamsarian Link Publication -
2019
Title Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos DOI 10.1007/978-3-030-37734-2_51 Type Book Chapter Author Sokolova N Publisher Springer Nature Pages 626-636