Projectdetail

Grant DOI 10.55776/P31486
Funding program Principal Investigator Projects
Status ended
Start October 1, 2018
End September 30, 2022
Funding amount € 379,635
Project website

Disciplines

Computer Sciences (100%)

Keywords

Multimedia, Biomedical Engineering, Surgery Videos, Computer Vision, Video Content Analysis, Machine Learning

Abstract

Final report

Research Proposal: In an interdisciplinary research project, computer scientists and physicians collaborate to develop and evaluate methods for automatic detection of relevant temporal segments in ophthalmic surgery1 videos (OSVs). The main objective is the creation of relevance models allowing to detect video segments that are relevant for educational, scientific, or documentary purposes in medicine. Relevance models are produced by machine learning algorithms, which are trained using OSVs that were annotated by surgeons. Important instances of relevant OSV segments are irregular operation (OP) phases, which deviate from the usual procedure used in quasi-standardized ophthalmic surgeries. The automatic detection and classification of irregularities that occur more frequently and hence allow training of machine learning algorithms, provide an additional benefit for the creation of OSV datasets targeted at medical education or research. The development and evaluation of automatic classifiers of irregularities therefore represents another research objective of this project. Relevance models can be used to compress and store OSVs efficiently. This project will develop and evaluate appropriate methods and algorithms to achieve this goal. Finally, we want to demonstrate that relevant OSV segments are useful for medical research by addressing three specific medical research questions related to a certain type of ophthalmic surgery (cataract) using video analysis. Hypotheses: (1) Regular and irregular OP phases as well as common irregular events in OSVs can be detected and classified automatically by machine learning algorithms. (2) The automatic relevance prediction of video segments enables a significant increase in efficiency for storing and archiving OSVs. (3) Automatic analysis of OSVs facilitates the investigation of quantitative medical research questions related to complication rate and various OP conditions. Methods: Classifiers and compression algorithms are developed using an iterative process comprising design, evaluation and improvement. Quantitative medical research questions are addressed by statistical correlation and variance analysis. Innovation: Currently, automatic analysis of OSVs is a young research field with a small number of publications (mainly concerning phase detection in cataract surgeries). Relevance modelling of OSV segments is a new idea, as is relevance-driven compression of OSVs. Moreover, the use of video analysis to investigate quantitative research questions related to ophthalmic surgeries represents a new approach in medical research. 1 Ophthalmic surgery is concerned with the human eye.

Computer scientists and physicians have collaborated in a multidisciplinary research project with a focus on computer science to develop and evaluate methods for the automatic detection of relevant temporal and spatial content in ophthalmic surgery videos. The main goal of the project OVID ("Relevance Detection in Ophthalmic Surgery Videos") was to model relevance with respect to the use of video segments for medical teaching, research and documentation. For this purpose, mainly artificial intelligence (AI) methods were used and various neural network models were developed and applied for content recognition. This allowed relevance models to be learned automatically by machine learning methods, using surgical videos annotated by surgeons as training data. On the one hand, these relevance models can be used for efficient compression and storage of surgical videos, and on the other hand, they can also be used for automatic recognition of a wide variety of relevant content (e.g. surgical phases, instruments, iris, pupil, the lens used in cataracts, etc.). This makes it possible to capture important sections in an eye operation fully automatically, index them and make them usable for subsequent content-based searches. These models also allow large forensic studies of specific complications over a large number of operations or patients. In the OVID project, it was successfully shown that specific problem situations or complications in videos of cataract operations can be found through the targeted use of artificial intelligence and algorithmic methods of computer science. For example, the occurrence of pupil reactions (unpredictable enlargement/reduction of the pupil, which is accompanied by a dangerous change in the anterior chamber depth) was automatically detected. Furthermore, object detection and tracking in the video made it possible to measure the unfolding time of artificial lenses as well as their stability. The results of the research project have been published in several papers at high-level conferences on medical image analysis, pattern recognition, and multimedia retrieval.

Research institution(s)

Klinikum Klagenfurt - 30%
Universität Klagenfurt - 70%

Project participants

Yosuf El-Shabrawi, Klinikum Klagenfurt , associated research partner

Research Output

164 Citations
23 Publications

Publications

Title	LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos
DOI	10.48550/arxiv.2107.00875
Type	Preprint
Author	Ghamsarian N

Title	Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization
DOI	10.48550/arxiv.2104.14280
Type	Preprint
Author	Ghamsarian N

Title	Relevance Detection in Cataract Surgery Videos by Spatio- Temporal Action Localization
DOI	10.1109/icpr48806.2021.9412525
Type	Conference Proceeding Abstract
Author	Ghamsarian N
Pages	10720-10727
Link	Publication

Title	Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Neural Networks
DOI	10.1145/3394171.3413658
Type	Conference Proceeding Abstract
Author	Ghamsarian N
Pages	3577-3585

Title	Deblurring Cataract Surgery Videos Using a Multi-Scale Deconvolutional Neural Network
DOI	10.1109/isbi45749.2020.9098318
Type	Conference Proceeding Abstract
Author	Ghamsarian N
Pages	872-876

Title	Enabling Relevance-Based Exploration of Cataract Videos
DOI	10.1145/3372278.3391937
Type	Conference Proceeding Abstract
Author	Ghamsarian N
Pages	378-382

Title	DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
DOI	10.48550/arxiv.2109.05352
Type	Preprint
Author	Ghamsarian N

Title	Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization
DOI	10.13140/rg.2.2.14499.58402
Type	Other
Author	Negin Ghamsarian
Link	Publication

Title	Relevance-Based Compression of Cataract Surgery Videos
DOI	10.48550/arxiv.2306.12829
Type	Preprint
Author	Mathá N

Title	Pixel-Based Iris and Pupil Segmentation in Cataract Surgery Videos Using Mask R-CNN
DOI	10.1109/isbiworkshops50223.2020.9153367
Type	Conference Proceeding Abstract
Author	Sokolova N
Pages	1-4

Title	Pixel-Based Tool Segmentation in Cataract Surgery Videos with Mask R-CNN
DOI	10.1109/cbms49503.2020.00112
Type	Conference Proceeding Abstract
Author	Fox M
Pages	565-568

Title	Predicting Postoperative Intraocular Lens Dislocation in Cataract Surgery via Deep Learning
DOI	10.1109/access.2024.3361042
Type	Journal Article
Author	Ghamsarian N
Journal	IEEE Access
Pages	21012-21025
Link	Publication

Title	Automatic detection of pupil reactions in cataract surgery videos
DOI	10.1371/journal.pone.0258390
Type	Journal Article
Author	Sokolova N
Journal	PLOS ONE
Link	Publication

Title	LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos
DOI	10.1007/978-3-030-87237-3_8
Type	Book Chapter
Author	Ghamsarian N
Publisher	Springer Nature
Pages	76-86

Title	ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos
DOI	10.48550/arxiv.2109.12448
Type	Preprint
Author	Ghamsarian N

Title	ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos
DOI	10.1007/978-3-030-92238-2_33
Type	Book Chapter
Author	Ghamsarian N
Publisher	Springer Nature
Pages	391-402

Title	Predicting Postoperative Intraocular Lens Dislocation in Cataract Surgery via Deep Learning
DOI	10.48550/arxiv.2312.03401
Type	Preprint
Author	Ghamsarian N

Title	Evaluation of Relevance-Driven Compression of Regular Cataract Surgery Videos
DOI	10.1109/cbms55023.2022.00083
Type	Conference Proceeding Abstract
Author	Mathá N
Pages	429-434

Title	DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
DOI	10.1007/978-3-031-16443-9_27
Type	Book Chapter
Author	Ghamsarian N
Publisher	Springer Nature
Pages	276-286

Title	DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
DOI	10.48550/arxiv.2207.01453
Type	Preprint
Author	Ghamsarian N

Title	Predicting postoperative intraocular lens dislocation in cataract surgery via deep learning
DOI	10.48350/199061
Type	Journal Article
Author	Ghamsarian
Link	Publication

Title	DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
DOI	10.48350/189044
Type	Conference Proceeding Abstract
Author	Ghamsarian
Link	Publication

Title	Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos
DOI	10.1007/978-3-030-37734-2_51
Type	Book Chapter
Author	Sokolova N
Publisher	Springer Nature
Pages	626-636

Go to overview page Discover

Go to overview page Funding

Go to overview page About Us

Go to overview page News

Relevance Detection of Ophthalmic Surgery Videos

Relevance Detection of Ophthalmic Surgery Videos

Disciplines

Keywords

Research Output

Contact

General information

Go to overview page Discover

Go to overview page Funding

Go to overview page About Us

Go to overview page News

SOCIAL MEDIA

SCILOG

Relevance Detection of Ophthalmic Surgery Videos

Relevance Detection of Ophthalmic Surgery Videos

Disciplines

Keywords

Research Output