Efficient Segmentation for Multimedia Semantic Extraction
Efficient Segmentation for Multimedia Semantic Extraction
Disciplines
Computer Sciences (100%)
Keywords
-
Multimedia,
Image segmentation,
Physics-based image analysis,
Video analysis
The aim of the project is to develop efficient image segmentation algorithms for use in the automated extraction of semantics from multimedia data. Due to the immense amount of multimedia data currently accessible on the world wide web, it is incredibly difficult to locate exactly the image or video that one is looking for. Enriching this data with additional layers of automatically generated semantic metadata as well as with artificial intelligence to reason about the (meta)data, is the only conceivable way to easily search through their complex content. However, the automatic extraction of semantically rich metadata from the computationally accessible ("low-level") features poses tremendous scientific and technological challenges. The European Union has recognised these challenges by declaring Semantic-based knowledge systems to be one of the strategic objectives in the Information Society Technologies (IST) thematic area of the European 6th Framework Programme. The PRIP group of the Vienna University of Technology is a member of the MUSCLE (Multimedia Understanding through Semantics, Computation and LEarning) Network of Excellence (NoE) within this programme, which, through exchanges with the MUSCLE partners, will ensure that research carried out in this FWF project will have a Europe-wide impact. The PRIP group intends to concentrate on obtaining the best features possible for use in the extraction of semantics from images and video sequences. For example, a segmentation algorithm which takes all the available physics- based information in a scene into account to segment the scene into actual physical objects (e.g., regions made of the same material) instead of arbitrary regions influenced by lighting and shadow would simplify further stages of semantic extraction immensely. Our innovation will be in the combination of spatial segmentation techniques (e.g., the watershed algorithm) with physics-based segmentation techniques through the use of hierarchies of image partitions. In the process of reaching this goal, we also intend to make significant contributions to the fields of mathematical morphology for colour images and colour texture analysis.
The aim of the project is to develop efficient image segmentation algorithms for use in the automated extraction of semantics from multimedia data. Due to the immense amount of multimedia data currently accessible on the world wide web, it is incredibly difficult to locate exactly the image or video that one is looking for. Enriching this data with additional layers of automatically generated semantic metadata as well as with artificial intelligence to reason about the (meta)data, is the only conceivable way to easily search through their complex content. However, the automatic extraction of semantically rich metadata from the computationally accessible ("low-level") features poses tremendous scientific and technological challenges. The European Union has recognised these challenges by declaring Semantic-based knowledge systems to be one of the strategic objectives in the Information Society Technologies (IST) thematic area of the European 6th Framework Programme. The PRIP group of the Vienna University of Technology is a member of the MUSCLE (Multimedia Understanding through Semantics, Computation and LEarning) Network of Excellence (NoE) within this programme, which, through exchanges with the MUSCLE partners, will ensure that research carried out in this FWF project will have a Europe-wide impact. The PRIP group intends to concentrate on obtaining the best features possible for use in the extraction of semantics from images and video sequences. For example, a segmentation algorithm which takes all the available physics- based information in a scene into account to segment the scene into actual physical objects (e.g., regions made of the same material) instead of arbitrary regions influenced by lighting and shadow would simplify further stages of semantic extraction immensely. Our innovation will be in the combination of spatial segmentation techniques (e.g., the watershed algorithm) with physics-based segmentation techniques through the use of hierarchies of image partitions. In the process of reaching this goal, we also intend to make significant contributions to the fields of mathematical morphology for colour images and colour texture analysis.
- Technische Universität Wien - 100%
- Jean Serra, Ecole Nationale Superieure des Mines de Paris - France
- Dmitry Chetverikov, Hungarian Academy of Sciences - Hungary
Research Output
- 38 Citations
- 3 Publications
-
2009
Title Morphological segmentation on learned boundaries DOI 10.1016/j.imavis.2008.06.012 Type Journal Article Author Hanbury A Journal Image and Vision Computing Pages 480-488 Link Publication -
2007
Title Multi-label image segmentation via max-sum solver* *Research of B. MicuÅ¡Ãk has been supported by FWF-P17189-N04 SESAME and FP6-IST-507752 MUSCLE and research of T. Pajdla by FP6-IST-027787 DIRAC and MSM6840770038 DMCM III grants. DOI 10.1109/cvpr.2007.383230 Type Conference Proceeding Abstract Author MicuÅ¡Ãk B Pages 1-6 Link Publication -
2007
Title Do Colour Interest Points Improve Image Retrieval? DOI 10.1109/icip.2007.4378918 Type Conference Proceeding Abstract Author Stoettinger J