Automatic Analysis of Elephant Vocalizations
Automatic Analysis of Elephant Vocalizations
Disciplines
Biology (40%); Electrical Engineering, Electronics, Information Engineering (40%); Computer Sciences (20%)
Keywords
-
African Savannah Elephants,
Multimodal Content-Based Analysis,
Vocal Communication,
Automatic Analysis,
Bioacoustics
The decline of habitat for elephants as a result of expanding human activity combined with increasing elephant numbers in spatially separated clusters of conservation areas is a serious conservation problem in Africa. Nearly 80% of the distributional range of elephants in Southern Africa stretches beyond the borders of officially protected areas. This fact leads to deadly conflicts between humans and elephants. We are a team of biologists and computer scientists who want to alleviate these conflicts by the development of early warning and information systems for humans living near the corridors who regularly get into serious conflict with traveling elephants. Such systems require the robust recognition and categorization of elephant vocalizations. Today, no system exists that fulfills the requirements for automatic recognition of elephant vocalizations under natural conditions. The goals of the proposed project are (i) to investigate the complex vocal communication system of elephants and (ii) to develop automatic techniques for the analysis of elephant calls. The acquired knowledge and the automatic techniques will form the basis for a future autonomous warning and information system. Elephants make extensive use of powerful infrasonic calls which travel distances of up to several kilometers. This qualifies the elephant as a perfect model species for acoustic observation since it is possible to detect elephants even if they are out of sight. We will incorporate visual information into the automatic analyses where improvements can be expected (e.g. at open habitats such as waterholes). We will investigate and evaluate among others automatic statistical methods to detect elephants, estimate group-size from vocalization rates, and classify call types automatically. Another focus will be the manual and automatic analysis of age and gender of the caller due to characteristics in call structure. This will allow us to determine the demography of families and populations. We additionally aim at defining family and bond group specific acoustic signatures and vocal dialects within the study population. The project is proposed for an initial duration of three years and will be conducted by the Department for Cognitive Biology of the University of Vienna in collaboration with the Interactive Media Systems Group, Institute of Software Technology and Interactive Systems, Vienna University of Technology. One 50% post-doc and two 60% doctoral students shall be employed by the project partners. There is a clear definition of responsibilities and exchange of results between the project partners. At the end of the project, a large set of fully annotated audio and video material will be available together with statistical evaluations of the material. Additionally, novel analysis techniques will be available that are able to robustly recognize and classify elephant calls from data obtained under natural conditions.
The decline of habitat for elephants as a result of expanding human activity is a serious conservation problem in Africa and Asia. This fact leads to an increase of the human-elephant conflict that seriously threatens the long-term survival of elephants. The project team of biologists and computer scientists wants to alleviate these conflicts by the development of early warning and information systems for humans who regularly get into serious conflict with elephants. Such systems require the robust recognition and categorization of elephant vocalizations. The main goals of the project entitled Automatic Analysis of Elephant Vocalizations were (1) to investigate the complex vocalizations of elephants and (2) to develop automatic techniques for the analysis of these elephant calls. Elephants make extensive use of powerful infrasonic calls (rumbles) that travel distances of up to several kilometers. This qualifies the elephant as a perfect model species for acoustic observation since it is possible to detect elephants even if they are out of sight. As the main overall project result we present a method for the automated detection of elephant vocalizations (focusing on the low-frequency rumbles) that is robust to the diverse noise sources present in the field. We evaluated the method on data recorded under natural field conditions. The proposed method outperformed existing approaches and accurately detected elephant rumbles. We also investigated natural rumble variability. We found that rumbles vary considerably in acoustic structure due, among others, to differences in sound production mechanisms (oral versus nasal sound emission) and maturational processes (characteristics for age groups). Automatic classification distinguished these structural rumble variants with high accuracy. During our project we further investigated the capabilities of visual elephant detection and developed a first method for visually detecting and tracking elephants in wildlife videos. Our experiments on visual detection confirmed that tracking elephants in wildlife videos is feasible and particularly robust at near distances. We were able to detect and track elephants of different sizes and postures and in different activities. We conclude that a combined acoustic and visual elephant detector is a promising solution because both techniques potentially compensate their weaknesses, which might lead to a more reliable detection in future. We clearly identified the most critical challenges and necessary improvements in order to transfer the promising fundamental results into practical implementation, and conclude that our findings can form the basis for a future automated early warning system for elephants.
- Universität Wien - 51%
- Technische Universität Wien - 49%
- Christian Breiteneder, Technische Universität Wien , associated research partner
Research Output
- 667 Citations
- 13 Publications
-
2011
Title Vocal cues indicate level of arousal in infant African elephant roars DOI 10.1121/1.3605538 Type Journal Article Author Stoeger A Journal The Journal of the Acoustical Society of America Pages 1700-1710 Link Publication -
2013
Title Acoustic detection of elephant presence in noisy environments DOI 10.1145/2509896.2509900 Type Conference Proceeding Abstract Author Zeppelzauer M Pages 3-8 -
2013
Title African and Asian Elephant Vocal Communication: A Cross-Species Comparison DOI 10.1007/978-94-007-7414-8_3 Type Book Chapter Author Stoeger A Publisher Springer Nature Pages 21-39 -
2012
Title Visualizing Sound Emission of Elephant Vocalizations: Evidence for Two Rumble Production Types DOI 10.1371/journal.pone.0048907 Type Journal Article Author Stoeger A Journal PLoS ONE Link Publication -
2014
Title Age-group estimation in free-ranging African elephants based on acoustic cues of low-frequency rumbles DOI 10.1080/09524622.2014.888375 Type Journal Article Author Stoeger A Journal Bioacoustics Pages 231-246 Link Publication -
2012
Title Vocalizations and associated behaviour of Asian elephant (Elephas maximus) calves DOI 10.1163/156853912x648516 Type Journal Article Author Herler A Journal Behaviour Pages 575-599 -
2012
Title An Asian Elephant Imitates Human Speech DOI 10.1016/j.cub.2012.09.022 Type Journal Article Author Stoeger A Journal Current Biology Pages 2144-2148 Link Publication -
2012
Title How Low Can You Go? Physical Production Mechanism of Elephant Infrasonic Vocalizations DOI 10.1126/science.1219712 Type Journal Article Author Herbst C Journal Science Pages 595-599 -
2014
Title Towards an automated acoustic detection system for free-ranging elephants DOI 10.1080/09524622.2014.906321 Type Journal Article Author Zeppelzauer M Journal Bioacoustics Pages 13-29 Link Publication -
2014
Title Vocal learning in elephants: neural bases and adaptive context DOI 10.1016/j.conb.2014.07.001 Type Journal Article Author Stoeger A Journal Current Opinion in Neurobiology Pages 101-107 Link Publication -
2013
Title Complex vibratory patterns in an elephant larynx DOI 10.1242/jeb.091009 Type Journal Article Author Herbst C Journal Journal of Experimental Biology Pages 4054-4064 Link Publication -
2013
Title Automated detection of elephants in wildlife video DOI 10.1186/1687-5281-2013-46 Type Journal Article Author Zeppelzauer M Journal EURASIP Journal on Image and Video Processing Pages 46 Link Publication -
2015
Title Establishing the fundamentals for an elephant early warning and monitoring system DOI 10.1186/s13104-015-1370-y Type Journal Article Author Zeppelzauer M Journal BMC Research Notes Pages 409 Link Publication