Embedded Human Computation for Knowledge Extraction and Evaluation
ERA-Net: CHIST-ERA
Disciplines
Computer Sciences (100%)
Keywords
- Human Computation
- Natural Language Processing
- Knowledge Resource Acquisition
- Ontology Engineering
- Open Evaluation Methods
- Heterogeneous Web Data
The rapid growth and fragmented character of social media and publicly available structured data challenge established approaches to knowledge extraction. Many algorithms fail when they encounter noisy, multilingual and contradictory input. Efforts to increase the reliability and scalability of these algorithms face a lack of suitable training data and gold standards. Given that humans excel at interpreting contradictory and context-dependent evidence, the uComp project will address the above-mentioned shortcomings by merging collective human intelligence and automated methods in a symbiotic fashion. The project will build upon the emerging field of Human Computation (HC) in the tradition of games with a purpose and crowdsourcing marketplaces. It will advance the field of Web Science by developing a scalable and generic HC framework for knowledge extraction and evaluation, delegating the most challenging tasks to large communities of users and continuously learning from their feedback to optimise automated methods as part of an iterative process. A major contribution is the proposed foundational research on Embedded Human Computation (EHC), which will advance and integrate the currently disjoint research fields of human and machine computation. EHC goes beyond mere data collection and embeds the HC paradigm into adaptive knowledge extraction workflows. An open evaluation campaign will validate the accuracy and scalability of EHC to acquire factual and affective knowledge. In addition to novel evaluation methods, uComp will also provide shared datasets and benchmark EHC against established knowledge processing frameworks. While uComp methods will be generic and evaluated across domains, climate change was chosen as the main use case for its challenging nature, subject to fluctuating and often conflicting interpretations.
Collaborating with international organisations such as EEA, NOAA and NASA will increase impact, provide a rich stream of input data, attract and retain a critical mass of users, and promote the adoption of EHC among a wide range of stakeholders.
The rapid growth and fragmented character of social media and publicly available structured data challenge established approaches to knowledge extraction. Many algorithms fail when they encounter noisy, multilingual and contradictory input. Efforts to increase the reliability and scalability of these algorithms face a lack of suitable training data and gold standards. Given that humans excel at interpreting contradictory and context-dependent evidence, the uComp project addressed the above-mentioned shortcomings by merging collective human intelligence and automated methods in a symbiotic fashion. The project advanced the field of Human Computation (HC), using a combination of games with a purpose and crowdsourcing marketplaces such as CrowdFlower. At the same time, uComp contributed to Web Science by developing a scalable and generic HC framework for knowledge extraction and evaluation, delegating the most challenging tasks to large user communities and continuously learning from their feedback to optimise automated methods as part of an iterative process.
uComp developed an Embedded Human Computation (EHC) framework, which goes beyond mere data collection and embeds the HC paradigm into adaptive knowledge extraction workflows. The project validated the accuracy and scalability of EHC to acquire factual and affective knowledge. While uComp methods are generic and applicable across domains, the topic of climate change was chosen as the main use case for its challenging nature, subject to fluctuating and often conflicting interpretations.
Active collaboration between MODUL University Vienna's Department of New Media Technology and international partner organisations had a positive impact on the achievable scientific progress and helped to increase the international visibility of the project. The list of partner organisations includes the core partners of the uComp project itself, the consortium of DecarboNet.eu, a European research initiative within the EU 7th Framework Programme, the Climate Program Office of the U.S. National Oceanic & Atmospheric Administration (NOAA), and the United Nations Environment Programme (UNEP).
Three public Web applications showcased the results of this collaborative work: (1) the uComp Language Quiz to acquire multilingual language resources, (2) the Climate Challenge to gather social perceptions and create collective awareness of environmental issues, and (3) the Media Watch on Climate Change as a central knowledge repository including a visual dashboard to analyse the extracted factual and affective knowledge. These applications provided a rich stream of input data, attracted and engaged a critical mass of participants, and promoted the adoption of EHC among a wide range of stakeholders.
- Modul Univ. Vienna - 100%
- Patrick Paroubek, The Computer Sciences Laboratory for Mechanics and Engineering Sciences - France
- Wim Peters, University of Sheffield
Research Output
- 188 Citations
- 15 Publications
- 1 Spinout
- 2014. Scharl A. Visualizing Contextual Information in Aggregated Web Content Repositories. Conference Proceeding Abstract, pp. 114-118. DOI: 10.1109/laweb.2014.18
- 2014. Weichselbraun A. Enriching semantic knowledge bases for opinion mining in big data applications. Journal Article, Knowledge-Based Systems, pp. 78-85. DOI: 10.1016/j.knosys.2014.04.039
- 2013. Gindl S. Rule-based opinion target and aspect extraction to acquire affective knowledge. Conference Proceeding Abstract, pp. 557-564. DOI: 10.1145/2487788.2487994
- 2013. Scharl A. Extracting knowledge from the web and social media for progress monitoring in public outreach and science communication. Conference Proceeding Abstract, pp. 121-124. DOI: 10.1145/2526188.2526219
- 2013. Sabou M. Crowdsourced Knowledge Acquisition: Towards Hybrid-Genre Workflows. Journal Article, International Journal on Semantic Web and Information Systems (IJSWIS), pp. 14-41. DOI: 10.4018/ijswis.2013070102
- 2013. Weichselbraun A. Knowledge capture from multiple online sources with the extensible web retrieval toolkit (eWRT). Conference Proceeding Abstract, pp. 129-132. DOI: 10.1145/2479832.2479861
- 2013. Sabou M. Games with a Purpose or Mechanised Labour? Conference Proceeding Abstract, pp. 1-8. DOI: 10.1145/2494188.2494210
- 2016. Scharl A. Application Design and Engagement Strategy of a Game with a Purpose for Climate Change Awareness. Book Chapter, Springer Nature, pp. 97-104. DOI: 10.1007/978-3-319-45982-0_9
- 2016. Scharl A. Analyzing the public discourse on works of fiction – Detection and visualization of emotion in online coverage about HBO’s Game of Thrones. Journal Article, Information Processing & Management, pp. 129-138. DOI: 10.1016/j.ipm.2015.02.003
- 2014. Scharl A. Visualizing Knowledge Along Semantic and Geographic Dimensions: A Web Intelligence Platform to Explore Climate Change Coverage. Book Chapter, Springer Nature, pp. 423-441. DOI: 10.1007/978-1-4471-6470-8_19
- 2016. Weichselbraun A. Extracting Opinion Targets from Environmental Web Coverage and Social Media Streams. Conference Proceeding Abstract, pp. 1040-1048. DOI: 10.1109/hicss.2016.133
- 2014. Sabou M. Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines. Conference Proceeding Abstract, Proceedings of the 9th Language Resources & Evaluation Conference (LREC-2014).
- 2014. Fischl D. Metadata Enriched Visualization of Keywords in Context. Conference Proceeding Abstract, Sixth ACM SIGCHI Symposium on Engineering Interactive Computing Systems (EICS-2014).
- 2014. Fischl D. Metadata enriched visualization of keywords in context. Conference Proceeding Abstract, pp. 193-196. DOI: 10.1145/2607023.2611451
- 2015. Herring D et al. Climate Challenge - Raising Collective Awareness in the Tradition of Games with a Purpose. Conference Proceeding Abstract, Proceedings of the 14th Brazilian Symposium on Human Factors in Computer Systems (IHC-2015).