• Skip to content (access key 1)
  • Skip to search (access key 7)
FWF — Austrian Science Fund
  • Go to overview page Discover

    • Research Radar
      • Research Radar Archives 1974–1994
    • Discoveries
      • Emmanuelle Charpentier
      • Adrian Constantin
      • Monika Henzinger
      • Ferenc Krausz
      • Wolfgang Lutz
      • Walter Pohl
      • Christa Schleper
      • Elly Tanaka
      • Anton Zeilinger
    • Impact Stories
      • Verena Gassner
      • Wolfgang Lechner
      • Birgit Mitter
      • Oliver Spadiut
      • Georg Winter
    • scilog Magazine
    • Austrian Science Awards
      • FWF Wittgenstein Awards
      • FWF ASTRA Awards
      • FWF START Awards
      • Award Ceremony
    • excellent=austria
      • Clusters of Excellence
      • Emerging Fields
    • In the Spotlight
      • 40 Years of Erwin Schrödinger Fellowships
      • Quantum Austria
    • Dialogs and Talks
      • think.beyond Summit
    • Knowledge Transfer Events
    • E-Book Library
  • Go to overview page Funding

    • Portfolio
      • excellent=austria
        • Clusters of Excellence
        • Emerging Fields
      • Projects
        • Principal Investigator Projects
        • Principal Investigator Projects International
        • Clinical Research
        • 1000 Ideas
        • Arts-Based Research
        • FWF Wittgenstein Award
      • Careers
        • ESPRIT
        • FWF ASTRA Awards
        • Erwin Schrödinger
        • doc.funds
        • doc.funds.connect
      • Collaborations
        • Specialized Research Groups
        • Special Research Areas
        • Research Groups
        • International – Multilateral Initiatives
        • #ConnectingMinds
      • Communication
        • Top Citizen Science
        • Science Communication
        • Book Publications
        • Digital Publications
        • Open-Access Block Grant
      • Subject-Specific Funding
        • AI Mission Austria
        • Belmont Forum
        • ERA-NET HERA
        • ERA-NET NORFACE
        • ERA-NET QuantERA
        • Alternative Methods to Animal Testing
        • European Partnership BE READY
        • European Partnership Biodiversa+
        • European Partnership BrainHealth
        • European Partnership ERA4Health
        • European Partnership ERDERA
        • European Partnership EUPAHW
        • European Partnership FutureFoodS
        • European Partnership OHAMR
        • European Partnership PerMed
        • European Partnership Water4All
        • Gottfried and Vera Weiss Award
        • LUKE – Ukraine
        • netidee SCIENCE
        • Herzfelder Foundation Projects
        • Quantum Austria
        • Rückenwind Funding Bonus
        • WE&ME Award
        • Zero Emissions Award
      • International Collaborations
        • Belgium/Flanders
        • Germany
        • France
        • Italy/South Tyrol
        • Japan
        • Korea
        • Luxembourg
        • Poland
        • Switzerland
        • Slovenia
        • Taiwan
        • Tyrol–South Tyrol–Trentino
        • Czech Republic
        • Hungary
    • Step by Step
      • Find Funding
      • Submitting Your Application
      • International Peer Review
      • Funding Decisions
      • Carrying out Your Project
      • Closing Your Project
      • Further Information
        • Integrity and Ethics
        • Inclusion
        • Applying from Abroad
        • Personnel Costs
        • PROFI
        • Final Project Reports
        • Final Project Report Survey
    • FAQ
      • Project Phase PROFI
      • Project Phase Ad Personam
      • Expiring Programs
        • Elise Richter and Elise Richter PEEK
        • FWF START Awards
  • Go to overview page About Us

    • Mission Statement
    • FWF Video
    • Values
    • Facts and Figures
    • Annual Report
    • What We Do
      • Research Funding
        • Matching Funds Initiative
      • International Collaborations
      • Studies and Publications
      • Equal Opportunities and Diversity
        • Objectives and Principles
        • Measures
        • Creating Awareness of Bias in the Review Process
        • Terms and Definitions
        • Your Career in Cutting-Edge Research
      • Open Science
        • Open-Access Policy
          • Open-Access Policy for Peer-Reviewed Publications
          • Open-Access Policy for Peer-Reviewed Book Publications
          • Open-Access Policy for Research Data
        • Research Data Management
        • Citizen Science
        • Open Science Infrastructures
        • Open Science Funding
      • Evaluations and Quality Assurance
      • Academic Integrity
      • Science Communication
      • Philanthropy
      • Sustainability
    • History
    • Legal Basis
    • Organization
      • Executive Bodies
        • Executive Board
        • Supervisory Board
        • Assembly of Delegates
        • Scientific Board
        • Juries
      • FWF Office
    • Jobs at FWF
  • Go to overview page News

    • News
    • Press
      • Logos
    • Calendar
      • Post an Event
      • FWF Informational Events
    • Job Openings
      • Enter Job Opening
    • Newsletter
  • Discovering
    what
    matters.

    FWF-Newsletter Press-Newsletter Calendar-Newsletter Job-Newsletter scilog-Newsletter

    SOCIAL MEDIA

    • LinkedIn, external URL, opens in a new window
    • , external URL, opens in a new window
    • Facebook, external URL, opens in a new window
    • Instagram, external URL, opens in a new window
    • YouTube, external URL, opens in a new window

    SCILOG

    • Scilog — The science magazine of the Austrian Science Fund (FWF)
  • elane login, external URL, opens in a new window
  • Scilog external URL, opens in a new window
  • de Wechsle zu Deutsch

  

BMFacts: Knowledge acquisition for a biomedical fact reposito

BMFacts: Knowledge acquisition for a biomedical fact reposito

Jose Antonio Minarro-Gimenez (ORCID: 0000-0001-7221-9843)
  • Grant DOI 10.55776/M1729
  • Funding program Lise Meitner
  • Status ended
  • Start September 15, 2014
  • End November 14, 2016
  • Funding amount € 145,000
  • Project website

Disciplines

Other Human Medicine, Health Sciences (30%); Other Social Sciences (20%); Computer Sciences (50%)

Keywords

    Biomedical facts, Biomedical linked data, Knowledge acquisition, Question answer system, Co-Occurrences Analysis, Biomedical terminologies

Abstract Final report

Information management in biomedical research, health care, and translational medicine would greatly benefit from structured repositories that represent and connect generally accepted biomedical facts. Such fact stores could be used as a knowledge resource to support document retrieval, question answering and decision support systems, in addition to the already established biomedical terminologies and ontologies. The literature database MEDLINE with more than 22 million bibliographic records, is already a comprehensive source of semi- structured biomedical information, especially due to associated rich metadata annotations, using the MeSH indexing vocabulary. This data is linked to other biomedical terminologies and ontologies via the UMLS Metathesaurus. This resource provides, in addition, statistical co- occurrences based on joint corpus annotations, which is a valuable but still underused information source. The proposed project BMFacts pursues to exploit the potential of co- occurrence data and biomedical terminologies in order to infer semantic relations based on statistical associations of annotations in biomedical publications. The inferred facts are characterized by representing context dependent domain knowledge, which goes far beyond what is currently represented in biomedical ontologies or terminologies. In BMFacts, we aim to construct an RDF triplestore of such biomedical facts. It will use Semantic Web technologies and Linked Data to manage and exploit the resulting knowledge base, called Biomedical Facts Repository (BMFR). BMFR uses SNOMED CT codes to identify concepts within inferred triples and proposes new types of binary associations, using and extending relations from the UMLS Semantic Network. We want to investigate the potential of BMFR for supporting the knowledge needed in health related decision support and clinical document retrieval. Linked Data principles will be applied in order to extend the content of the BMFR with external datasets and support translational medicine through the generated triples. The content of BMFR will furthermore be refined in three ways: (i) by comparing to facts from the Linking Open Data (LOD) cloud, (ii) by using additional metadata and extracts from MEDLINE, and (iii) by matching free-text renderings of the predications against large medical reference corpora. After this cleansing process, the BMFR will be benchmarked using two application scenarios: (i) a question answering framework targeting laypersons` information needs regarding diabetes mellitus and related diseases, for which a gold standard exists; and (ii) a clinical query infrastructure on a corpus of anonymised clinical texts, for which a set of user queries and relevance judgements exist.

The goal of BMFacts was to develop and assess methodologies to extract generally acceptable fact-like statements from the biomedical literature database MEDLINE. Such facts could be a valuable knowledge resource to support document retrieval, question answering and decision support. In contrast to content in biomedical terminologies and ontologies, which represent what is universally true (e.g. Lung cancer is always located in the lung), BMFacts aimed at generating contingent knowledge like symptoms / disorder associations, drug indications, side effects, or etiological factors (e.g. lung cancer caused by smoking). MEDLINE with more than 22 million bibliographic records is a rich source for extracting biomedical information, particularly due to annotations of each record with the MeSH (Medical Subject Headings) indexing vocabulary. The BMFacts methodology processes hundreds of millions of MeSH annotations provided by MEDLINE to obtain a list of co-occurring MeSH terms pairs. Then a clustering method is applied to induce suitable predicates that give sense to co-occurring MeSH term pairs. The set of candidate predicates like treats, diagnoses, prevents is taken from an external source, the UMLS Semantic Network and selected by lexical analysis of paper abstracts, also available in MEDLINE. The output is a repository of biomedical facts as simple Subject-Predicate-Object triples. An example would be Insulin; treats; Type-1-Diabetes. The produced repository of biomedical facts was evaluated using a given gold standard from a previous project, in which physicians had manually created a set pf plausible predications on Diabetes mellitus. The evaluation results showed the strengths and weaknesses of the completely unsupervised learning methodology. Dependent of the predicate examined, the precision of the fact repository exhibited a great variation. An in-depth analysis showed that in many cases the co-occurrence of MeSH terms could not be broken down to simple predications but to chains of predications, such as diabetes mellitus causes nephropathy which causes renal insufficiency which is treated by kidney transplant (whereas the found predication would not be correct). Other confounding factors were detected, such as most of the MeSH annotation of database records were related to contextual information and did not constitute the main topic of the related publication. The existence of many false positive predications still limits the usefulness of the resulting knowledge repository. However, the error analysis resulted in several strategies to mitigate these problems and to improve the quality of the repository. Examples are previous filtering of records, as well as the use of supervised machine learning and rules defined by domain experts. This will be subject to follow-up investigations, currently in planning by the BMFacts investigators.

Research institution(s)
  • Medizinische Universität Graz - 100%

Research Output

  • 3 Citations
  • 7 Publications
Publications
  • 2015
    Title Knowledge Extraction from MEDLINE by Combining Clustering with Natural Language Processing.
    Type Journal Article
    Author Miñarro-Giménez J
    Journal AMIA ... Annual Symposium proceedings. AMIA Symposium
    Pages 915-24
  • 2015
    Title Acquiring Plausible Predications from MEDLINE by Clustering MeSH Annotations.
    Type Journal Article
    Author Miñarro-Giménez J
    Journal Studies in health technology and informatics
    Pages 716-20
  • 2015
    Title Acquiring Plausible Predications from MEDLINE by Clustering MeSH Annotations
    DOI 10.3233/978-1-61499-564-7-716
    Type Book Chapter
    Author MiÑArro-GimÉNez Jose Antonio
    Publisher IOS Press
  • 2016
    Title MapReduce in the Cloud: A Use Case Study for Efficient Co-Occurrence Processing of MEDLINE Annotations with MeSH.
    Type Journal Article
    Author Kreuzthaler M
    Journal Studies in health technology and informatics
    Pages 582-6
  • 2016
    Title Publishing Biomedical Predication Repository About MeSH Co-Occurrences in MEDLINE.
    Type Journal Article
    Author Miñarro-Giménez J
    Journal Studies in health technology and informatics
    Pages 765-9
  • 2016
    Title Publishing Biomedical Predication Repository About MeSH Co-Occurrences in MEDLINE
    DOI 10.3233/978-1-61499-678-1-765
    Type Book Chapter
    Author MiÑArro-GimÉNez Jose Antonio
    Publisher IOS Press
  • 2016
    Title MapReduce in the Cloud: A Use Case Study for Efficient Co-Occurrence Processing of MEDLINE Annotations with MeSH
    DOI 10.3233/978-1-61499-678-1-582
    Type Book Chapter
    Author Kreuzthaler Markus
    Publisher IOS Press

Discovering
what
matters.

Newsletter

FWF-Newsletter Press-Newsletter Calendar-Newsletter Job-Newsletter scilog-Newsletter

Contact

Austrian Science Fund (FWF)
Georg-Coch-Platz 2
(Entrance Wiesingerstraße 4)
1010 Vienna

office(at)fwf.ac.at
+43 1 505 67 40

General information

  • Job Openings
  • Jobs at FWF
  • Press
  • Philanthropy
  • scilog
  • FWF Office
  • Social Media Directory
  • LinkedIn, external URL, opens in a new window
  • , external URL, opens in a new window
  • Facebook, external URL, opens in a new window
  • Instagram, external URL, opens in a new window
  • YouTube, external URL, opens in a new window
  • Cookies
  • Whistleblowing/Complaints Management
  • Accessibility Statement
  • Data Protection
  • Acknowledgements
  • IFG-Form
  • Social Media Directory
  • © Österreichischer Wissenschaftsfonds FWF
© Österreichischer Wissenschaftsfonds FWF