Biomedical Ontology Matching Through Attention-Based Bidirectional Long Short-Term Memory Network

Xingsi Xue, Chao Jiang, Jie Zhang, Cong Hu

Source Title: Journal of Database Management (JDM) 32(4)

DOI: 10.4018/JDM.2021100102


Abstract

A biomedical ontology formally defines biomedical entities and their relationships. However, the same biomedical entity may be defined in diverse contexts in different biomedical ontologies, which gives rise to the problem of biomedical semantic heterogeneity. Bridging this semantic gap requires determining the mappings between heterogeneous biomedical entities, a task known as biomedical ontology matching. Because biomedical entities carry rich semantic meaning and can be represented flexibly, producing high-quality alignments remains an open challenge. To address this challenge, this work treats biomedical ontology matching as a binary classification problem and presents a matching technique based on an attention-based bidirectional long short-term memory network (At-BLSTM), which is able to capture the semantic and contextual features of biomedical entities. In the experiments, comparisons with state-of-the-art approaches show the effectiveness of the proposal.

Introduction

The Semantic Web (SW) (Bonatti et al., 2019; d’Amato, 2020; Noura et al., 2019; Osman, 2021), which makes it convenient for people to link and process diverse data, has attracted growing attention from researchers. Ontology (Storey, 2017; Wand & Weber, 2017; Verdonck & Gailly, 2018; Kazi & Kazi, 2019; Kuster, 2020) is the SW’s kernel technique, and a biomedical ontology formally defines biomedical entities and their relationships. However, the same biomedical entity may be defined in diverse contexts or with different terms in different biomedical ontologies, resulting in the problem of biomedical semantic heterogeneity. To solve this problem, it is vital to determine the mappings among heterogeneous entities to bridge the semantic gaps, a task known as biomedical ontology matching.

Since it is unrealistic to determine the mappings manually when the ontologies are very large, various (semi)automatic ontology matching techniques (Xue & Chen, 2020a; Xue & Wang, 2015b) have been proposed. Evolutionary Algorithms (EA) (Huang et al., 2011; Pan et al., 2020; Liu, 2020) and Machine Learning (ML) (Chen, 2018; Chen et al., 2020; Lin et al., 2020a; Lin et al., 2020c) have been applied successfully in a variety of applications. EA-based and ML-based ontology matching techniques are likewise regarded as promising approaches, e.g., Compact Interactive Memetic Algorithm (CIMA) (Xue & Liu, 2017), Uniform Compact Genetic Algorithm (UCGA) (Jiang & Xue, 2021), Decision Tree (DT) (Amrouch et al., 2016), Logistic Regression (LR) (Alboukaey & Joukhadar, 2018), and Support Vector Machine (SVM) (Mao et al., 2011). An ML-based method for matching ontologies was first proposed in which the similarity measure is expressed by a joint probability distribution of the entities involved (Doan et al., 2004). Mao et al. treated the ontology matching problem as a binary classification problem and addressed it with a non-instance learning-based ontology mapping approach using SVM (Mao et al., 2011). Khoudja et al. adopted a neural network to integrate several top-ranked ontology matchers and thereby enhance the quality of the alignment (Khoudja et al., 2020). However, these matching techniques cannot determine high-quality alignments because of the rich semantic meaning and flexible representation of biomedical entities. In addition, several models (Santos et al., 2020; Harrow et al., 2020; Lin et al., 2020b), while effective for ontology matching or sequence labeling tasks, are neither designed for nor well suited to matching biomedical ontologies. Hence, the biomedical ontology matching problem remains an open challenge in terms of the alignment’s quality.

To address this challenge, a matching technique based on an Attention-based Bidirectional Long Short-Term Memory Network (At-BLSTM) is proposed, which makes use of the semantic relationships of entities to find the mappings. By introducing the attention mechanism and the bidirectional idea into LSTM, At-BLSTM is able to connect the future and past contexts of entity pairs and to focus on their most significant parts, which enhances the accuracy of the model. In addition, the proposal further improves the alignment’s quality by introducing the character embedding technique, which takes into account the semantic and context information of entities. With these components, At-BLSTM is capable of determining superior mappings for the biomedical ontology matching problem.
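
The preview does not give the network equations, so the following is only a minimal Python sketch of three ideas named above, under stated assumptions: framing matching as binary classification over entity pairs, encoding entity labels at the character level, and attention pooling over per-step hidden states. The function names, the toy labels, and the scoring form (a learned vector applied to tanh of each hidden state, followed by softmax) are assumptions chosen for illustration, not the authors' implementation. The hidden states are placeholders standing in for the output of a bidirectional LSTM, which is not implemented here.

```python
import math
from itertools import product


def char_indices(label, vocab):
    """Map each character of an entity label to an integer id (0 = unknown)."""
    return [vocab.get(ch, 0) for ch in label.lower()]


def labelled_pairs(source_labels, target_labels, reference):
    """Binary classification framing: every cross-ontology pair is one sample,
    labelled 1 if it appears in the reference alignment and 0 otherwise."""
    return [((s, t), 1 if (s, t) in reference else 0)
            for s, t in product(source_labels, target_labels)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, w):
    """Assumed attention form: score_t = w . tanh(h_t), alpha = softmax(score),
    pooled = sum_t alpha_t * h_t. Returns (alpha, pooled)."""
    scores = [sum(wi * math.tanh(hi) for wi, hi in zip(w, h))
              for h in hidden_states]
    alphas = softmax(scores)
    dim = len(hidden_states[0])
    pooled = [sum(a * h[d] for a, h in zip(alphas, hidden_states))
              for d in range(dim)]
    return alphas, pooled


# Toy labels and a toy reference alignment (hypothetical, not from the paper).
vocab = {ch: i + 1 for i, ch in enumerate("abcdefghijklmnopqrstuvwxyz ")}
samples = labelled_pairs(["myocardium"],
                         ["cardiac muscle", "heart valve"],
                         {("myocardium", "cardiac muscle")})
for (s, t), y in samples:
    print(s, "|", t, "->", y, char_indices(s, vocab)[:5])

# Placeholder hidden states for three time steps of dimension two.
alphas, pooled = attention_pool([[0.2, -0.1], [0.9, 0.4], [0.1, 0.0]],
                                w=[1.0, 1.0])
print(alphas, pooled)
```

In a full model, the pooled vector would feed a classifier that outputs the probability that the entity pair is a match, and the weight vector `w` would be learned during training rather than fixed by hand.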
