A Survey on Semantic Similarity Measures between Concepts in Health Domain

Abstract

The similarity between biomedical terms/concepts is a very important task for biomedical information extraction and knowledge discovery. The measures and tests are tools used to define how to measure the goodness of ontology or its resources. The semantic similarity measuring techniques can be classified into three classes: first, measuring semantic similarity using ontology/ taxonomy; second, using training corpora and information content and third, combination between them. Some of the semantic similarity measures are based on the path length between the concept nodes as well as the depth of the LCS node in the ontology tree or hierarchy, and these measures assign high similarity when the two concepts are in the lower level of the hierarchy. However, most of the semantic similarity measures can be adopted to be used in health domain (Biomedical Domain). Many experiments have been conducted to check the applicability of these measures. In this paper, we investigate to measure semantic similarity between two concepts within single ontology or multiple ontologies in UMLS Metathesaurus (MeSH, SNOMED-CT, ICD), and compare my results to human experts score by correlation coefficient.

Share and Cite:

Abdelrahman, A. and Kayed, A. (2015) A Survey on Semantic Similarity Measures between Concepts in Health Domain. American Journal of Computational Mathematics, 5, 204-214. doi: 10.4236/ajcm.2015.52017.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Pedersen T., Pakhomov, S.V.S., Patwardhan, S. and Chute, C.G. (2007) Measures of Semantic Similarity and Relatedness in the Biomedical Domain. Journal of Biomedical Informatics, 40, 288-299.
http://dx.doi.org/10.1016/j.jbi.2006.06.004
[2] Al-Mubaid, H. and Nguyen, H.A. (2009) Measuring Semantic Similarity between Biomedical concepts within Multiple Ontologies. IEEE Transactions on Systems, Man, and Cybernetics—Part C: Applications and Reviews, 39, 389-397.
[3] Kayed, A., et al. (2013) Ontology Evaluation: Which Test to Use. 2013 5th International Conference on Computer Science and Information Technology (CSIT), Amman, 27-28 March 2013, 45-48.
http://dx.doi.org/10.1109/csit.2013.6588756
[4] Al-Mubaid, H. and Nguyen, H.A. (2006) A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain. Proceedings of the 28th IEEE, EMBS Annual International Conference, New York, 30 Augugst-3 September 2006, 2713-2717.
http://dx.doi.org/10.1109/iembs.2006.259235
[5] Al-Mubaid, H. and Nguyen, H.A. (2006) Using MEDLINE as Standard Corpus for Measuring Semantic Similarity in the Biomedical Domain. 6th IEEE Symposium on BionInformatics and BioEngineering (BIBE’06), Arlington, 16-18 October 2006, 315-318.
http://dx.doi.org/10.1109/bibe.2006.253295
[6] Tartir, S. (2009) Ontology-Driven Question Answering and Ontology Quality Evaluation. PhD Thesis, University of Georgia, Athens.
[7] Batet, M., Sánchez, D. and Valls, A. (2011) An Ontology-Based Measure to Compute Semantic Similarity in Biomedicine. Journal of Biomedical Informatics, 44, 118-125.
http://dx.doi.org/10.1016/j.jbi.2010.09.002
[8] Garla1, V.N. and Brandt, C. (2012) Semantic Similarity in the Biomedical Domain: An Evaluation across Knowledge Sources. Garla and Brandt BMC Bioinformatics, 13, 261.
http://dx.doi.org/10.1186/1471-2105-13-261
[9] Ivanovic, M. and Budimac, Z. (2014) An Overview of Ontologies and Data Resources in Medical Domains. Expert Systems with Applications, 41, 5158-5166.
http://dx.doi.org/10.1016/j.eswa.2014.02.045
[10] Nguyen, H. and Al-Mubaid, H. (2006) New Semantic Similarity Techniques of Concepts Applied in the Biomedical Domain and WordNet. MS Thesis, University of Houston Clear Lake, Houston.
[11] World Health Organization (2010) International Statistical Classification of Diseases and Related Health Problems. 10th Revision Edition, WHO, Geneva.
[12] Elavarasi, S.A., Akilandeswari, J. and Menaga, K. (2014) A Survey on Semantic Similarity Measure. International Journal of Research in Advent Technology, 2, 389-398.
[13] Batet, M. (2010) Ontology-Based Semantic Clustering. Ph.D. Thesis.
[14] Sanchez, D., Solé-Ribalta, A., Batet, M. and Serratosa, F. (2012) Enabling Semantic Similarity Estimation across Mul-tiple Ontologies: An Evaluation in the Biomedical Domain. Journal of Biomedical Informatics, 45, 141-155.
http://dx.doi.org/10.1016/j.jbi.2011.10.005
[15] Hliaoutakis, A., Varelas, G., Voutsakis, E., Petrakis, E.G.M. and Milios, E. (2006) Information Retrieval by Semantic Similarity. International Journal on Semantic Web and Information Systems, 3, 55-73.
[16] Sathya, D. and Uthayan, K.R. (2011) Proposal for Semantic Metric to Assess the Quality of Ontologies. Proceedings of the 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies, Thuckafay, 21-22 July 2011, 754-756.
http://dx.doi.org/10.1109/ICSCCN.2011.6024651
[17] Lin, D.K. (1998) An Information-Theoretic Definition of Similarity. Proceedings of the 15th International Conference on Machine Learning, Madison, 24-27 July 1998, 296-304.
[18] Al-Mubaid, H. and Nguyen, H.A. (2006) New Ontology-Based Semantic Similarity Measure for the Biomedical Domain. Proceedings of the 2006 IEEE International Conference on Granular Computing, Atlanta, 10-12 May 2006, 623-628.
[19] http://apps.who.int/classifications/icd10/browse/2015/en#/VI

Copyright © 2023 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.