Abstract
Most modern biomedical vocabularies employ some hierarchical representation that provides a “broader/narrower” meaning relationship among the “codes” or “concepts” found within them. Often, however, we may find within the clinical setting the creation and curation of unstructured custom vocabularies used in the everyday practice of classifying and categorizing clinical data and findings.
A significant and widely used example of this lies in the General Practice Research Database which makes use of the Oxford Medical Information Systems (OXMIS) coding scheme to represent drugs and medical conditions. This scheme is intrinsically unstructured, is generally regarded as disorganized, and is not amenable to comparison with other hierarchically structured medical coding schemes. To improve processes of data analysis and extraction, we define a semantically meaningful representation of the OXMIS codes by way of the Unified Medical Language System (UMLS) Metathesaurus. A structure-imposing ontology mapping is created, and this process provides a complete illustration of a general semantic mapping technique applicable to unstructured biomedical terminologies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
General Practice Research Database (GPRD) is maintained by the (UK) National Health Service Information Authority.
- 2.
UMLS Metathesaurus is a project of the (US) National Library of Medicine, Department of Health and Human Services. Available at: http://www.nlm.nih.org/research/umls/.
- 3.
From now on, we will refer to the collective set of codes found in the GPRD as Read-OXMIS. The designation refers to the combination (OXMIS and Read version 2) of coding schemes found in this particular database’s medical records.
- 4.
SNOMED CT is copyrighted by the International Health Terminology Standards Organization (IHTSDO). ICD-9 refers to ICD-9, CM the International Classification of Diseases, 9th Revision, Clinical Modification. ICD-10 is copyrighted by the World Health Organization and developed by the National Center for Health Statistics. Current Procedural Terminology (CPT) is copyright the American Medical Association. The Clinical Terms Version 3 (Read Codes) are maintained by the (UK) National Health Service Information Authority.
References
Perry J ed: OXMIS Problem Codes for Primary Medical Care. Oxford, Headington, 1978.
Jones R, Latinovic R, Charlton J, Gulliford M. Physical and Psychological Co-morbidity in Irritable Bowel Syndrome: A Matched Cohort Study Using the General Practice Research Database. Alimentary Pharmacology & Therapeutics 24 (5), 2006, pp. 879–886.
Rahm E, Bernstein PA. A Survey of Approaches to Automatic Schema Matching. The VDLB Journal 10, 2001, pp. 334–350.
Willard Dallas. Why Semantic Ascent Fails. Metaphilosophy 14 (3 & 4), July/October 1983, pp. 276–290.
Soriano, Maier, Visick, Pride. Validation of General Practitioner-Diagnosed COPD in the UK General Practiec Research Database. European Journal of Epidemiology 17 (12), 2001, pp. 1075–1080.
Bodenreider O. Using UMLS semantics for classification purposes. Proceedings of AMIA Symposium, 2000, pp. 86–90.
Mork P, Bernstein PA. Adapting a Generic Match Algorithm to Align Ontologies of Human Anatomy. In: 20th International Conference on Data Engineering; 2004 March 30–April 2; Boston, MA: IEEE, 2004.
Jensen LS, Martinez T. Improving text classification by using conceptual and contextual features. KDD-2000 Workshop on Text Mining, Boston, 2000, pp. 101–102.
El-Nasan A, Veeramachaneni S, Nagy G. Word Discrimination Based on Bigram Co-Occurrences. ICDAR, p. 0149, Sixth International Conference on Document Analysis and Recognition (ICDAR’01), 2001.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this paper
Cite this paper
Painter, J.L. (2010). Toward Automating an Inference Model on Unstructured Terminologies: OXMIS Case Study. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_71
Download citation
DOI: https://doi.org/10.1007/978-1-4419-5913-3_71
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-5912-6
Online ISBN: 978-1-4419-5913-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)