Skip to main content

Toward Automating an Inference Model on Unstructured Terminologies: OXMIS Case Study

  • Conference paper
  • First Online:
Advances in Computational Biology

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 680))

Abstract

Most modern biomedical vocabularies employ some hierarchical representation that provides a “broader/narrower” meaning relationship among the “codes” or “concepts” found within them. Often, however, we may find within the clinical setting the creation and curation of unstructured custom vocabularies used in the everyday practice of classifying and categorizing clinical data and findings.

A significant and widely used example of this lies in the General Practice Research Database which makes use of the Oxford Medical Information Systems (OXMIS) coding scheme to represent drugs and medical conditions. This scheme is intrinsically unstructured, is generally regarded as disorganized, and is not amenable to comparison with other hierarchically structured medical coding schemes. To improve processes of data analysis and extraction, we define a semantically meaningful representation of the OXMIS codes by way of the Unified Medical Language System (UMLS) Metathesaurus. A structure-imposing ontology mapping is created, and this process provides a complete illustration of a general semantic mapping technique applicable to unstructured biomedical terminologies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
eBook
USD 219.00
Price excludes VAT (USA)
Softcover Book
USD 279.99
Price excludes VAT (USA)
Hardcover Book
USD 379.99
Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    General Practice Research Database (GPRD) is maintained by the (UK) National Health Service Information Authority.

  2. 2.

    UMLS Metathesaurus is a project of the (US) National Library of Medicine, Department of Health and Human Services. Available at: http://www.nlm.nih.org/research/umls/.

  3. 3.

    From now on, we will refer to the collective set of codes found in the GPRD as Read-OXMIS. The designation refers to the combination (OXMIS and Read version 2) of coding schemes found in this particular database’s medical records.

  4. 4.

    SNOMED CT is copyrighted by the International Health Terminology Standards Organization (IHTSDO). ICD-9 refers to ICD-9, CM the International Classification of Diseases, 9th Revision, Clinical Modification. ICD-10 is copyrighted by the World Health Organization and developed by the National Center for Health Statistics. Current Procedural Terminology (CPT) is copyright the American Medical Association. The Clinical Terms Version 3 (Read Codes) are maintained by the (UK) National Health Service Information Authority.

References

  1. Perry J ed: OXMIS Problem Codes for Primary Medical Care. Oxford, Headington, 1978.

    Google Scholar 

  2. Jones R, Latinovic R, Charlton J, Gulliford M. Physical and Psychological Co-morbidity in Irritable Bowel Syndrome: A Matched Cohort Study Using the General Practice Research Database. Alimentary Pharmacology & Therapeutics 24 (5), 2006, pp. 879–886.

    Article  CAS  Google Scholar 

  3. Rahm E, Bernstein PA. A Survey of Approaches to Automatic Schema Matching. The VDLB Journal 10, 2001, pp. 334–350.

    Google Scholar 

  4. Willard Dallas. Why Semantic Ascent Fails. Metaphilosophy 14 (3 & 4), July/October 1983, pp. 276–290.

    Google Scholar 

  5. Soriano, Maier, Visick, Pride. Validation of General Practitioner-Diagnosed COPD in the UK General Practiec Research Database. European Journal of Epidemiology 17 (12), 2001, pp. 1075–1080.

    Google Scholar 

  6. Bodenreider O. Using UMLS semantics for classification purposes. Proceedings of AMIA Symposium, 2000, pp. 86–90.

    Google Scholar 

  7. Mork P, Bernstein PA. Adapting a Generic Match Algorithm to Align Ontologies of Human Anatomy. In: 20th International Conference on Data Engineering; 2004 March 30–April 2; Boston, MA: IEEE, 2004.

    Google Scholar 

  8. Jensen LS, Martinez T. Improving text classification by using conceptual and contextual features. KDD-2000 Workshop on Text Mining, Boston, 2000, pp. 101–102.

    Google Scholar 

  9. El-Nasan A, Veeramachaneni S, Nagy G. Word Discrimination Based on Bigram Co-Occurrences. ICDAR, p. 0149, Sixth International Conference on Document Analysis and Recognition (ICDAR’01), 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jeffery L. Painter .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this paper

Cite this paper

Painter, J.L. (2010). Toward Automating an Inference Model on Unstructured Terminologies: OXMIS Case Study. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_71

Download citation

Publish with us

Policies and ethics