Auditing complex concepts of SNOMED using a refined hierarchical abstraction network

Yue Wang, Michael Halper, Duo Wei, Huanying Gu, Yehoshua Perl, Junchuan Xu, Gai Elhanan, Yan Chen, Kent A. Spackman, James Case, George Hripcsak

Research output: Contribution to journalArticle

37 Citations (Scopus)

Abstract

Auditors of a large terminology, such as SNOMED CT, face a daunting challenge. To aid them in their efforts, it is essential to devise techniques that can automatically identify concepts warranting special attention. " Complex" concepts, which by their very nature are more difficult to model, fall neatly into this category. A special kind of grouping, called a partial-area, is utilized in the characterization of complex concepts. In particular, the complex concepts that are the focus of this work are those appearing in intersections of multiple partial-areas and are thus referred to as overlapping concepts. In a companion paper, an automatic methodology for identifying and partitioning the entire collection of overlapping concepts into disjoint, singly-rooted groups, that are more manageable to work with and comprehend, has been presented. The partitioning methodology formed the foundation for the development of an abstraction network for the overlapping concepts called a disjoint partial-area taxonomy. This new disjoint partial-area taxonomy offers a collection of semantically uniform partial-areas and is exploited herein as the basis for a novel auditing methodology. The review of the overlapping concepts is done in a top-down order within semantically uniform groups. These groups are themselves reviewed in a top-down order, which proceeds from the less complex to the more complex overlapping concepts. The results of applying the methodology to SNOMED's Specimen hierarchy are presented. Hypotheses regarding error ratios for overlapping concepts and between different kinds of overlapping concepts are formulated. Two phases of auditing the Specimen hierarchy for two releases of SNOMED are reported on. With the use of the double bootstrap and Fisher's exact test (two-tailed), the auditing of concepts and especially roots of overlapping partial-areas is shown to yield a statistically significant higher proportion of errors.

Original languageEnglish (US)
Pages (from-to)1-14
Number of pages14
JournalJournal of Biomedical Informatics
Volume45
Issue number1
DOIs
StatePublished - Feb 1 2012
Externally publishedYes

Fingerprint

Systematized Nomenclature of Medicine
Taxonomies
Terminology

Keywords

  • Abstraction network
  • Auditing
  • Complex concept
  • Group auditing
  • Neighborhood auditing
  • Partitioning
  • Quality assurance
  • SNOMED
  • Taxonomy
  • Terminology

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this

Auditing complex concepts of SNOMED using a refined hierarchical abstraction network. / Wang, Yue; Halper, Michael; Wei, Duo; Gu, Huanying; Perl, Yehoshua; Xu, Junchuan; Elhanan, Gai; Chen, Yan; Spackman, Kent A.; Case, James; Hripcsak, George.

In: Journal of Biomedical Informatics, Vol. 45, No. 1, 01.02.2012, p. 1-14.

Research output: Contribution to journalArticle

Wang, Y, Halper, M, Wei, D, Gu, H, Perl, Y, Xu, J, Elhanan, G, Chen, Y, Spackman, KA, Case, J & Hripcsak, G 2012, 'Auditing complex concepts of SNOMED using a refined hierarchical abstraction network', Journal of Biomedical Informatics, vol. 45, no. 1, pp. 1-14. https://doi.org/10.1016/j.jbi.2011.08.016
Wang, Yue ; Halper, Michael ; Wei, Duo ; Gu, Huanying ; Perl, Yehoshua ; Xu, Junchuan ; Elhanan, Gai ; Chen, Yan ; Spackman, Kent A. ; Case, James ; Hripcsak, George. / Auditing complex concepts of SNOMED using a refined hierarchical abstraction network. In: Journal of Biomedical Informatics. 2012 ; Vol. 45, No. 1. pp. 1-14.
@article{29d3fe4e8c9344a18fc5528cac16cf38,
title = "Auditing complex concepts of SNOMED using a refined hierarchical abstraction network",
abstract = "Auditors of a large terminology, such as SNOMED CT, face a daunting challenge. To aid them in their efforts, it is essential to devise techniques that can automatically identify concepts warranting special attention. {"} Complex{"} concepts, which by their very nature are more difficult to model, fall neatly into this category. A special kind of grouping, called a partial-area, is utilized in the characterization of complex concepts. In particular, the complex concepts that are the focus of this work are those appearing in intersections of multiple partial-areas and are thus referred to as overlapping concepts. In a companion paper, an automatic methodology for identifying and partitioning the entire collection of overlapping concepts into disjoint, singly-rooted groups, that are more manageable to work with and comprehend, has been presented. The partitioning methodology formed the foundation for the development of an abstraction network for the overlapping concepts called a disjoint partial-area taxonomy. This new disjoint partial-area taxonomy offers a collection of semantically uniform partial-areas and is exploited herein as the basis for a novel auditing methodology. The review of the overlapping concepts is done in a top-down order within semantically uniform groups. These groups are themselves reviewed in a top-down order, which proceeds from the less complex to the more complex overlapping concepts. The results of applying the methodology to SNOMED's Specimen hierarchy are presented. Hypotheses regarding error ratios for overlapping concepts and between different kinds of overlapping concepts are formulated. Two phases of auditing the Specimen hierarchy for two releases of SNOMED are reported on. With the use of the double bootstrap and Fisher's exact test (two-tailed), the auditing of concepts and especially roots of overlapping partial-areas is shown to yield a statistically significant higher proportion of errors.",
keywords = "Abstraction network, Auditing, Complex concept, Group auditing, Neighborhood auditing, Partitioning, Quality assurance, SNOMED, Taxonomy, Terminology",
author = "Yue Wang and Michael Halper and Duo Wei and Huanying Gu and Yehoshua Perl and Junchuan Xu and Gai Elhanan and Yan Chen and Spackman, {Kent A.} and James Case and George Hripcsak",
year = "2012",
month = "2",
day = "1",
doi = "10.1016/j.jbi.2011.08.016",
language = "English (US)",
volume = "45",
pages = "1--14",
journal = "Journal of Biomedical Informatics",
issn = "1532-0464",
publisher = "Academic Press Inc.",
number = "1",

}

TY - JOUR

T1 - Auditing complex concepts of SNOMED using a refined hierarchical abstraction network

AU - Wang, Yue

AU - Halper, Michael

AU - Wei, Duo

AU - Gu, Huanying

AU - Perl, Yehoshua

AU - Xu, Junchuan

AU - Elhanan, Gai

AU - Chen, Yan

AU - Spackman, Kent A.

AU - Case, James

AU - Hripcsak, George

PY - 2012/2/1

Y1 - 2012/2/1

N2 - Auditors of a large terminology, such as SNOMED CT, face a daunting challenge. To aid them in their efforts, it is essential to devise techniques that can automatically identify concepts warranting special attention. " Complex" concepts, which by their very nature are more difficult to model, fall neatly into this category. A special kind of grouping, called a partial-area, is utilized in the characterization of complex concepts. In particular, the complex concepts that are the focus of this work are those appearing in intersections of multiple partial-areas and are thus referred to as overlapping concepts. In a companion paper, an automatic methodology for identifying and partitioning the entire collection of overlapping concepts into disjoint, singly-rooted groups, that are more manageable to work with and comprehend, has been presented. The partitioning methodology formed the foundation for the development of an abstraction network for the overlapping concepts called a disjoint partial-area taxonomy. This new disjoint partial-area taxonomy offers a collection of semantically uniform partial-areas and is exploited herein as the basis for a novel auditing methodology. The review of the overlapping concepts is done in a top-down order within semantically uniform groups. These groups are themselves reviewed in a top-down order, which proceeds from the less complex to the more complex overlapping concepts. The results of applying the methodology to SNOMED's Specimen hierarchy are presented. Hypotheses regarding error ratios for overlapping concepts and between different kinds of overlapping concepts are formulated. Two phases of auditing the Specimen hierarchy for two releases of SNOMED are reported on. With the use of the double bootstrap and Fisher's exact test (two-tailed), the auditing of concepts and especially roots of overlapping partial-areas is shown to yield a statistically significant higher proportion of errors.

AB - Auditors of a large terminology, such as SNOMED CT, face a daunting challenge. To aid them in their efforts, it is essential to devise techniques that can automatically identify concepts warranting special attention. " Complex" concepts, which by their very nature are more difficult to model, fall neatly into this category. A special kind of grouping, called a partial-area, is utilized in the characterization of complex concepts. In particular, the complex concepts that are the focus of this work are those appearing in intersections of multiple partial-areas and are thus referred to as overlapping concepts. In a companion paper, an automatic methodology for identifying and partitioning the entire collection of overlapping concepts into disjoint, singly-rooted groups, that are more manageable to work with and comprehend, has been presented. The partitioning methodology formed the foundation for the development of an abstraction network for the overlapping concepts called a disjoint partial-area taxonomy. This new disjoint partial-area taxonomy offers a collection of semantically uniform partial-areas and is exploited herein as the basis for a novel auditing methodology. The review of the overlapping concepts is done in a top-down order within semantically uniform groups. These groups are themselves reviewed in a top-down order, which proceeds from the less complex to the more complex overlapping concepts. The results of applying the methodology to SNOMED's Specimen hierarchy are presented. Hypotheses regarding error ratios for overlapping concepts and between different kinds of overlapping concepts are formulated. Two phases of auditing the Specimen hierarchy for two releases of SNOMED are reported on. With the use of the double bootstrap and Fisher's exact test (two-tailed), the auditing of concepts and especially roots of overlapping partial-areas is shown to yield a statistically significant higher proportion of errors.

KW - Abstraction network

KW - Auditing

KW - Complex concept

KW - Group auditing

KW - Neighborhood auditing

KW - Partitioning

KW - Quality assurance

KW - SNOMED

KW - Taxonomy

KW - Terminology

UR - http://www.scopus.com/inward/record.url?scp=84856375635&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84856375635&partnerID=8YFLogxK

U2 - 10.1016/j.jbi.2011.08.016

DO - 10.1016/j.jbi.2011.08.016

M3 - Article

C2 - 21907827

AN - SCOPUS:84856375635

VL - 45

SP - 1

EP - 14

JO - Journal of Biomedical Informatics

JF - Journal of Biomedical Informatics

SN - 1532-0464

IS - 1

ER -