A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships

Christopher Ochs, James Geller, Yehoshua Perl, Yan Chen, Ankur Agrawal, James Case, George Hripcsak

Research output: Contribution to journalArticle

17 Citations (Scopus)

Abstract

Objective Large and complex terminologies, such as Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT), are prone to errors and inconsistencies. Abstraction networks are compact summarizations of the content and structure of a terminology. Abstraction networks have been shown to support terminology quality assurance. In this paper, we introduce an abstraction network derivation methodology which can be applied to SNOMED CT target hierarchies whose classes are defined using only hierarchical relationships (ie, without attribute relationships) and similar description-logic-based terminologies. Methods We introduce the tribal abstraction network (TAN), based on the notion of a tribe-a subhierarchy rooted at a child of a hierarchy root, assuming only the existence of concepts with multiple parents. The TAN summarizes a hierarchy that does not have attribute relationships using sets of concepts, called tribal units that belong to exactly the same multiple tribes. Tribal units are further divided into refined tribal units which contain closely related concepts. A quality assurance methodology that utilizes TAN summarizations is introduced. Results A TAN is derived for the Observable entity hierarchy of SNOMED CT, summarizing its content. A TAN-based quality assurance review of the concepts of the hierarchy is performed, and erroneous concepts are shown to appear more frequently in large refined tribal units than in small refined tribal units. Furthermore, more erroneous concepts appear in large refined tribal units of more tribes than of fewer tribes. Conclusions In this paper we introduce the TAN for summarizing SNOMED CT target hierarchies. A TAN was derived for the Observable entity hierarchy of SNOMED CT. A quality assurance methodology utilizing the TAN was introduced and demonstrated.

Original languageEnglish (US)
Pages (from-to)628-639
Number of pages12
JournalJournal of the American Medical Informatics Association
Volume22
Issue number3
DOIs
StatePublished - Jan 1 2015
Externally publishedYes

Fingerprint

Systematized Nomenclature of Medicine
Population Groups
Terminology
Parents

Keywords

  • Abstraction network
  • Hierarchical abstraction network
  • SNOMED CT
  • Terminology quality assurance
  • Terminology summarization
  • Terminology without lateral relationships

ASJC Scopus subject areas

  • Health Informatics

Cite this

A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships. / Ochs, Christopher; Geller, James; Perl, Yehoshua; Chen, Yan; Agrawal, Ankur; Case, James; Hripcsak, George.

In: Journal of the American Medical Informatics Association, Vol. 22, No. 3, 01.01.2015, p. 628-639.

Research output: Contribution to journalArticle

Ochs, Christopher ; Geller, James ; Perl, Yehoshua ; Chen, Yan ; Agrawal, Ankur ; Case, James ; Hripcsak, George. / A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships. In: Journal of the American Medical Informatics Association. 2015 ; Vol. 22, No. 3. pp. 628-639.
@article{e033494f048d41148113450c8604d729,
title = "A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships",
abstract = "Objective Large and complex terminologies, such as Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT), are prone to errors and inconsistencies. Abstraction networks are compact summarizations of the content and structure of a terminology. Abstraction networks have been shown to support terminology quality assurance. In this paper, we introduce an abstraction network derivation methodology which can be applied to SNOMED CT target hierarchies whose classes are defined using only hierarchical relationships (ie, without attribute relationships) and similar description-logic-based terminologies. Methods We introduce the tribal abstraction network (TAN), based on the notion of a tribe-a subhierarchy rooted at a child of a hierarchy root, assuming only the existence of concepts with multiple parents. The TAN summarizes a hierarchy that does not have attribute relationships using sets of concepts, called tribal units that belong to exactly the same multiple tribes. Tribal units are further divided into refined tribal units which contain closely related concepts. A quality assurance methodology that utilizes TAN summarizations is introduced. Results A TAN is derived for the Observable entity hierarchy of SNOMED CT, summarizing its content. A TAN-based quality assurance review of the concepts of the hierarchy is performed, and erroneous concepts are shown to appear more frequently in large refined tribal units than in small refined tribal units. Furthermore, more erroneous concepts appear in large refined tribal units of more tribes than of fewer tribes. Conclusions In this paper we introduce the TAN for summarizing SNOMED CT target hierarchies. A TAN was derived for the Observable entity hierarchy of SNOMED CT. A quality assurance methodology utilizing the TAN was introduced and demonstrated.",
keywords = "Abstraction network, Hierarchical abstraction network, SNOMED CT, Terminology quality assurance, Terminology summarization, Terminology without lateral relationships",
author = "Christopher Ochs and James Geller and Yehoshua Perl and Yan Chen and Ankur Agrawal and James Case and George Hripcsak",
year = "2015",
month = "1",
day = "1",
doi = "10.1136/amiajnl-2014-003173",
language = "English (US)",
volume = "22",
pages = "628--639",
journal = "Journal of the American Medical Informatics Association : JAMIA",
issn = "1067-5027",
publisher = "Oxford University Press",
number = "3",

}

TY - JOUR

T1 - A tribal abstraction network for SNOMED CT target hierarchies without attribute relationships

AU - Ochs, Christopher

AU - Geller, James

AU - Perl, Yehoshua

AU - Chen, Yan

AU - Agrawal, Ankur

AU - Case, James

AU - Hripcsak, George

PY - 2015/1/1

Y1 - 2015/1/1

N2 - Objective Large and complex terminologies, such as Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT), are prone to errors and inconsistencies. Abstraction networks are compact summarizations of the content and structure of a terminology. Abstraction networks have been shown to support terminology quality assurance. In this paper, we introduce an abstraction network derivation methodology which can be applied to SNOMED CT target hierarchies whose classes are defined using only hierarchical relationships (ie, without attribute relationships) and similar description-logic-based terminologies. Methods We introduce the tribal abstraction network (TAN), based on the notion of a tribe-a subhierarchy rooted at a child of a hierarchy root, assuming only the existence of concepts with multiple parents. The TAN summarizes a hierarchy that does not have attribute relationships using sets of concepts, called tribal units that belong to exactly the same multiple tribes. Tribal units are further divided into refined tribal units which contain closely related concepts. A quality assurance methodology that utilizes TAN summarizations is introduced. Results A TAN is derived for the Observable entity hierarchy of SNOMED CT, summarizing its content. A TAN-based quality assurance review of the concepts of the hierarchy is performed, and erroneous concepts are shown to appear more frequently in large refined tribal units than in small refined tribal units. Furthermore, more erroneous concepts appear in large refined tribal units of more tribes than of fewer tribes. Conclusions In this paper we introduce the TAN for summarizing SNOMED CT target hierarchies. A TAN was derived for the Observable entity hierarchy of SNOMED CT. A quality assurance methodology utilizing the TAN was introduced and demonstrated.

AB - Objective Large and complex terminologies, such as Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT), are prone to errors and inconsistencies. Abstraction networks are compact summarizations of the content and structure of a terminology. Abstraction networks have been shown to support terminology quality assurance. In this paper, we introduce an abstraction network derivation methodology which can be applied to SNOMED CT target hierarchies whose classes are defined using only hierarchical relationships (ie, without attribute relationships) and similar description-logic-based terminologies. Methods We introduce the tribal abstraction network (TAN), based on the notion of a tribe-a subhierarchy rooted at a child of a hierarchy root, assuming only the existence of concepts with multiple parents. The TAN summarizes a hierarchy that does not have attribute relationships using sets of concepts, called tribal units that belong to exactly the same multiple tribes. Tribal units are further divided into refined tribal units which contain closely related concepts. A quality assurance methodology that utilizes TAN summarizations is introduced. Results A TAN is derived for the Observable entity hierarchy of SNOMED CT, summarizing its content. A TAN-based quality assurance review of the concepts of the hierarchy is performed, and erroneous concepts are shown to appear more frequently in large refined tribal units than in small refined tribal units. Furthermore, more erroneous concepts appear in large refined tribal units of more tribes than of fewer tribes. Conclusions In this paper we introduce the TAN for summarizing SNOMED CT target hierarchies. A TAN was derived for the Observable entity hierarchy of SNOMED CT. A quality assurance methodology utilizing the TAN was introduced and demonstrated.

KW - Abstraction network

KW - Hierarchical abstraction network

KW - SNOMED CT

KW - Terminology quality assurance

KW - Terminology summarization

KW - Terminology without lateral relationships

UR - http://www.scopus.com/inward/record.url?scp=84946745752&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84946745752&partnerID=8YFLogxK

U2 - 10.1136/amiajnl-2014-003173

DO - 10.1136/amiajnl-2014-003173

M3 - Article

VL - 22

SP - 628

EP - 639

JO - Journal of the American Medical Informatics Association : JAMIA

JF - Journal of the American Medical Informatics Association : JAMIA

SN - 1067-5027

IS - 3

ER -