Querying multiple bioinformatics information sources: Can Semantic Web research help?

David Buttler, Matthew A Coleman, Terence Critchlow, Renato Fileto, Wei Han, Calton Pu, Daniel Rocco, Li Xiong

Research output: Contribution to journalArticle

33 Citations (Scopus)

Abstract

Advances in Semantic Web and Ontologies have pushed the role of semantics to a new frontier: Semantic Composition of Web Services. A good example of such compositions is the querying of multiple bioinformatics data sources. Supporting effective querying over a large collection of bioinformatics data sources presents a number of unique challenges. First, queries over bioinformatics data sources are often complex associative queries over multiple Web documents. Most associations are defined by string matching of textual fragments in two documents. Second, most of the queries required by Genomics researchers involve complex data extraction, and sophisticated workflows that implement the complex associative access. Third but not the least, complex Genomics-specific queries are often reused many times by other Genomics researchers, either directly or through some refinements, and are considered as a part of the research results by Genomics researchers. In this short article we present a list of challenging issues in supporting effective querying over bioinformatics data sources and illustrate them through a selection of representative search scenarios provided by biologists. We end the article with a discussion on how the state-of-art research and technological development in Semantic Web, Ontology, Internet Data Management, and Internet Computing Systems can help addressing these issues.

Original languageEnglish (US)
Pages (from-to)59-64
Number of pages6
JournalSIGMOD Record
Volume31
Issue number4
DOIs
StatePublished - Dec 2002

Fingerprint

Bioinformatics
Semantic Web
Ontology
Semantics
Internet
Chemical analysis
Information management
Web services
Computer systems
Genomics

ASJC Scopus subject areas

  • Information Systems
  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

Querying multiple bioinformatics information sources : Can Semantic Web research help? / Buttler, David; Coleman, Matthew A; Critchlow, Terence; Fileto, Renato; Han, Wei; Pu, Calton; Rocco, Daniel; Xiong, Li.

In: SIGMOD Record, Vol. 31, No. 4, 12.2002, p. 59-64.

Research output: Contribution to journalArticle

Buttler, D, Coleman, MA, Critchlow, T, Fileto, R, Han, W, Pu, C, Rocco, D & Xiong, L 2002, 'Querying multiple bioinformatics information sources: Can Semantic Web research help?', SIGMOD Record, vol. 31, no. 4, pp. 59-64. https://doi.org/10.1145/637411.637421
Buttler, David ; Coleman, Matthew A ; Critchlow, Terence ; Fileto, Renato ; Han, Wei ; Pu, Calton ; Rocco, Daniel ; Xiong, Li. / Querying multiple bioinformatics information sources : Can Semantic Web research help?. In: SIGMOD Record. 2002 ; Vol. 31, No. 4. pp. 59-64.
@article{faecd035030047b6a27bbec9cfbc4c50,
title = "Querying multiple bioinformatics information sources: Can Semantic Web research help?",
abstract = "Advances in Semantic Web and Ontologies have pushed the role of semantics to a new frontier: Semantic Composition of Web Services. A good example of such compositions is the querying of multiple bioinformatics data sources. Supporting effective querying over a large collection of bioinformatics data sources presents a number of unique challenges. First, queries over bioinformatics data sources are often complex associative queries over multiple Web documents. Most associations are defined by string matching of textual fragments in two documents. Second, most of the queries required by Genomics researchers involve complex data extraction, and sophisticated workflows that implement the complex associative access. Third but not the least, complex Genomics-specific queries are often reused many times by other Genomics researchers, either directly or through some refinements, and are considered as a part of the research results by Genomics researchers. In this short article we present a list of challenging issues in supporting effective querying over bioinformatics data sources and illustrate them through a selection of representative search scenarios provided by biologists. We end the article with a discussion on how the state-of-art research and technological development in Semantic Web, Ontology, Internet Data Management, and Internet Computing Systems can help addressing these issues.",
author = "David Buttler and Coleman, {Matthew A} and Terence Critchlow and Renato Fileto and Wei Han and Calton Pu and Daniel Rocco and Li Xiong",
year = "2002",
month = "12",
doi = "10.1145/637411.637421",
language = "English (US)",
volume = "31",
pages = "59--64",
journal = "SIGMOD Record",
issn = "0163-5808",
publisher = "Association for Computing Machinery (ACM)",
number = "4",

}

TY - JOUR

T1 - Querying multiple bioinformatics information sources

T2 - Can Semantic Web research help?

AU - Buttler, David

AU - Coleman, Matthew A

AU - Critchlow, Terence

AU - Fileto, Renato

AU - Han, Wei

AU - Pu, Calton

AU - Rocco, Daniel

AU - Xiong, Li

PY - 2002/12

Y1 - 2002/12

N2 - Advances in Semantic Web and Ontologies have pushed the role of semantics to a new frontier: Semantic Composition of Web Services. A good example of such compositions is the querying of multiple bioinformatics data sources. Supporting effective querying over a large collection of bioinformatics data sources presents a number of unique challenges. First, queries over bioinformatics data sources are often complex associative queries over multiple Web documents. Most associations are defined by string matching of textual fragments in two documents. Second, most of the queries required by Genomics researchers involve complex data extraction, and sophisticated workflows that implement the complex associative access. Third but not the least, complex Genomics-specific queries are often reused many times by other Genomics researchers, either directly or through some refinements, and are considered as a part of the research results by Genomics researchers. In this short article we present a list of challenging issues in supporting effective querying over bioinformatics data sources and illustrate them through a selection of representative search scenarios provided by biologists. We end the article with a discussion on how the state-of-art research and technological development in Semantic Web, Ontology, Internet Data Management, and Internet Computing Systems can help addressing these issues.

AB - Advances in Semantic Web and Ontologies have pushed the role of semantics to a new frontier: Semantic Composition of Web Services. A good example of such compositions is the querying of multiple bioinformatics data sources. Supporting effective querying over a large collection of bioinformatics data sources presents a number of unique challenges. First, queries over bioinformatics data sources are often complex associative queries over multiple Web documents. Most associations are defined by string matching of textual fragments in two documents. Second, most of the queries required by Genomics researchers involve complex data extraction, and sophisticated workflows that implement the complex associative access. Third but not the least, complex Genomics-specific queries are often reused many times by other Genomics researchers, either directly or through some refinements, and are considered as a part of the research results by Genomics researchers. In this short article we present a list of challenging issues in supporting effective querying over bioinformatics data sources and illustrate them through a selection of representative search scenarios provided by biologists. We end the article with a discussion on how the state-of-art research and technological development in Semantic Web, Ontology, Internet Data Management, and Internet Computing Systems can help addressing these issues.

UR - http://www.scopus.com/inward/record.url?scp=2542558819&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=2542558819&partnerID=8YFLogxK

U2 - 10.1145/637411.637421

DO - 10.1145/637411.637421

M3 - Article

AN - SCOPUS:2542558819

VL - 31

SP - 59

EP - 64

JO - SIGMOD Record

JF - SIGMOD Record

SN - 0163-5808

IS - 4

ER -