TY - JOUR
T1 - NeuroExtract
T2 - Facilitating Neuroscience-oriented Retrieval from Broadly-focused Bioscience Databases Using Text-based Query Mediation
AU - Crasto, Chiquito J.
AU - Masiar, Peter
AU - Miller, Perry L.
N1 - Funding Information:
This research was supported in part by NIH Grant P01 DC04732, by NIH contract N01 DA-BAA-5-7753, and by NIH Grants T15 LM0705 and P20 LM07253 from the National Library of Medicine. The authors would like to thank Professor Gordon M. Shepherd for his comments on the manuscript and the work described therein.
PY - 2007/5
Y1 - 2007/5
N2 - This paper describes NeuroExtract, a pilot system which facilitates the integrated retrieval of Internet-based information relevant to the neurosciences. The approach involved extracting descriptive metadata from the sources using domain-specific queries; retrieving, processing, and organizing the data into structured text files; searching the data files using text-based queries; and providing the results in a Web page along with descriptions of entries and URL links to the original sources. NeuroExtract has been implemented for three bioscience resources, SWISSPROT, GEO, and PDB, which provide neuroscience-related information as sub-topics. We discuss several issues that arose in the course of NeuroExtract's implementation. This project is a first step in exploring how this general approach might be used, in conjunction with other query mediation approaches, to facilitate the integration of many Internet-accessible resources relevant to the neurosciences.
AB - This paper describes NeuroExtract, a pilot system which facilitates the integrated retrieval of Internet-based information relevant to the neurosciences. The approach involved extracting descriptive metadata from the sources using domain-specific queries; retrieving, processing, and organizing the data into structured text files; searching the data files using text-based queries; and providing the results in a Web page along with descriptions of entries and URL links to the original sources. NeuroExtract has been implemented for three bioscience resources, SWISSPROT, GEO, and PDB, which provide neuroscience-related information as sub-topics. We discuss several issues that arose in the course of NeuroExtract's implementation. This project is a first step in exploring how this general approach might be used, in conjunction with other query mediation approaches, to facilitate the integration of many Internet-accessible resources relevant to the neurosciences.
UR - http://www.scopus.com/inward/record.url?scp=34247391553&partnerID=8YFLogxK
U2 - 10.1197/jamia.M2321
DO - 10.1197/jamia.M2321
M3 - Article
C2 - 17329721
AN - SCOPUS:34247391553
SN - 1067-5027
VL - 14
SP - 355
EP - 360
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 3
ER -