Visual Composition of Complex Queries on an Integrative Genomic and Proteomic Data Warehouse

Abstract

Biomedical questions are usually complex and regard several different life science aspects. Numerous valuable and he- terogeneous data are increasingly available to answer such questions. Yet, they are dispersedly stored and difficult to be queried comprehensively. We created a Genomic and Proteomic Data Warehouse (GPDW) that integrates data provided by some of the main bioinformatics databases. It adopts a modular integrated data schema and several metadata to describe the integrated data, their sources and their location in the GPDW. Here, we present the Web application that we developed to enable any user to easily compose queries, although complex, on all data integrated in the GPDW. It is publicly available at http://www.bioinformatics.dei.polimi.it/GPKB/. Through a visual interface, the user is only required to select the types of data to be included in the query and the conditions on their values to be retrieved. Then, the Web application leverages the metadata and modular schema of the GPDW to automatically compose an efficient SQL query, run it on the GPDW and show the extracted requested data, enriched with links to external data sources. Performed tests demonstrated efficiency and usability of the developed Web application, and showed its and GPDW relevance in supporting answering biomedical questions, also difficult.

Share and Cite:

Pessina, F. , Masseroli, M. and Canakoglu, A. (2013) Visual Composition of Complex Queries on an Integrative Genomic and Proteomic Data Warehouse. Engineering, 5, 94-98. doi: 10.4236/eng.2013.510B019.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] M. Y. Galperin and X. M. Fernández-Suárez, “The 2012 Nucleic Acids Research Database Issue and the Online Molecular Biology Database Collection,” Nucleic Acids Research, Vol. 40, Database Issue, 2012, pp. D1-D8. http://dx.doi.org/10.1093/nar/gkr1196
[2] N. W. Paton, S. A. Khan, A. Hayes, F. Moussouni, A. Brass, K. Eilbeck, C. A. Goble, S. J. Hubbard and S. G. Oliver, “Conceptual Modeling of Genomic Information,” Bioinformatics, Vol. 16, No. 6, 2000, pp. 548-557. http://dx.doi.org/10.1093/bioinformatics/16.6.548
[3] E. Bornberg-Bauer and N. W. Paton, “Conceptual Data Modelling for Bioinformatics,” Briefings in Bioinformatics, Vol. 3, No. 2, 2002, pp. 166-180. http://dx.doi.org/10.1093/bib/3.2.166
[4] M. Masseroli, D. Martucci and F. Pinciroli, “GFINDer: Genome Function INtegrated Discoverer through Dynamic Annotation, Statistical Analysis, and Mining,” Nucleic Acids Research, Vol. 32, 2004, pp. W293-W300. http://dx.doi.org/10.1093/nar/gkh432
[5] M. Masseroli, O. Galati and F. Pinciroli, “GFINDer: Genetic Disease and Phenotype Location Statistical Analysis and Mining of Dynamically Annotated Gene Lists,” Nucleic Acids Research, Vol. 33, 2005, pp. W717-W723. http://dx.doi.org/10.1093/nar/gki454
[6] A. Canakoglu, G. Ghisalberti and M. Masseroli “Integration of Biomolecular Interaction Data in a Genomic and Proteomic Data Warehouse to Support Biomedical Knowledge Discovery,” In: E. Biganzoli, A. Vellido, F. Ambrogi and R. Tagliaferri, Eds., Computational Intelligence Methods for Bioinformatics and Biostatistics, Springer, Heidelberg, 2012, pp. 112-126. http://dx.doi.org/10.1007/978-3-642-35686-5_10
[7] J. Nielsen, “Usability Engineering,” Morgan Kaufmann, San Francisco, 1993.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.