Combining Gene-Phenotype Association Matrix with KEGG Pathways to Mine Gene Modules Using Data Set in GAW17


Currently, genome-wide association studies have been proved to be a powerful approach to identify risk loci. However, the molecular regulatory mechanisms of complex diseases are still not clearly understood. It is therefore important to consider the interplay between genetic factors and biological networks in elucidating the mechanisms of complex disease pathogenesis. In this paper, we first conducted a genome-wide association analysis by using the SNP genotype data and phenotype data provided by Genetic Analysis Workshop 17, in order to filter significant SNPs associated with the diseases. Second, we conducted a bioinformatics analysis of gene-phenotype association matrix to identify gene modules (biclusters). Third, we performed a KEGG enrichment test of genes involved in biclusters to find evidence to support their functional consensus. This method can be used for better understanding complex diseases.

Share and Cite:

Lin, H. , Zheng, Y. and Zhou, P. (2013) Combining Gene-Phenotype Association Matrix with KEGG Pathways to Mine Gene Modules Using Data Set in GAW17. Engineering, 5, 332-337. doi: 10.4236/eng.2013.510B067.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] J. Gauderman, C. Murcray, F. Gilliland and D. Conti, “Testing Association between Disease and Multiple SNPs in a Candidate Gene,” Genetic Epidemiology, Vol. 31, No. 5, 2007, pp. 383-395.
[2] X. Cui, Q. Sha, S. Zhang and H.-S. Chen, “A Combinatorial Approach for Detecting Gene-Gene Interaction Using Multiple Traits of Genetic Analysis Workshop 16 Rheumatoid Arthritis Data,” BMC Proceedings, Vol. 3, Suppl. 7, 2009, p. S43.
[3] I. Iossifov, T. Zheng, M. Baron, T. C. Gilliam and A. Rzhetsky, “Genetic-Linkage Mapping of Complex Hereditary Disorders to a Whole-Genome Molecular-Interaction Network,” Genome Research, Vol. 18, 2008, pp. 1150-1162.
[4] M. Holden, S. Deng, L. Wojnowski and B. Kulle, “GSEA-SNP: Applying Gene Set Enrichment Analysis to SNP Data from Genome-Wide Association Studies,” Bioinformatics, Vol. 24, No. 23, 2008, pp. 2784-2785.
[5] A. Tanay, R. Sharan and R. Shamir, “Discovering Statistically Significant Biclusters in Gene Expression Data,” Bioinformatics, Vol. 18, Suppl. 1, 2002, pp. 136-144.
[6] N. Stenzel, C. P. Fetzer, R. Heumann, S. Kai, “PDZ-Domain-Directed Basolateral Targeting of the Peripheral Membrane Protein FRMPD2 in Epithelial Cells,” Journal of Cell Science, Vol. 122, 2009, pp. 3374-3384.
[7] F. Sanfilippo, W. K. Vaughn, E. K. Spees, J. A. Light and W. M. LeFor, “Benefits of HLA-A and HLA-B Matching on Graft and Patient Outcome after Cadaveric-Donor Renal Transplantation,” The New England Journal of Medicine, Vol. 311, No. 6, 1984, pp. 358-364.
[8] G. Carbunaru, P. Prasad, B. Scoccia, P. Shea, N. Hopwood, F. Ziai, Y. T. Chang, S. Myers, J. Mason and S. Pang, “The Hormonal Phenotype of Nonclassic 3β-Hydroxysteroid Dehydrogenase (HSD3B) Deficiency in Hyperandrogenic Females Is Associated with Insulin-Resistant Polycystic Ovary Syndrome and Is Not a Variant of Inherited HSD3B2 Deficiency,” The Journal of Clinical Endocrinology & Metabolism, Vol. 89, No. 2, 2003, pp. 783-794.
[9] K. Itäaho, M. H. Court, P. Uutela, R. Kostiainen, A. Radominska-Pandya, M. F. Dopamine, “Is a Low-Affinity and High-Specificity Substrate for the Human UDP-Glucuronosyltransferase 1A10,” Drug Metabolism & Disposition, Vol. 37, No. 4, 2009, pp.768-775.
[10] Y. Zhu, A. Hoffman, X. Wu, H. Zhang, Y. Zhang, D. Leaderer and T. Zheng, “Correlating Observed Odds Ratios from Lung Cancer Case-Control Studies to SNP Functional Scores Predicted by Bioinformactic Tool,” Mutation Research, Vol. 639, No. 1-2, 2008, pp. 80-88.

Copyright © 2023 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.