In silico tests on sequence motif significances for human tissue specific genes


Identification and analysis of tissue-specific (TS) genes and their regulatory activities play an important role in understanding the mechanisms of the organism, disease diagnosis and drug design. Although so far we are not clear about the mechanisms totally, the sequence features of TS genes are becoming an important clue. In this paper we used an integrated pipeline to discover sequences motifs for the promoter regions of TS genes. To test the significances of those motifs in a specific tissue, we used hypotheses test approaches including Bayesian hypothesis, Binomial distribution and traditional z-test. We finally got 2784, 1204 and 703 motifs respectively out of 3244 motifs obtained in discovery phase using above three tests from 3954 TS genes across 83 human tissues. 52.7% of those motifs can be found in public databases available.

Share and Cite:

Gong, X. and Xu, H. (2013) In silico tests on sequence motif significances for human tissue specific genes. Journal of Biomedical Science and Engineering, 6, 572-578. doi: 10.4236/jbise.2013.65072.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] Dezso, Z., et al. (2008) A comprehensive functional analysis of tissue specificity of human gene expression. BMC Biology, 6, 49.
[2] Kuzmin, D., et al. (2010) Novel strong tissue specific promoter for gene expression in human germ cells. BMC Biotechnology, 10, 58. doi:10.1186/1472-6750-10-58
[3] Grosso, A., Gomes, A. and Barbosa, N. (2008) Tissuespecific splicing factor gene expression signatures. Nucleic Acids, 36, 4823-4832. doi:10.1093/nar/gkn463
[4] Hebert, C. (2010) Nucleosome rotational setting is associated with transcriptional regulation in promoters of tissue-specific human genes. Genome Biology, 11, R51. doi:10.1186/gb-2010-11-5-r51
[5] Lawson, M.J. and Zhang, L. (2008) Housekeeping and tissue-specific genes differ in simple sequence repeats in the 5'-UTR region. Gene, 407, 54-62. doi:10.1016/j.gene.2007.09.017
[6] Song, F., et al. (2009) Tissue specific differentially methylated regions (TDMR): Changes in DNA methylation during development. Genomics, 93, 130-139. doi:10.1016/j.ygeno.2008.09.003
[7] Tompa, M., et al. (2005) Assessing computational tools for the discovery of transcription factor binding sites. Nature Biotechnology, 23, 137-144. doi:10.1038/nbt1053
[8] Bailey, T.L., et al. (2009) MEME SUITE: Tools for motif discovery and searching. Nucleic Acids Research, 37, W202-W208.
[9] Neuwald, F., Liu, J.S. and Lawrence, C.E. (1995) Gibbs motif sampling detection of bacterial outer membrane protein repeats. Protein Science: A Publication of the Protein Society, 4, 1618-1632.
[10] Clements, M. (2007) Creating motifs with LocoMotif. Scanning.
[11] Liu, X., Yu, X., Zack, D.J., Zhu, H. and Qian, J. (2008) TiGER: A database for tissue-specific gene expression and regulation. BMC Bioinformatics, 9, 271. doi:10.1186/1471-2105-9-271
[12] Xiao, S.J., Zhang, C. and Zou, Q. (2010) TiSGeD: A database for tissue-specific genes. Bioinformatics, 26, 12731275. doi:10.1093/bioinformatics/btq109
[13] Suzuki, Y., Yamashita, R., Nakai, K. and Sugano, S. (2002) DBTSS: DataBase of human transcriptional start sites and full-length cDNAs. Nucleic Acids Research, 30, 328-331. doi:10.1093/nar/30.1.328
[14] Périer, R.C., Praz, V., Junier, T., Bonnard, C. and Bucher, P. (2000) The eukaryotic promoter database (EPD). Nucleic Acids Research, 28, 302-303.
[15] Zare-Mirakabad, F., Ahrabian, H., Sadeghi, M., Hashemifar, S., Nowzari-Dalini, A. and Goliaei, B. (2009) Genetic algorithm for dyad pattern finding in DNA sequences. Genes & Genetic Systems, 84, 81-93. doi:10.1266/ggs.84.81
[16] Habib, N., Kaplan, T., Margalit, H. and Friedman, N. (2008) A novel bayesian DNA motif comparison method for clustering and retrieval. PLoS Computational Biology, 4. doi:10.1371/journal.pcbi.1000010
[17] Sandelin, A., Alkema, W., Engström, P., Wasserman, W.W. and Lenhard, B. (2004) JASPAR: An open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Research, 32, D91-D94.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.