TITLE: 
                        
                            A Gene Score Test for Disease Association with Multiple Genes
                                
                                
                                    AUTHORS: 
                                            Changchun Xie 
                                                    
                                                        KEYWORDS: 
                        Gene Score, SNP, GWAS, Permutation, GLM, Multiple Regression, Sum Test 
                                                    
                                                    
                                                        JOURNAL NAME: 
                        Open Journal of Statistics,  
                        Vol.1 No.1, 
                        April
                                                        21,
                        2011
                                                    
                                                    
                                                        ABSTRACT: The traditional method for creating a gene score to predict a given outcome is to use the most statistically significant single nucleotide polymorphisms (SNPs) from all SNPs which were tested. There are several disadvantages of this approach such as excluding SNPs that do not have strong single effects when tested on their own but do have strong joint effects when tested together with other SNPs. The interpretation of results from the traditional gene score may lack biological insight since the functional unit of interest is often the gene, not the single SNP. In this paper we present a new gene scoring method, which overcomes these problems as it generates a gene score for each gene, and the total gene score for all the genes available. First, we calculate a gene score for each gene and second, we test the association between this gene score and the outcome of interest (i.e. trait). Only the gene scores which are significantly associated with the outcome after multiple testing correction for the number of gene tests (not SNPs) are considered in the total gene score calculation. This method controls false positive results caused by multiple tests within genes and between genes separately, and has the advantage of identifying multi-locus genetic effects, compared with the Bonferroni correction, false discovery rate (FDR), and permutation tests for all SNPs. Another main feature of this method is that we select the SNPs, which have different effects within a gene by using adjustment in multiple regressions and then combine the information from the selected SNPs within a gene to create a gene score. A simulation study has been conducted to evaluate finite sample performance of the proposed method.