i-GSEA4GWAS v2.0 Home
    Home     |     Documents     |     Reference

Introduction

Brief Introduction

i-GSEA4GWAS v2 (improved GSEA for GWAS v2) is an important expansion of i-GSEA4GWAS [1]. In comparison to i-GSEA4GWAS, which applies a pathway-based analysis (PBA) approach named i-GSEA (improved gene set enrichment analysis) to analyze genome-wide association study (GWAS) SNP P-values to identify pathways associated with traits. i-GSEA4GWAS v2 is featured by implementing both i-GSEA and follow-up functinal analysis of SNPs in trait-associated pathways. The functional analysis of SNPs is performed based on ENCODE regulatory regions [2] to annotate the non-coding feature of SNPs and by using Ensembl putative functional SNP annotation data [3] to annotate the coding feature of SNPs. Expression quantitative trait loci (eQTLs) related to the SNPs were also annotated. The functional analysis will help to select putative functional SNPs as candidate causal SNPs for further validation research. To our knowledge, this is the first effort that the functional analysis of SNPs is implemented in a PBA tool for GWAS.

Input, processing, and output

With the GWAS SNP association values as input, the program firstly performs i-GSEA to identify pathway correlated to trait. Then it carries out functional analysis for the both the most significant SNPs of the genes involved in the pathways and their linkage disequilibrium (LD) proxies extracted from HapMap populations [4] or 1000 Genome [5]. The functional analysis includes annotation and enrichment analysis. The first type of annotation is to map SNPs to ENCODE regulatory feature peaks [3], including DNase-seq peaks of open chromatin, FAIRE peaks of open chromatin, TFBS SPP-based peaks, TFBS PeakSeq-based peaks and Histone peaks from ENCODE Analysis Hub. The second type of annotation is to annotate SNPs implacting protein function (deleterious or probably/possibly damaging, splice donor variant, stop lost, incomplete terminal codon variant, inframe insertion, transcript ablation, splice acceptor variant, frameshift variant, stop gained, initiator codon variant, splice region variant or inframe deletion) [2]. The third type of annotation is to annotate SNP related eQTL. The enrichment analysis is implemented for each type of functional elements to explore if the significant SNPs in each trait-associated pathway are significantly enriched in these functional elements. Finally, the trait-associated pathways with detailed results of functional analysis are displayed.

Additional notes

1. Since computation for i-GSEA and functional analysis will take some time (e.g. 12 minutes for demo data sets), to facilitate users to access the results, i-GSEA4GWAS v2 provides a web link to results upon data submission. The link will also briefly report the running status of the job. Users can bookmark the link to access results at a later time. Moreover, an e-mail with result notification will be sent automatically if users provide an email address.
2. It is ensured that uploaded data for analysis will only be visible to users who upload the data.
3. As the name of the web server defines, i-GSEA4GWAS v2 is only applicable to whole-genome SNP arrays.

References
1. Zhang, K., Cui, S., Chang, S., Zhang, L. and Wang, J. (2010) i-GSEA4GWAS: a web server for identification of pathways/gene sets associated with traits by applying an improved gene set enrichment analysis to genome-wide association study. Nucleic Acids Res, 38, W90-95.
2. ENCODE Project Consortium. (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57-74.
3. Flicek, P., Ahmed, I., Amode MR, Barrell D, Beal K, Brent S, et al. (2013) Ensembl 2013. Nucleic Acids Res 41(Database issue) D48-55.
4. The International HapMap Consortium, (2010) Integrating common and rare genetic variation in diverse human populations. Nature 467(7311) 52-58.
5. Abecasis, G.R., Auton, A., Brooks, L.D., DePristo, M.A., Durbin, R.M., Handsaker, R.E., Kang, H.M., Marth, G.T. and McVean, G.A. (2012) An integrated map of genetic variation from 1,092 human genomes. Nature, 491, 56-65.




JSP Page

Copyright: Bioinformatics Lab, Institute of Psychology, Chinese Academy of Sciences Feedback
Last update: April 18, 2014