GAGE: generally applicable gene set enrichment for pathway analysis
- PMID: 19473525
- PMCID: PMC2696452
- DOI: 10.1186/1471-2105-10-161
Abstract
Background: Gene set analysis (GSA) is a widely used strategy for gene expression data analysis based on pathway knowledge. GSA focuses on sets of related genes and has established major advantages over individual gene analyses, including greater robustness, sensitivity and biological relevance. However, previous GSA methods are of limited use because they cannot handle datasets of different sample sizes or experimental designs.
Results: To address these limitations, we present a new GSA method called Generally Applicable Gene-set Enrichment (GAGE). We successfully apply GAGE to multiple microarray datasets with different sample sizes, experimental designs and profiling techniques. GAGE shows significantly better results than two other commonly used GSA methods, GSEA and PAGE. We demonstrate this improvement in three aspects: (1) consistency across repeated studies/experiments; (2) sensitivity and specificity; (3) biological relevance of the regulatory mechanisms inferred. GAGE reveals novel and relevant regulatory mechanisms from both published and previously unpublished microarray studies. From two published lung cancer data sets, GAGE derived a more cohesive and predictive mechanistic scheme underlying lung cancer progression and metastasis. For a previously unpublished BMP6 study, GAGE predicted novel regulatory mechanisms for BMP6-induced osteoblast differentiation, including the canonical BMP-TGF beta signaling, JAK-STAT signaling, Wnt signaling, and estrogen signaling pathways, all of which are supported by the experimental literature.
Conclusion: GAGE is generally applicable to gene expression datasets with different sample sizes and experimental designs. GAGE consistently outperformed the two most frequently used GSA methods and inferred statistically and biologically more relevant regulatory pathways. The GAGE method is implemented in R in the "gage" package, available under the GNU GPL from http://sysbio.engin.umich.edu/~luow/downloads.php.
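The abstract does not spell out GAGE's algorithm, so the sketch below is not the "gage" package's method. It only illustrates the general gene set idea from the Background: instead of testing each gene alone, ask whether the genes in one set shift together relative to the remaining genes. It assumes per-gene log2 fold changes are already computed and uses a Welch-style two-sample t statistic as a stand-in test. The function name, gene names and values are all hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def gene_set_t_statistic(fold_changes, gene_set):
    """Welch-style two-sample t statistic: genes in the set vs. all other genes.

    fold_changes: dict mapping gene name -> log2 fold change (assumed precomputed).
    gene_set: set of gene names forming one pathway or gene set.
    """
    in_set = [fc for gene, fc in fold_changes.items() if gene in gene_set]
    background = [fc for gene, fc in fold_changes.items() if gene not in gene_set]
    if len(in_set) < 2 or len(background) < 2:
        raise ValueError("need at least two genes inside and outside the set")
    # Standard error combines the variance of both groups (unequal variances allowed).
    standard_error = sqrt(
        variance(in_set) / len(in_set) + variance(background) / len(background)
    )
    return (mean(in_set) - mean(background)) / standard_error


# Hypothetical log2 fold changes (treatment vs. control); the values are made up.
log2_fold_changes = {
    "geneA": 1.2, "geneB": 0.9, "geneC": 1.1,
    "geneD": 0.1, "geneE": -0.2, "geneF": 0.0, "geneG": 0.1,
}
hypothetical_pathway = {"geneA", "geneB", "geneC"}

t_stat = gene_set_t_statistic(log2_fold_changes, hypothetical_pathway)
print(f"t = {t_stat:.2f}")
```

A large positive statistic suggests the set is coordinately up-regulated relative to the background; a real analysis would also convert the statistic to a p-value and correct for testing many gene sets.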