Content-based microarray search using differential expression profiles
- PMID: 21172034
- PMCID: PMC3022631
- DOI: 10.1186/1471-2105-11-603
Abstract
Background: With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations.
Results: We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and p-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3.
Conclusions: Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.
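The comparison step described in the Results can be illustrated with a minimal sketch. It assumes each experiment is stored as a vector of per-gene differential expression scores plus a matching vector of p-values, and that the per-gene weight is the product of (1 - p) from the two experiments. That weighting, the function names, and the data layout are illustrative assumptions; the abstract does not specify the exact weighting scheme, and the matrix decomposition step is omitted here.

```python
import numpy as np

def weighted_pearson(x, y, w):
    """Pearson correlation of x and y with non-negative per-gene weights w."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    w = w / w.sum()                      # normalize weights to sum to 1
    mx, my = np.sum(w * x), np.sum(w * y)  # weighted means
    cov = np.sum(w * (x - mx) * (y - my))
    vx = np.sum(w * (x - mx) ** 2)
    vy = np.sum(w * (y - my) ** 2)
    if vx == 0.0 or vy == 0.0:
        return 0.0                       # correlation undefined; treat as no association
    return cov / np.sqrt(vx * vy)

def p_weights(p_query, p_candidate):
    # Assumption for illustration: genes with small p-values in both
    # experiments get the largest weight. The paper's scheme may differ.
    return (1.0 - np.asarray(p_query, dtype=float)) * (1.0 - np.asarray(p_candidate, dtype=float))

def rank_experiments(query_scores, query_p, library):
    """Rank library experiments by p-weighted correlation with the query.

    library maps an experiment name to a (scores, p_values) pair measured
    over the same genes, in the same order, as the query.
    """
    results = []
    for name, (scores, p_values) in library.items():
        w = p_weights(query_p, p_values)
        results.append((name, weighted_pearson(query_scores, scores, w)))
    # Highest correlation first.
    return sorted(results, key=lambda item: item[1], reverse=True)
```

Because no threshold is applied, every gene contributes to the score, and the weights simply down-rank genes whose differential expression is poorly supported in either experiment.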