Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr;50(4):621-629.
doi: 10.1038/s41588-018-0081-4. Epub 2018 Apr 9.

Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types

Affiliations

Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types

Hilary K Finucane et al. Nat Genet. 2018 Apr.

Abstract

We introduce an approach to identify disease-relevant tissues and cell types by analyzing gene expression data together with genome-wide association study (GWAS) summary statistics. Our approach uses stratified linkage disequilibrium (LD) score regression to test whether disease heritability is enriched in regions surrounding genes with the highest specific expression in a given tissue. We applied our approach to gene expression data from several sources together with GWAS summary statistics for 48 diseases and traits (average N = 169,331) and found significant tissue-specific enrichments (false discovery rate (FDR) < 5%) for 34 traits. In our analysis of multiple tissues, we detected a broad range of enrichments that recapitulated known biology. In our brain-specific analysis, significant enrichments included an enrichment of inhibitory over excitatory neurons for bipolar disorder, and excitatory over inhibitory neurons for schizophrenia and body mass index. Our results demonstrate that our polygenic approach is a powerful way to leverage gene expression data for interpreting GWAS signals.

PubMed Disclaimer

Conflict of interest statement

COMPETING FINANCIAL INTERESTS

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1
Overview of the approach. For each tissue in our gene expression data set, we compute t-statistics for differential expression for each gene. We then rank genes by t-statistic, take the top 10% of genes, and add a 100kb window to get a genome annotation. We use stratified LD score regression to test whether this annotation is significantly enriched for per-SNP heritability, conditional on the baseline model and the set of all genes.
Figure 2
Figure 2
Results of the multiple-tissue analysis for selected traits. Results for the remaining traits are displayed in Figure S1. Each point represents a tissue/cell type from either the GTEx data set or the Franke lab data set. Large points pass the FDR10(P)=2.75. GWAS data is described in Table S4, gene expression data is described in the Online Methods and Tables S2-3, and the statistical method is described in the Overview of Methods and the Online Methods. Numerical results are reported in Table S6.
Figure 3
Figure 3
Validation of gene expression results with chromatin data. (A) Examples of validation using chromatin data (bottom) of results from gene expression data (top), for selected traits. Results using chromatin data for all traits are displayed in Figure S5, with numerical results in Table S7. For the chromatin results, each point represents a track of peaks for H3K4me3, H3K4me1, H3K9ac, H3K27ac, H3K36me3, or DHS in a single tissue/cell type. (B) Results using gene expression data (including GTEx), Roadmap, and EN-TEx, for migraine (all subtypes) and migraine without aura. For both subfigures, large points pass the FDR10(P)=2.85 (chromatin) or –log10(P)=2.75 (gene expression). GWAS data is described in Table S4; gene expression data and chromatin data are described in the Online Methods, Tables S2-3, and Table S7; and the statistical method is described in the Overview of Methods and the Online Methods.
Figure 4
Figure 4
Results of the brain analysis for selected traits. Numerical results for all traits are reported in Table S8. (A) Results from within-brain analysis of 13 brain regions in GTEx, classified into four groups, for seven of 12 brain-related traits. Large points passed the FDR10(P)=2.34. (B) Results from the data of Cahoy et al. on three brain cell types for seven of 12 brain-related traits. Large points passed the FDR<5% cutoff, –log10(P)=2.22. (C) Results from PyschENCODE data on two neuronal subtypes for three of five neuron-related traits. Large points passed the Bonferroni significance threshold in this analysis, –log10(P)=2.06. GWAS data is described in Table S4, gene expression data is described in the Online Methods and Table S8, and the statistical method is described in the Overview of Methods and the Online Methods.
Figure 5
Figure 5
Results of the analysis of ImmGen gene expression data (top) and hematopoiesis ATAC-seq data (bottom) for selected traits. Results for the remaining traits are displayed in Figure S9. Large points passed the FDR10(P)=3.03 (Gene expression) or –log10(P)=2.32 (Chromatin). Numerical results are reported in Table S10. GWAS data is described in Table S4, gene expression and chromatin data is described in the Online Methods and Table S10, and the statistical method is described in the Overview of Methods and the Online Methods.

Comment in

  • Linking tissues to disease.
    Cloney R. Cloney R. Nat Rev Genet. 2018 Jun;19(6):328. doi: 10.1038/s41576-018-0009-y. Nat Rev Genet. 2018. PMID: 29686399 No abstract available.

Similar articles

Cited by

References

    1. The ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. - PMC - PubMed
    1. Kundaje A, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. - PMC - PubMed
    1. The GTEx Consortium. The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans. Science. 2015;348:648–660. - PMC - PubMed
    1. Ernst J, et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature. 2011;473:43–49. - PMC - PubMed
    1. Trynka G, et al. Chromatin marks identify critical cell types for fine mapping complex trait variants. Nat Genet. 2013;45:124–130. - PMC - PubMed

Publication types