Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 Jun;72(6):1492-1504.
doi: 10.1086/375613.

Control of confounding of genetic associations in stratified populations

Affiliations
Comparative Study

Control of confounding of genetic associations in stratified populations

Clive J Hoggart et al. Am J Hum Genet. 2003 Jun.

Abstract

To control for hidden population stratification in genetic-association studies, statistical methods that use marker genotype data to infer population structure have been proposed as a possible alternative to family-based designs. In principle, it is possible to infer population structure from associations between marker loci and from associations of markers with the trait, even when no information about the demographic background of the population is available. In a model in which the total population is formed by admixture between two or more subpopulations, confounding can be estimated and controlled. Current implementations of this approach have limitations, the most serious of which is that they do not allow for uncertainty in estimations of individual admixture proportions or for lack of identifiability of subpopulations in the model. We describe methods that overcome these limitations by a combination of Bayesian and classical approaches, and we demonstrate the methods by using data from three admixed populations--African American, African Caribbean, and Hispanic American--in which there is extreme confounding of trait-genotype associations because the trait under study (skin pigmentation) varies with admixture proportions. In these data sets, as many as one-third of marker loci show crude associations with the trait. Control for confounding by population stratification eliminates these associations, except at loci that are linked to candidate genes for the trait. With only 32 markers informative for ancestry, the efficiency of the analysis is 70%. These methods can deal with both confounding and selection bias in genetic-association studies, making family-based designs unnecessary.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Directed graphical model for dependence-of-trait measurement and genotypes on admixture proportions and locus ancestry. Observed data (double-edged rectangles), stochastic nodes (ellipses), stochastic dependence (continuous arrows), and strata (single-edged rectangles) of individuals and loci, are shown.
Figure  2
Figure 2
Effect of adjustment for population stratification on P values for association with skin pigmentation in African American. Loci for which unadjusted associations are significant at P<.05 are shown as shaded squares.
Figure  3
Figure 3
Effect of adjustment for population stratification on P values for association with skin pigmentation in African Caribbeans. Loci for which unadjusted associations are significant at P<.05 are shown as shaded squares.
Figure  4
Figure 4
Effect of adjustment for population stratification on P values for association with skin reflectance in Hispanic Americans. Loci for which unadjusted associations are significant at P<.05 are shown as shaded squares.

Similar articles

Cited by

References

Electronic-Database Information

    1. dbSNP Home Page, http://www.ncbi.nlm.nih.gov/SNP/
    1. Genetic Epidemiology Group, http://www.lshtm.ac.uk/eu/genetics (for AdmixMap program)

References

    1. Akey JM, Sosnoski D, Parra E, Dios S, Hiester K, Su B, Bonilla C, Jin L, Shriver MD (2001) Melting curve analysis of SNPs (McSNP): a simple gel-free low-cost approach to SNP genotyping and DNA fragment analysis. Biotechniques 30:358–362 - PubMed
    1. Allison DB (1997) Transmission/disequilibrium tests for quantitative traits. Am J Hum Genet 60:676–690 - PMC - PubMed
    1. Anonymous (1999) Freely associating. Nat Genet 22:1–2 - PubMed
    1. Cardon LR, Bell JI (2001) Association study designs for complex diseases. Nat Rev Genet 2:91–99 - PubMed
    1. Cardon LR, Palmer LJ (2003) Population stratification and spurious allelic association. Lancet 361:598–604 - PubMed

Publication types

Substances

LinkOut - more resources