Control of confounding of genetic associations in stratified populations
- PMID: 12817591
- PMCID: PMC1180309
- DOI: 10.1086/375613
Control of confounding of genetic associations in stratified populations
Abstract
To control for hidden population stratification in genetic-association studies, statistical methods that use marker genotype data to infer population structure have been proposed as a possible alternative to family-based designs. In principle, it is possible to infer population structure from associations between marker loci and from associations of markers with the trait, even when no information about the demographic background of the population is available. In a model in which the total population is formed by admixture between two or more subpopulations, confounding can be estimated and controlled. Current implementations of this approach have limitations, the most serious of which is that they do not allow for uncertainty in estimations of individual admixture proportions or for lack of identifiability of subpopulations in the model. We describe methods that overcome these limitations by a combination of Bayesian and classical approaches, and we demonstrate the methods by using data from three admixed populations--African American, African Caribbean, and Hispanic American--in which there is extreme confounding of trait-genotype associations because the trait under study (skin pigmentation) varies with admixture proportions. In these data sets, as many as one-third of marker loci show crude associations with the trait. Control for confounding by population stratification eliminates these associations, except at loci that are linked to candidate genes for the trait. With only 32 markers informative for ancestry, the efficiency of the analysis is 70%. These methods can deal with both confounding and selection bias in genetic-association studies, making family-based designs unnecessary.
Figures




Similar articles
-
Validation of a small set of ancestral informative markers for control of population admixture in African Americans.Am J Epidemiol. 2011 Mar 1;173(5):587-92. doi: 10.1093/aje/kwq401. Epub 2011 Jan 24. Am J Epidemiol. 2011. PMID: 21262910 Free PMC article.
-
Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.Genetics. 2003 Aug;164(4):1567-87. doi: 10.1093/genetics/164.4.1567. Genetics. 2003. PMID: 12930761 Free PMC article.
-
Measuring and using admixture to study the genetics of complex diseases.Hum Genomics. 2003 Nov;1(1):52-62. doi: 10.1186/1479-7364-1-1-52. Hum Genomics. 2003. PMID: 15601533 Free PMC article.
-
Mapping of disease-associated variants in admixed populations.Genome Biol. 2011;12(5):223. doi: 10.1186/gb-2011-12-5-223. Epub 2011 May 30. Genome Biol. 2011. PMID: 21635713 Free PMC article. Review.
-
Unbiased methods for population-based association studies.Genet Epidemiol. 2001 Dec;21(4):273-84. doi: 10.1002/gepi.1034. Genet Epidemiol. 2001. PMID: 11754464 Review.
Cited by
-
Evidence of associations between cytokine genes and subjective reports of sleep disturbance in oncology patients and their family caregivers.PLoS One. 2012;7(7):e40560. doi: 10.1371/journal.pone.0040560. Epub 2012 Jul 23. PLoS One. 2012. PMID: 22844404 Free PMC article.
-
Fine-mapping in African-American women confirms the importance of the 10p12 locus to sarcoidosis.Genes Immun. 2012 Oct;13(7):573-8. doi: 10.1038/gene.2012.42. Epub 2012 Sep 13. Genes Immun. 2012. PMID: 22972473 Free PMC article.
-
Phenotypic and molecular characteristics associated with various domains of quality of life in oncology patients and their family caregivers.Qual Life Res. 2016 Nov;25(11):2853-2868. doi: 10.1007/s11136-016-1310-x. Epub 2016 May 9. Qual Life Res. 2016. PMID: 27160108 Free PMC article.
-
An Ancestry Informative Marker Set Which Recapitulates the Known Fine Structure of Populations in South Asia.Genome Biol Evol. 2018 Sep 1;10(9):2408-2416. doi: 10.1093/gbe/evy182. Genome Biol Evol. 2018. PMID: 30184103 Free PMC article.
-
Genome-wide detection of allele specific copy number variation associated with insulin resistance in African Americans from the HyperGEN study.PLoS One. 2011;6(8):e24052. doi: 10.1371/journal.pone.0024052. Epub 2011 Aug 25. PLoS One. 2011. PMID: 21901158 Free PMC article.
References
Electronic-Database Information
-
- dbSNP Home Page, http://www.ncbi.nlm.nih.gov/SNP/
-
- Genetic Epidemiology Group, http://www.lshtm.ac.uk/eu/genetics (for AdmixMap program)
References
-
- Akey JM, Sosnoski D, Parra E, Dios S, Hiester K, Su B, Bonilla C, Jin L, Shriver MD (2001) Melting curve analysis of SNPs (McSNP): a simple gel-free low-cost approach to SNP genotyping and DNA fragment analysis. Biotechniques 30:358–362 - PubMed
-
- Anonymous (1999) Freely associating. Nat Genet 22:1–2 - PubMed
-
- Cardon LR, Bell JI (2001) Association study designs for complex diseases. Nat Rev Genet 2:91–99 - PubMed
-
- Cardon LR, Palmer LJ (2003) Population stratification and spurious allelic association. Lancet 361:598–604 - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources