Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Jan 6;53(D1):D925-D931.
doi: 10.1093/nar/gkae977.

The evolution of dbSNP: 25 years of impact in genomic research

Affiliations
Review

The evolution of dbSNP: 25 years of impact in genomic research

Lon Phan et al. Nucleic Acids Res. .

Abstract

The Single Nucleotide Polymorphism Database (dbSNP), established in 1998 by the National Center for Biotechnology Information (NCBI), has been a critical resource in genomics for cataloging small genetic variations. Originally focused on single nucleotide polymorphisms (SNPs), dbSNP has since expanded to include a variety of genetic variants, playing a key role in genome-wide association studies (GWAS), population genetics, pharmacogenomics, and cancer research. Over 25 years, dbSNP has grown to include more than 4.4 billion submitted SNPs and 1.1 billion unique reference SNPs, providing essential data for identifying disease-related genetic variants and studying human diversity. Integrating large-scale projects like 1000 Genomes, gnomAD, TOPMed, and ALFA has expanded dbSNP's catalog of human genetic variation, increasing its usefulness for research and clinical applications. Keeping up with advancements such as next-generation sequencing and cloud-based infrastructure, dbSNP remains a cornerstone of genetic research supporting continued discoveries in precision medicine and population genomics. DATABASE URL: https://www.ncbi.nlm.nih.gov/snp.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Timeline of dbSNP milestones and genomics submissions. This timeline categorizes key events in dbSNP history and related genomics submissions. Notably, the completion of the Human Genome Project in 2003 significantly increased dbSNP submissions. ‘Events’ include the establishment of dbSNP in 1998, reaching 1 million RS in 2000, the 2017 redesign and the 25th anniversary in 2024. ‘Submissions’ feature significant contributions like HapMap (2004), 1000 Genomes (2009), ExAC (2015), GnomAD (2017) and TopMed (2020) (13–17). The figure illustrates the chronological order and importance of these milestones.
Figure 2.
Figure 2.
Growth of dbSNP Submitted SNP (ss) and Reference SNP (rs) accessions over time. The growth of dbSNP records over time, highlighting the increase in both Submitted SNP (ss) and Reference SNP (rs) accessions. As observed, there was steady growth in the number of Submitted SNP (ss) accessions starting around 2005, with a significant acceleration in growth following the completion of the human genome in 2003 (Figure 1) and after 2010, driven by advances in sequencing technologies and major genomic projects. While the number of submitted SNPs (ss) continues to grow, the plateau of rs accessions after 2020 reflects variants based on GRCh38 genome assembly, though additional unknown variants may still be discovered through the Telomere-to-Telomere (T2T) genome and pangenome projects.
Figure 3.
Figure 3.
Gene locus, RNA and coding sequence (CDS) feature locations used for assignment of consequences and SO terms. This figure illustrates the locations of key gene features, including gene loci, RNA transcripts and coding sequences (CDS), used by dbSNP to assign functional consequences and Sequence Ontology (SO) terms to genetic variants. By mapping variants to specific locations within the gene, dbSNP classifies them into categories such as synonymous, missense, frameshift or splice site variants.
Figure 4.
Figure 4.
dbSNP RefSNP page for rs268, illustrating the layout and key sections of the redesigned interface. The page includes allele frequency data, clinical significance, and functional consequences for the selected rs. The ALFA Allele Frequency section provides aggregated allele frequency data from global populations, as shown by the specific sub-populations (e.g. European, African, East Asian, etc.). Further details and explanations of each tab and section are available in the Supplementary File (Supplementary Figure S1).

Similar articles

Cited by

References

    1. Sherry S.T., Ward M., Sirotkin K.. dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation. Genome Res. 1999; 9:677–679. - PubMed
    1. Ramos E.M., Hoffman D., Junkins H.A., Maglott D., Phan L., Sherry S.T., Feolo M., Hindorff L.A.. Phenotype-Genotype Integrator (PheGenI): synthesizing genome-wide association study (GWAS) data with existing genomic resources. European journal of human genetics. Eur. J. Hum. Genet. 2014; 22:144–147. - PMC - PubMed
    1. Hindorff L.A., Sethupathy P., Junkins H.A., Ramos E.M., Mehta J.P., Collins F.S., Manolio T.A.. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl. Acad. Sci. U.S.A. 2009; 106:9362–9367. - PMC - PubMed
    1. Akey J.M., Zhang G., Zhang K., Jin L., Shriver M.D.. Interrogating a high-density SNP map for signatures of natural selection. Genome Res. 2002; 12:1805–1814. - PMC - PubMed
    1. Tishkoff S.A., Reed F.A., Friedlaender F.R., Ehret C., Ranciaro A., Froment A., Hirbo J.B., Awomoyi A.A., Bodo J.M., Doumbo O.et al. .. The genetic structure and history of Africans and African Americans. Science. 2009; 324:1035–1044. - PMC - PubMed

LinkOut - more resources