ExpansionHunter: a sequence-graph-based tool to analyze variation in short tandem repeat regions
- PMID: 31134279
- PMCID: PMC6853681
- DOI: 10.1093/bioinformatics/btz431
Abstract
Summary: We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci.
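To make the sequence-graph idea concrete, here is a minimal sketch and not ExpansionHunter's actual implementation. It assumes a regex-like locus notation such as `ACGT(CAG)*CAACAG(CCG)*TTGA`, where each `(motif)*` becomes a graph node with a self-loop, and it assumes the read spans the whole locus with no sequencing errors. The names `Node`, `parse_locus`, and `count_units` are hypothetical. A real genotyper would also have to handle mismatches, partially overlapping reads, and diploid genotype calls.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass
class Node:
    seq: str
    is_repeat: bool  # True when the node carries a self-loop


def parse_locus(structure: str) -> list[Node]:
    """Split a locus description into a linear chain of graph nodes."""
    nodes = []
    for m in re.finditer(r"\(([ACGT]+)\)\*|([ACGT]+)", structure):
        if m.group(1):
            nodes.append(Node(m.group(1), True))   # repeat unit, may be traversed 0..n times
        else:
            nodes.append(Node(m.group(2), False))  # flank or interruption, traversed once
    return nodes


def count_units(nodes: list[Node], read: str, i: int = 0, pos: int = 0):
    """Exact-match walk of the read through the graph.

    Returns the number of times each repeat node was traversed,
    or None if the read does not spell a path through the graph.
    """
    if i == len(nodes):
        return [] if pos == len(read) else None
    node = nodes[i]
    if not node.is_repeat:
        if read.startswith(node.seq, pos):
            return count_units(nodes, read, i + 1, pos + len(node.seq))
        return None
    # Repeat node: find the longest run of the motif, then backtrack if needed.
    max_units = 0
    while read.startswith(node.seq, pos + max_units * len(node.seq)):
        max_units += 1
    for n in range(max_units, -1, -1):
        rest = count_units(nodes, read, i + 1, pos + n * len(node.seq))
        if rest is not None:
            return [n] + rest
    return None


locus = parse_locus("ACGT(CAG)*CAACAG(CCG)*TTGA")
read = "ACGT" + "CAG" * 5 + "CAACAG" + "CCG" * 3 + "TTGA"
print(count_units(locus, read))  # [5, 3]
```

Because both repeats live in one graph, a single read yields a count for each of them at once, which is the situation the summary describes for repeats adjacent to other variable sequence.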
Availability and implementation: ExpansionHunter is implemented in C++ and is available under the Apache License Version 2.0. The source code, documentation, and Linux/macOS binaries are available at https://github.com/Illumina/ExpansionHunter/.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.