Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 5;108(8):1436-1449.
doi: 10.1016/j.ajhg.2021.06.006. Epub 2021 Jul 2.

Targeted long-read sequencing identifies missing disease-causing variation

Affiliations

Targeted long-read sequencing identifies missing disease-causing variation

Danny E Miller et al. Am J Hum Genet. .

Abstract

Despite widespread clinical genetic testing, many individuals with suspected genetic conditions lack a precise diagnosis, limiting their opportunity to take advantage of state-of-the-art treatments. In some cases, testing reveals difficult-to-evaluate structural differences, candidate variants that do not fully explain the phenotype, single pathogenic variants in recessive disorders, or no variants in genes of interest. Thus, there is a need for better tools to identify a precise genetic diagnosis in individuals when conventional testing approaches have been exhausted. We performed targeted long-read sequencing (T-LRS) using adaptive sampling on the Oxford Nanopore platform on 40 individuals, 10 of whom lacked a complete molecular diagnosis. We computationally targeted up to 151 Mbp of sequence per individual and searched for pathogenic substitutions, structural variants, and methylation differences using a single data source. We detected all genomic aberrations-including single-nucleotide variants, copy number changes, repeat expansions, and methylation differences-identified by prior clinical testing. In 8/8 individuals with complex structural rearrangements, T-LRS enabled more precise resolution of the mutation, leading to changes in clinical management in one case. In ten individuals with suspected Mendelian conditions lacking a precise genetic diagnosis, T-LRS identified pathogenic or likely pathogenic variants in six and variants of uncertain significance in two others. T-LRS accurately identifies pathogenic structural variants, resolves complex rearrangements, and identifies Mendelian variants not detected by other technologies. T-LRS represents an efficient and cost-effective strategy to evaluate high-priority genes and regions or complex clinical testing results.

Keywords: long-read sequencing, adaptive sampling, nanopore sequencing, targeted long-read sequencing.

PubMed Disclaimer

Conflict of interest statement

P.S.K. reports receiving consulting fees from Sanofi Genzyme, Amicus Therapeutics, Maze Therapeutics, JCR Pharmaceutical, and Asklepios Biopharmaceutical, Inc; research and grant support from Sanofi Genzyme, Valerion Therapeutics, and Amicus Therapeutics; has equity in Asklepios Biopharmaceutical, Inc., and Maze Therapeutics; and is a member of the Pompe and Gaucher Disease Registry Advisory Board for Sanofi Genzyme, Amicus Therapeutics, and Baebies.

Figures

Figure 1
Figure 1
Targeted long-read sequencing simultaneously detects repeat expansion and methylation status Expansion and methylation of a GGC repeat in the 5′ UTR of XYLT1 is a common cause of Baratela-Scott syndrome. (A) Southern blot of family 04 reported by LaCroix and colleagues demonstrates that the proband (04-01) carries an expansion (1) of a region defined by two KpnI restriction enzyme sites containing a GGC repeat, the mother (04-02) carries one premutation (2) and one wild-type allele (3), and the father (04-03) carries two wild-type alleles (4). Both panels are from the same Southern blot on day 6 of exposure. (B) T-LRS of the trio revealed that the length of fragments from single reads spanning both KpnI cut sites used in (A) was consistent with the results from the Southern blot. Colored dots in (B) correspond to methylated (red) and non-methylated (blue) reads shown in (C); gray represents reads where methylation status was not determined. (C) Expansion of the GGC repeat in the proband results in methylation of the 5′ UTR and exon 1. Two reads in the mother are methylated (red), one of which spans the region between the KpnI cut sites and whose length is consistent with a premutation allele as shown in (B). The second methylated read terminates within the repeat and the length cannot be assayed.
Figure 2
Figure 2
Targeted long-read sequencing identifies additional structural differences not observed by standard clinical testing (A) T-LRS of individual S014 revealed two additional deletions and one rearrangement (inversion) not reported by CMA. Reanalysis of the CMA data confirmed deletion L. The “subway” plot shows how each region is connected and allows for reconstruction of the new DNA sequence and gene order in the individual. (B) Clinical CMA of individual S020 identified three deletions on chromosomes 4 and 14 and the subsequent karyotype revealed a complex translocation involving chromosomes 2, 4, 10, and 14. T-LRS identified 11 translocations, 13 rearrangements, and 6 deletions directly affecting 12 genes. Reconstruction of each derivative chromosome estimates the size of each event, as represented by the boxes surrounding part of the derivative chromosomes on the karyotype and is consistent with expected sizes based on karyotype.
Figure 3
Figure 3
Targeted long-read sequencing identifies variants not detected by clinical testing Pathogenic, likely pathogenic, or variants of uncertain significance (VUSs) identified by T-LRS along with variants identified by prior clinical testing (denoted by an asterisk). (A) T-LRS detected a candidate intronic splice acceptor variant as well as the known paternally inherited stop-gain. Long-read phasing demonstrates that these variants are in trans. (B) A 1,900 bp deletion within HPS1 removes exon 3; phasing revealed that this variant and the previously known paternally inherited stop-gain occur on different haplotypes. Clinical testing with an exon-level array confirmed the deletion. (C) T-LRS reveals a previously known paternally inherited stop-gain as well as a novel Alu insertion in exon 20 of ALMS1. Subsequent clinical testing confirmed the Alu was pathogenic and maternally inherited. (D) A 187 bp deletion and 17 Mbp inversion disrupts HPRT1. Clinical testing confirmed the presence of an inversion. (E) Insertion of a 1,500 bp composite retrotransposable element is predicted to create multiple splice acceptor and donor sites and represents a candidate second hit. Linkage disequilibrium phasing suggests the variants are on different haplotypes. (F) Expansion of an AGAA repeat within DMD represents a VUS in an individual with Duchenne muscular dystrophy and a family history lacking a genetic diagnosis.

Similar articles

Cited by

References

    1. Lowther C., Valkanas E., Giordano J.L., Wang H.Z., Currall B.B., O’Keefe K., Collins R.L., Zhao X., Austin-Tse C.A., Evangelista E. Systematic evaluation of genome sequencing as a first-tier diagnostic test for prenatal and pediatric disorders. bioRxiv. 2020 doi: 10.1101/531210. - DOI
    1. Boycott K.M., Rath A., Chong J.X., Hartley T., Alkuraya F.S., Baynam G., Brookes A.J., Brudno M., Carracedo A., den Dunnen J.T. International Cooperation to Enable the Diagnosis of All Rare Genetic Diseases. Am. J. Hum. Genet. 2017;100:695–705. - PMC - PubMed
    1. Frésard L., Montgomery S.B. Diagnosing rare diseases after the exome. Cold Spring Harb. Mol. Case Stud. 2018;4:a003392. - PMC - PubMed
    1. Ewans L.J., Schofield D., Shrestha R., Zhu Y., Gayevskiy V., Ying K., Walsh C., Lee E., Kirk E.P., Colley A. Whole-exome sequencing reanalysis at 12 months boosts diagnosis and is cost-effective when applied early in Mendelian disorders. Genet. Med. 2018;20:1564–1574. - PubMed
    1. Eichler E.E. Genetic Variation, Comparative Genomics, and the Diagnosis of Disease. N. Engl. J. Med. 2019;381:64–74. - PMC - PubMed

Publication types