| Literature DB >> 26295943 |
Karen N McFarland1, Jilin Liu1, Ivette Landrian1, Ronald Godiska2, Savita Shanker3, Fahong Yu3, William G Farmerie3, Tetsuo Ashizawa1.
Abstract
A large, non-coding ATTCT repeat expansion causes the neurodegenerative disorder, spinocerebellar ataxia type 10 (SCA10). In a subset of SCA10 patients, interruption motifs are present at the 5' end of the expansion and strongly correlate with epileptic seizures. Thus, interruption motifs are a predictor of the epileptic phenotype and are hypothesized to act as a phenotypic modifier in SCA10. Yet, the exact internal sequence structure of SCA10 expansions remains unknown due to limitations in current technologies for sequencing across long extended tracts of tandem nucleotide repeats. We used the third generation sequencing technology, Single Molecule Real Time (SMRT) sequencing, to obtain full-length contiguous expansion sequences, ranging from 2.5 to 4.4 kb in length, from three SCA10 patients with different clinical presentations. We obtained sequence spanning the entire length of the expansion and identified the structure of known and novel interruption motifs within the SCA10 expansion. The exact interruption patterns in expanded SCA10 alleles will allow us to further investigate the potential contributions of these interrupting sequences to the pathogenic modification leading to the epilepsy phenotype in SCA10. Our results also demonstrate that SMRT sequencing is useful for deciphering long tandem repeats that pose as "gaps" in the human genome sequence.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26295943 PMCID: PMC4546671 DOI: 10.1371/journal.pone.0135906
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1SCA10 expansion templates for SMRT sequencing.
(A) PCR amplification of the SCA10 expansion from gDNA extracted from blood lymphocytes (Subjects A and B) or from somatic cell hybrid lines (subject C). Lanes are cropped from non-adjacent lanes of the same gel. The full gel is shown is S1 Fig. Arrows indicate the size of bands that were excised for cloning and sequencing (subject A, the 6.5 kb band; subject B, the 5.9 kb band; subject C, the 4.7 kb band) (B) Purified template from cloned PCR products in Fig 1A for SMRT sequencing. L: 1 kb ladder (New England Biolabs). The full gels are shown in S2 Fig.
SMRT sequencing results summary.
| Subject A | Subject B | Subject C | ||||
|---|---|---|---|---|---|---|
| Gel Estimated | SMRT sequence | Gel Estimated | SMRT sequence | Gel Estimated | SMRT sequence | |
|
| 6500 bp (~840 rpts) | 6475 nts (870 rpts | 5900 bp (~820 rpts) | 5951 nts (884 rpts | 4700 bp (~530 rpts) | 4476 nts (514 rpts |
|
| 99.9% identity of 1038 bp | 99.6% identity of 904 bp | 99.1% identity of 650 bp | |||
|
| 98.5% identity of 796 bp | 98.4% identity of 733 bp | 98.2% identity of 935 bp | |||
|
| 100% identity of 626 bp | 99.5% identity of 485 bp | 99.7% identity of 350 bp | |||
|
| 99.6% identity of 564 bp | 99.1% identity of 1297 bp | 99.1% identity of 454 bp | |||
§as estimated by gel electrophoresis of cloned expansion fragment excised from plasmid backbone.
As compared with Sanger sequencing.
*Alignment based on reference genome sequence (NC_000022.11; GI: 568815576, Region: 45794523..45796589) using LALIGN.
^as determined by counting motif blocks in Fig 2. Rpts, repeats; nts, nucleotides; bp, base pairs
Fig 2Schematic representations of the repeat expansions.
(A) SCA10 expansion in subject A. (B) SCA10 expansion in subject B. (C) SCA10 expansion in subject C. Rectangles represent sequence motifs, as indicated by the color key, in the 5’ (upper left) to 3’ (lower right) direction. Black rectangles indicate unverified motifs described further in Fig 3B and are indicated as follows: A: ATTTCT, ATTTCT; B: ATTTCT, A, ATTTCT, A, ATTCCT, TAC, ATTTCT, A, ATT, ACTTCT, ATTCA, ATTTCT, ATTTCT, T, ACTTTCT, TCTTTCT, ATTT, ATTTCT, ATCT, ATTTCT, ATTTCT, ATTTCT, ATTTCT, ATTTCT, T, ATCC, ATTC, ATTTCC, C, ATTTCC, TTCCC, ATTTCC, CATCC, ATTTCC, C, C, C, C, ATTC, ATTTCC, ATTCC; C: ATCT, ATCT, ATCT, AT, ATCT, T, ATC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, C, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, ATCC, C
Fig 3Proportion of repeat motifs in SCA10 expansions.
Proportions are calculated as the percentage of nucleotides of each motifs divided by the total number of nucleotides for each expansion. Motifs present in SMRT sequence results that are verified by Sanger sequencing methods (“shotgun”) comprise the majority of motifs seen (A) while some motifs are unverified (B). Green, SMRT sequencing results from subject A; Green hatched, random shotgun sequencing results from subject A; Blue, SMRT sequencing results from subject B; Red, SMRT sequencing result from subject C; Red hatched, random shotgun sequencing results from subject C.
Fig 4pJAZZ-OCmin vector.
The telN gene and several non-essential genes from phage N15 (purple boxes) were deleted from the vector pJAZZ-OC. repA, cB: replication protein genes; Camr, chloramphenicol resistance; T, terminator. Closed terminal hairpin structures are indicated by black circles.