| Literature DB >> 24935865 |
Yuki Monden1, Ayaka Yamamoto2, Akiko Shindo1, Makoto Tahara3.
Abstract
In many crop species, DNA fingerprinting is required for the precise identification of cultivars to protect the rights of breeders. Many families of retrotransposons have multiple copies throughout the eukaryotic genome and their integrated copies are inherited genetically. Thus, their insertion polymorphisms among cultivars are useful for DNA fingerprinting. In this study, we conducted a DNA fingerprinting based on the insertion polymorphisms of active retrotransposon families (Rtsp-1 and LIb) in sweet potato. Using 38 cultivars, we identified 2,024 insertion sites in the two families with an Illumina MiSeq sequencing platform. Of these insertion sites, 91.4% appeared to be polymorphic among the cultivars and 376 cultivar-specific insertion sites were identified, which were converted directly into cultivar-specific sequence-characterized amplified region (SCAR) markers. A phylogenetic tree was constructed using these insertion sites, which corresponded well with known pedigree information, thereby indicating their suitability for genetic diversity studies. Thus, the genome-wide comparative analysis of active retrotransposon insertion sites using the bench-top MiSeq sequencing platform is highly effective for DNA fingerprinting without any requirement for whole genome sequence information. This approach may facilitate the development of practical polymerase chain reaction-based cultivar diagnostic system and could also be applied to the determination of genetic relationships.Entities:
Keywords: DNA fingerprinting; high-throughput sequencing; molecular marker; retrotransposon; sweet potato
Mesh:
Substances:
Year: 2014 PMID: 24935865 PMCID: PMC4195495 DOI: 10.1093/dnares/dsu015
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.Scheme of the MiSeq sequencing library in this study. The 150-bp paired reads (black arrow) represent the junction sequence of the retrotransposon (Rtsp-1 or LIb) (red box) and the insertion site sequence from the AP3 adapter (purple box). The TruSeq adapters used in this study were indexed with short 6–8 bp sequences to facilitate multiplexing in a single sequencing run. The P5 and P7 sequences (blue box) allowed the final products to hybridize to the Illumina flow cell. The Rd1 and Rd2 sequences (green box) served as sequencing primers.
Summary of the MiSeq read data processing
| Retrotransposon | Analysis | No. of read pairs | No. of collapsed reads (≥10) | Ratio (%) | No. of clusters |
|---|---|---|---|---|---|
| Raw data | 3,567,587 | 100 | |||
| Validation of retrotransposon sequence | 3,095,771 | 16,428 | 86.8 | ||
| Trimming to 50 bp | 3,095,771 | 8,038 | 86.8 | ||
| QV filtering (≥20) | 2,292,344 | 5,245 | 64.3 | ||
| Outlier filtering (>99%) | 2,273,707 | 5,218 | 63.7 | ||
| Clustering with BLATa | 1,497 | ||||
| Raw data | 834,052 | 100 | |||
| Validation of retrotransposon sequence | 743,945 | 3,608 | 89.2 | ||
| Poly(A) tail trimming (A ≥ 3) | 743,935 | 3,571 | 89.2 | ||
| Trimming to 50 bp | 743,935 | 3,154 | 89.2 | ||
| QV filtering (≥20) | 434,502 | 1,619 | 52.1 | ||
| Outlier filtering (>98%) | 427,120 | 1,603 | 51.2 | ||
| Clustering with BLATa | 527 |
aThe clusters include groups of two or more sequences and non-clustered single sequences for the insertion sites of Rtsp-1and LIb, where the minimum number of reads for an identical sequence was 10.
Summary of the insertion sites identified
| All insertion sites | No. of polymorphic sites | Polymorphic ratio (%) | No. of cultivar-specific sites | Cultivar-specific ratio (%) | |
|---|---|---|---|---|---|
| 1,497 | 1,327 | 88.6 | 291 | 19.4 | |
| 527 | 523 | 99.2 | 85 | 16.1 | |
| Total | 2,024 | 1,850 | 91.4 | 376 | 18.6 |
Figure 2.The numbers of Rtsp_1 and LIb insertion sites identified in this study. The box plots show the distributions of these sites in the 38 cultivars. The central rectangle spans the first quartile to the third quartile. The line inside the rectangle shows the median number (257.5 and 92.5 for Rtsp-1 and LIb, respectively), and the whiskers above and below the box show the maximum and minimum number, respectively.
Figure 3.Agarose gel image showing the SCAR markers developed in this study. PCR amplification was conducted using the cultivar-specific insertion primer (RP_*) and the Rtsp-1-specific primer (Rtsp-1_ppt). Descriptions of these primers are provided in Supplementary Table S2. The arrows indicate the strong and reproducible bands that were specific for each cultivar. Lanes 1–38 correspond to the cultivars listed in Supplementary Table S1.
Figure 4.Phylogenetic analysis based on the Rtsp-1 insertion polymorphisms of the cultivars. The phylogenetic tree was constructed using the neighbour-joining method. The bootstrap values are shown. +The genetic relationships based on the Rtsp-1 and LIb (Supplementary Fig. S1) insertion polymorphisms were well consistent. *The genetic relationships among cultivars agreed well with the pedigree information (Supplementary Fig. S2).