| Literature DB >> 23162558 |
Jian Wu1, Bo Liu, Feng Cheng, Nirala Ramchiary, Su Ryun Choi, Yong Pyo Lim, Xiao-Wu Wang.
Abstract
Sequencing of the chloroplast (cp) genome using traditional sequencing methods has been difficult because of its size (>120 kb) and the complicated procedures required to prepare templates. To explore the feasibility of sequencing the cp genome using DNA extracted from whole cells and Solexa sequencing technology, we sequenced whole cellular DNA isolated from leaves of three Brassicarapa accessions with one lane per accession. In total, 246, 362, and 361 Mb sequence data were generated for the three accessions Chiifu-401-42, Z16, and FT, respectively. Micro-reads were assembled by reference-guided assembly using the cpDNA sequences of B. rapa, Arabidopsis thaliana, and Nicotiana tabacum. We achieved coverage of more than 99.96% of the cp genome in the three tested accessions using the B. rapa sequence as the reference. When A. thaliana or N. tabacum sequences were used as references, 99.7-99.8 or 95.5-99.7% of the B. rapa cp genome was covered, respectively. These results demonstrated that sequencing of whole cellular DNA isolated from young leaves using the Illumina Genome Analyzer is an efficient method for high-throughput sequencing of cp genome.Entities:
Keywords: Brassica rapa; Solexa sequencing technology; chloroplast genome; sequencing; whole cellular DNA
Year: 2012 PMID: 23162558 PMCID: PMC3492724 DOI: 10.3389/fpls.2012.00243
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Schematic view of micro-reads assembly method for chloroplast (cp) genome. De novo assembled contigs (blue bars) are aligned to reference (red bar) to extract sequences generated from cp genome (thin pink bars). Draft consensus (thick pink bar) was constructed guided by reference. Gaps were filled by extending sequence and joining two contigs that overlapped (green bar) by 10 or more nt.
Characteristics of reads from one lane sequencing on Illumina Solexa 1G Genome sequencer.
| Chiifu-402-41 | Z16 | FT | |
|---|---|---|---|
| Total reads | 7,015,639 | 10,313,714 | 10,356,209 |
| Aligned reads | 653,057 | 2,721,148 | 1,073,449 |
| Aligned ratio (%) | 9.3 | 26.4 | 10.4 |
| Mean read depth (-fold) | 103 | 550 | 217 |
| N50 (bp) | 13,509 | 3997 | 7461 |
Mean read depth was calculated by including one copy of inverted repeats.
Figure 2Plot showing sequencing depth by position for chloroplast genomes of three . Number of micro-reads per position (y-axis) is plotted against position in the assembly (x-axis, in kb) in a window size of 100 bp. Numbers above x-axis indicate boundary sites of large single copy (LSC) and small single copy (SSC), and two inverted repeats (IRa and IRb).
Assembly of chloroplast genomes of three .
| Accession | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | No. of gaps | Total length of gaps (bp) | Coverage (%) | No. of gaps | Total length of gaps (bp) | Error base | Coverage (%) | No. of gaps | Total gap length (bp) | Error base | |
| Chiifu | 99.99 | 1 | 8 | 99.83 | 4 | 156 | 0 | 97.77 | 4 | 1048 | 0 |
| Z16 | 99.96 | 7 | 52 | 99.74 | 7 | 310 | 12 | 95.52 | 21 | 4161 | 34 |
| FT | 99.96 | 5 | 51 | 99.73 | 6 | 481 | 9 | 96.19 | 10 | 2339 | 3 |
Figure 3Comparison of assemblies for Chiifu-402-41 guided by chloroplast genome sequences of . Assembled consensuses are represented by three bars. For B. rapa, black parts of bar indicate LSC or SSC; red parts of bar indicate IRa or IRb. For A. thaliana and N. tabacum, silver parts indicate gaps. Blue block between bars of B. rapa and A. thaliana or N. tabacum indicates identity >95%.
Figure 4Circular gene map of . The thick lines indicate the extent of the inverted repeats (IRa and IRb, 26,213 bp), which separate the genome into small (SSC, 17,777 bp) and large (LSC, 83,282 bp) single copy regions. Genes on the outside of the map are transcribed in the clockwise direction and genes on the inside of the map are transcribed in the counterclockwise direction.
Sequence polymorphisms identified by reference-guided assembly of cp genomes of .
| Chiifu-402-41 | Z16 | FT | |
|---|---|---|---|
| No. of SNP | 1 | 31 | 8 |
| No. of InDel | 0 | 4 | 4 |
No. of SNPs (single nucleotide polymorphisms) and No. of insertions/deletions (InDels) are those compared with reference (DQ231548).