| Literature DB >> 26061691 |
Jia-Yee S Yap1, Thore Rohner2, Abigail Greenfield1, Marlien Van Der Merwe3, Hannah McPherson3, Wendy Glenn4, Geoff Kornfeld4, Elessa Marendy4, Annie Y H Pan4, Alan Wilton4, Marc R Wilkins4, Maurizio Rossetto3, Sven K Delaney4.
Abstract
The Wollemi pine (Wollemia nobilis) is a rare Southern conifer with striking morphological similarity to fossil pines. A small population of W. nobilis was discovered in 1994 in a remote canyon system in the Wollemi National Park (near Sydney, Australia). This population contains fewer than 100 individuals and is critically endangered. Previous genetic studies of the Wollemi pine have investigated its evolutionary relationship with other pines in the family Araucariaceae, and have suggested that the Wollemi pine genome contains little or no variation. However, these studies were performed prior to the widespread use of genome sequencing, and their conclusions were based on a limited fraction of the Wollemi pine genome. In this study, we address this problem by determining the entire sequence of the W. nobilis chloroplast genome. A detailed analysis of the structure of the genome is presented, and the evolution of the genome is inferred by comparison with the chloroplast sequences of other members of the Araucariaceae and the related family Podocarpaceae. Pairwise alignments of whole genome sequences, and the presence of unique pseudogenes, gene duplications and insertions in W. nobilis and Araucariaceae, indicate that the W. nobilis chloroplast genome is most similar to that of its sister taxon Agathis. However, the W. nobilis genome contains an unusually high number of repetitive sequences, and these could be used in future studies to investigate and conserve any remnant genetic diversity in the Wollemi pine.Entities:
Mesh:
Year: 2015 PMID: 26061691 PMCID: PMC4464890 DOI: 10.1371/journal.pone.0128126
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Sequence map of the Wollemia nobilis chloroplast genome.
Genes drawn outside of the circle are transcribed clockwise, while genes shown on the inside of the circle are transcribed counter-clockwise. Genes belonging to different functional groups are colour-coded. The darker gray in the inner circle indicates GC content, while the lighter gray corresponds to AT content.
Comparison of chloroplast genome characteristics in different species of Araucariaceae and Podocarpaceae.
| Araucariaceae | Podocarpaceae | ||||
|---|---|---|---|---|---|
| Characteristics |
|
|
|
|
|
| GenBank Accession no. | KP259800 | AB830884 | AB830885 | NC_020361.1 | NC_020361.1 |
| Size (bp) | 145,630 | 145,625 | 133,722 | 133,734 | 133,259 |
| GC content (%) | 36.50 | 36.54 | 37.26 | 37.100 | 37.16 |
| Total number of genes | 122 | 122 (123) | 117 (120) | 119 (118) | 94 (120) |
| Total number of unique genes | 118 | (119) | (118) | 118 (117) | (118) |
| Protein-coding genes | 82 | 81 (82) | 81 (82) | 82 | 75 (82) |
| Ribosomal RNAs | 5 | 5 | 4 | 4 | 4 |
| Transfer RNAs | 35 | 36 | 32 (34) | 31 (32) | 15 (34) |
| Protein-coding genes (bp) | 75,300 | 75,271 | 74,781 | 74,217 | 74,607 |
| Ribosomal RNAs (bp) | 4,636 | 4,638 | 4,529 | 4,504 | 4,501 |
| Transfer RNAs (bp) | 2,628 | 2,699 | 2,418 | 2,409 | 2,487 |
| Introns (bp) | 11,857 | 11,890 | 11,487 | 10,445 | 9,710 |
| Spacers (bp) | 51,189 | 51,127 | 40,507 | 42,159 | 41,821 |
| AT content (%) | |||||
| Genome | 63.51 | 63.46 | 62.74 | 62.90 | 62.84 |
| Protein-coding genes | 62.41 | 62.34 | 61.86 | 61.87 | 61.90 |
| Transfer RNA genes | 46.47 | 46.46 | 46.80 | 46.66 | 47.30 |
| Ribosomal RNA genes | 45.75 | 45.90 | 46.01 | 45.87 | 45.90 |
| Introns | 62.39 | 62.61 | 61.92 | 62.13 | 61.50 |
| Spacers | 67.88 | 67.85 | 67.44 | 67.62 | 62.84 |
[25]
[23]
In parentheses is the value observed after these published chloroplast genomes were re-annotated using tRNAscan-SE [39] and reference to the Cedrus deodara (NC_014575) plastome.
List of genes identified in the chloroplast genome of W. nobilis.
| Functional category | Group of genes | Name of genes | |||||
|---|---|---|---|---|---|---|---|
| Self-replication | Ribosomal RNA genes |
|
|
|
| ||
| Transfer RNA genes |
|
|
|
|
|
| |
|
|
|
|
|
|
| ||
|
|
|
|
|
|
| ||
|
|
|
|
|
|
| ||
|
|
|
|
|
|
| ||
|
|
| ||||||
| Small subunit of ribosome |
|
|
|
|
|
| |
|
|
|
|
|
| |||
| Large subunit of ribosome |
|
|
|
|
|
| |
|
|
|
| |||||
| DNA-dependent RNA polymerase |
|
|
|
| |||
| Translational initiation factor |
| ||||||
| Genes for photosynthesis | Subunits of photosystem I |
|
|
|
|
|
|
|
|
| ||||||
| Subunits of photosystem II |
|
|
|
|
|
| |
|
|
|
|
|
|
| ||
|
|
|
| |||||
| Subunits of cytochrome |
|
|
|
|
|
| |
| Subunits of ATP synthase |
|
|
|
|
|
| |
| Large subunit of Rubisco |
| ||||||
| Chlorophyll biosynthesis |
|
|
| ||||
| Subunits of NADH dehydrogenase |
|
|
|
|
|
| |
|
|
|
|
|
| |||
| Other genes | Maturase |
| |||||
| Envelope membrane protein |
| ||||||
| Subunit of acetyl-CoA |
| ||||||
| C-type cytochrome synthesis gene |
| ||||||
| Protease |
| ||||||
| Component of TIC complex |
| ||||||
| Genes of unknown function | Conserved open reading frames |
| |||||
*genes with introns
**duplicated genes
Fig 2MAUVE alignment of W. nobilis and A. dammara chloroplast genomes.
The W. nobilis genome is shown at top as the reference genome. Within each of the alignments, local collinear blocks are represented by blocks of the same colour connected by lines. Note that the two LCBs in the A. dammara genome are both inverted relative to the W. nobilis genome.
Distribution of tandem repeats in the W. nobilis chloroplast genome.
| Serial No. | Indices | Repeat length | Size of repeat unit X Copy number | Location |
|---|---|---|---|---|
| 1 | 4445–4492 | 54 | 18x3 |
|
| 2 | 18715–18751 | 36 | 12x3 |
|
| 3 | 25945–25971 | 24 | 12x2 |
|
| 4 | 26323–26359 | 36 | 18x2 |
|
| 5 | 34174–34345 | 171 | 57x3 |
|
| 6 | 36587–36660 | 72 | 36x2 |
|
| 7 | 37477–37502 | 24 | 12x2 |
|
| 8 | 37734–37762 | 28 | 14x2 |
|
| 9 | 47091–47126 | 32 | 16x2 |
|
| 10 | 64129–64185 | 60 | 15x4 |
|
| 11 | 65803–65865 | 66 | 33x2 |
|
| 12 | 65853–65894 | 45 | 15x3 |
|
| 13 | 66632–66672 | 42 | 21x2 |
|
| 14 | 67290–67332 | 42 | 21x2 |
|
| 15 | 69017–69356 | 330 | 30x11 |
|
| 16 | 72719–72756 | 38 | 19x2 |
|
| 17 | 74646–74676 | 32 | 16x2 |
|
| 18 | 84689–84713 | 26 | 13x2 |
|
| 19 | 91442–91485 | 44 | 22x2 |
|
| 20 | 92695–92802 | 108 | 54x2 |
|
| 21 | 100704–101063 | 360 | 60x6 |
|
| 22 | 101250–101295 | 45 | 45 |
|
| 23 | 101313–101337 | 24 | 12x2 |
|
| 24 | 113049–113073 | 24 | 12x2 |
|
| 25 | 115787–115820 | 32 | 16x2 |
|
| 26 | 123920–124039 | 120 | 60x2 |
|
| 27 | 125626–125659 | 39 | 13x3 |
|
| 28 | 140203–140242 | 42 | 21x2 |
|
Characteristics of simple sequence repeats identified in the chloroplast genomes of W. nobilis, A. dammara and P. lambertii.
| Mono | Di | Tri | Tetra | Penta | Hexa | Total | |
|---|---|---|---|---|---|---|---|
|
| |||||||
| Total counts | 239 | 69 | 62 | 15 | 1 | 1 | 387 |
| Total Repeat Length (repeat unit X number of repeat) (bp) | 1991 | 744 | 621 | 184 | 15 | 18 | 3573 |
| Density (Total repeat length/genome size) [bp/kb] | 13.67 | 5.11 | 4.26 | 1.26 | 0.10 | 0.12 | 24.53 |
| Proportion among other SSR (%) | 55.72 | 20.82 | 17.38 | 5.15 | 0.42 | 0.50 | 100 |
| Mean Length | 8.33 | 10.78 | 10.02 | 12.27 | 15 | 18 | 9.23 |
| Standard Deviation | 1.84 | 4.45 | 3.64 | 1.03 | 0 | 0 | |
|
| |||||||
| Total counts | 250 | 68 | 64 | 12 | 2 | 1 | 397 |
| Total Repeat Length (repeat unit X number of repeat) (bp) | 2168 | 720 | 615 | 152 | 30 | 18 | 3703 |
| Density (Total repeat length/genome size) [bp/kb] | 14.89 | 4.94 | 4.22 | 1.04 | 0.21 | 0.12 | 25.43 |
| Proportion among other SSR (%) | 58.55 | 19.44 | 16.61 | 4.10 | 0.81 | 0.49 | 100 |
| Mean Length | 8.67 | 10.59 | 9.61 | 12.67 | 15 | 18 | 9.33 |
| Standard Deviation | 2.38 | 3.88 | 1.53 | 1.56 | 0 | 0 | |
|
| |||||||
| Total counts | 198 | 63 | 52 | 9 | 1 | 1 | 324 |
| Total Repeat Length (repeat unit X number of repeat) (bp) | 1558 | 586 | 498 | 112 | 15 | 18 | 2787 |
| Density (Total repeat length/genome size) [bp/kb] | 11.65 | 4.38 | 3.72 | 0.84 | 0.11 | 0.13 | 20.84 |
| Proportion among other SSR (%) | 55.90 | 21.03 | 17.87 | 4.02 | 0.54 | 0.65 | 100 |
| Mean Length | 7.87 | 9.30 | 9.58 | 12.44 | 15 | 18 | 8.60 |
| Standard Deviation | 1.56 | 2.81 | 1.33 | 1.33 | 0 | 0 |