| Literature DB >> 21292629 |
David Roy Smith1, Kate Crosby, Robert W Lee.
Abstract
The abundance of nuclear plastid DNA-like sequences (NUPTs) in nuclear genomes can vary immensely; however, the forces responsible for this variation are poorly understood. "The limited transfer window hypothesis" predicts that species with only one plastid per cell will have fewer NUPTs than those with many plastids per cell, but a lack of genome sequence data from monoplastidic species has made this hypothesis difficult to test. Here, by analyzing newly available genome sequences from diverse mono- and polyplastidic taxa, we show that the hypothesis holds. On average, the polyplastidic species we studied had 80 times more NUPTs than those that were monoplastidic. Moreover, NUPT content was positively related to nuclear genome size, indicating that in addition to plastid number, NUPTs are influenced by the forces controlling the expansion and contraction of noncoding nuclear DNA. These findings are consistent with data on nuclear DNAs of mitochondrial origin (NUMTs), suggesting that similar processes govern the abundance of both NUPTs and NUMTs.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21292629 PMCID: PMC3101015 DOI: 10.1093/gbe/evr001
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Number and Total Amount (in Kilobases) of NUPTs and NUMTs in the Available Nuclear Genome Sequences from Plastid-Harboring Eukaryotes
| Taxon | Number of Plastids per Cell | Number of Mitochondria per Cell | NUPTs | NUMTs | ||||
| Number of Blast Hits | Accumulative Length (kb) | Average Length (kb) | Number of Blast Hits | Accumulative Length (kb) | Average Length (kb) | |||
| Land plants | ||||||||
| | Multiple | Multiple | 332 | 50 | 0.15 | 1,173 | 549 | 0.46 |
| | Multiple | Multiple | 310 | 114 | 0.37 | NA | NA | NA |
| | Multiple | Multiple | 839 | 291 | 0.34 | 1,528 | 467 | 0.32 |
| | Multiple | Multiple | 751 | 265 | 0.35 | NA | NA | NA |
| | Multiple | Multiple | 3,414 | 822 | 0.24 | NA | NA | NA |
| | Multiple | Multiple | 258 | 93.3 | 0.36 | NA | NA | NA |
| | Multiple | Multiple | 1,541 | 782 | 0.50 | 2,544 | 818 | 0.32 |
| | Multiple | Multiple | 2,036 | 1,073 | 0.52 | 3,072 | 834 | 0.27 |
| | Effectively monoplastidic | Multiple | 31 | 5 | 0.16 | 294 | 74 | 0.25 |
| | Multiple | Multiple | 2,036 | 428 | 0.30 | NA | NA | NA |
| | Effectively monoplastidic | Multiple | 114 | 11.4 | 0.10 | NA | NA | NA |
| | Multiple | Multiple | 1,574 | 329 | 0.20 | 1,957 | 406 | 0.20 |
| | Multiple | Multiple | 3,858 | 801 | 0.20 | 2,357 | 602 | 0.25 |
| Green algae | ||||||||
| | Single | Multiple | 35 | 1.9 | 0.05 | 35 | 3.3 | 0.09 |
| | Single | Single | 73 | 7.5 | 0.10 | 107 | 12 | 0.11 |
| | Single | Single | 0 | 0 | 0 | 0 | 0 | 0 |
| | Single | Single | 4 | 0.6 | 0.17 | 2 | 0.6 | 0.31 |
| | Single | Single | 3 | 0.5 | 0.16 | 0 | 0 | 0 |
| | Single | Single | 3 | 0.6 | 0.20 | 0 | 0 | 0 |
| | Single | Multiple | 1,100 | 65 | 0.12 | 802 | 33 | 0.09 |
| Red alga | ||||||||
| | Single | Single | 2 | 0.37 | 0.18 | 0 | 0 | 0 |
| Apicomplexans | ||||||||
| | Single | Single | 0 | 0 | 0 | 0 | 0 | 0 |
| | Single | Single | 31 | 2.8 | 0.09 | NA | NA | NA |
| | Single | Single | 2 | 0.11 | 0.05 | 2 | 0.11 | 0.05 |
| | Single | Single | 0 | 0 | 0 | 0 | 0 | 0 |
| | Single | Single | 77 | 10.3 | 0.03 | NA | NA | NA |
| Haptophyte | ||||||||
| | 1–2 | Single | 2 | 0.15 | 0.07 | 2 | 0.1 | 0.05 |
| Stramenopiles | ||||||||
| | Single | Single | 0 | 0 | 0 | NA | NA | NA |
| | Single | Multiple | 14 | 4 | 0.29 | NA | NA | NA |
| | 1–2 | Multiple | 8 | 1.6 | 0.20 | 0 | 0 | 0 |
NOTE.— NA—data not available (i.e., mitochondrial genome has not been sequenced; thus, we were unable to search the nuclear DNA for NUMTs).
Blast parameters were as follows: BlastN (version 2.2.23) with an expectation value of 0.0001; a word size of 11; match and mismatch scores of 2 and −3, respectively; and gap cost values of 5 (existence) and 2 (extension). Multiple organelle DNA hits to the same nuclear DNA regions were counted only once. Regions of nuclear DNA that contained tight clusters of NUPTs/NUMTs (i.e., sections of organelle-like DNA interrupted by genomic sequence that did not show sequence identity to organelle DNA) were not counted as a single NUPT/NUMT but as separate hits. See supplementary table S3 (Supplementary Material online) for references and notes on the number of organelles per cell.
S. moellendorffii and P. patens both contain cells that are polyplastidic, but for the purpose of this study, they are considered “effectively monoplastidic” because mitosis and meiosis only occurs in cells that contain a single plastid (Brown and Lemmon 1990).
F“Beanplot” depicting the difference in NUPT/NUMT content between mono- and polyplastidic (and polymitochondrial) species. Plot was generated using the beanplot package (Kampstra 2008) from R v. 2.1.1. The dashed line in the middle of each of the two plots is the overall average of the continuous variable (total NUPT or NUMT content in kilobases). The thick black line in the middle of each category (mono or poly) is the median of each continuous variable (NUPT or NUMT content) with respect to the categorical variable (mono- or polyplastidic/polymitochondrial). The colored curved beanpod surrounding the observations “beans” is the theoretical probability density distribution of these observations. If there were multiple observations with the same number (e.g., NUPTs content of 4 kb for two different taxa), then the line gets longer respective to the other measurements in the beanplot. A Wilcoxon signed rank test (nonparametric) was performed in R on all data because errors were not normally distributed. Note, the P values shown are approximations—the exact values could not be computed due to ties in the data. For polyorganellar species, the lowest and highest data points are named. However, to avoid clutter, only the highest points are labeled for mono-organellar species. Note: At, Arabidopsis thaliana, Cc, Coccomyxa sp. C-169; Os, Oryza sativa; Ts, Thalassiosira pseudonana; and Vc, Volvox carteri.
FLog-scale bar graph showing the number and accumulative length of NUPTs (right side) and NUMTs (left side) for various plastid-harboring eukaryotes. Species are ranked in ascending order based on their total NUPT content. See table 1 and supplementary table S3 (Supplementary Material online) for references and notes on the number of organelles per cell. 1Emiliania huxleyi and Thalassiosira pseudonana can have up to two plastids per cell. Selaginella moellendorffii and Physcomitrella patens are effectively monoplastidic—mitosis and meiosis only occurs in cells that contain a single plastid (Brown and Lemmon 1990).
FLog-log plot of NUPT (top) and NUMT (bottom) content versus nuclear genome size. Polyplastidic and polymitochondrial species are shaded gray on the top and bottom plots, respectively. The names of species of particular interest are shown on the plot. Nuclear genome size data came from GenBank’s Entrez Eukaryotic Genome Project database (http://www.ncbi.nlm.nih.gov/genomes/leuks.cgi).