| Literature DB >> 27499133 |
Sara Pinosio1, Stefania Giacomello2, Patricia Faivre-Rampant3, Gail Taylor4, Veronique Jorge5, Marie Christine Le Paslier3, Giusi Zaina6, Catherine Bastien5, Federica Cattonaro7, Fabio Marroni2, Michele Morgante8.
Abstract
Many recent studies have emphasized the important role of structural variation (SV) in determining human genetic and phenotypic variation. In plants, studies aimed at elucidating the extent of SV are still in their infancy. Evidence has indicated a high presence and an active role of SV in driving plant genome evolution in different plant species.With the aim of characterizing the size and the composition of the poplar pan-genome, we performed a genome-wide analysis of structural variation in three intercrossable poplar species: Populus nigra, Populus deltoides, and Populus trichocarpa We detected a total of 7,889 deletions and 10,586 insertions relative to the P. trichocarpa reference genome, covering respectively 33.2 Mb and 62.9 Mb of genomic sequence, and 3,230 genes affected by copy number variation (CNV). The majority of the detected variants are inter-specific in agreement with a recent origin following separation of species.Insertions and deletions (INDELs) were preferentially located in low-gene density regions of the poplar genome and were, for the majority, associated with the activity of transposable elements. Genes affected by SV showed lower-than-average expression levels and higher levels of dN/dS, suggesting that they are subject to relaxed selective pressure or correspond to pseudogenes.Functional annotation of genes affected by INDELs showed over-representation of categories associated with transposable elements activity, while genes affected by genic CNVs showed enrichment in categories related to resistance to stress and pathogens. This study provides a genome-wide catalogue of SV and the first insight on functional and structural properties of the poplar pan-genome.Entities:
Keywords: pan-genome; poplar; structural variation; transposable elements
Mesh:
Year: 2016 PMID: 27499133 PMCID: PMC5026262 DOI: 10.1093/molbev/msw161
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Summary Statistics of INDELs Detected in the High-depth Poplar Accessions.
Note.—The table reports a schematic representation of the phylogeny of the three studied poplar species and a classification of the detected deletions (DEL) and insertions (INS) on the bases of the species in which variants have been detected: Pn, Pd, Pt = P. nigra, P. deltoides, and P. trichocarpa-specific INDELs, respectively; Pn/Pd, Pn/Pt, Pd/Pt = INDELs detected in the pairs P. nigra–P. deltoides, P. nigra–P. trichocarpa, and P. deltoides–P. trichocarpa, respectively; Pn/Pd/Pt = INDELs detected in all the three studied species. For each category, the table reports the percentage of detected variants (% INDELs) belonging to that category, the percentage of variants detected in homozygous state in all individuals (%fixed) and the structural variation event (SV event) responsible for the correspondent INDEL in agreement with the known phylogeny and under the assumptions of all mutations having occurred only once and of absence of false positives and false negatives.
Classification of INDELs on the Basis of their Homology with Class I (retrotransposons) or Class II (DNA transposons) Transposable Elements
| # DEL | % DEL | # INS | %INS | ||
|---|---|---|---|---|---|
| Class I | LTR Gypsy | 1,752 | 39.9 | 5,175 | 49.8 |
| LTR Copia | 854 | 19.4 | 2,862 | 27.5 | |
| LINE L1 | 64 | 1.5 | 365 | 3.5 | |
| LTR | 38 | 0.9 | 308 | 3.0 | |
| SINE | 6 | 0.1 | 2 | 0.0 | |
| Total | 2,714 | 61.8 | 8,712 | 83.8 | |
| Class II | TIR hAT | 525 | 12.0 | 449 | 4.3 |
| Helitron | 380 | 8.7 | 255 | 2.5 | |
| TIR CACTA | 343 | 7.8 | 481 | 4.6 | |
| TIR PIF/Harbinger | 39 | 0.9 | 93 | 0.9 | |
| TIR Mutator | 26 | 0.6 | 0 | 0.0 | |
| Class II unknown | 365 | 8.3 | 410 | 3.9 | |
| Total | 1,678 | 38.2 | 1,688 | 16.2 | |
| Unclassified | 3,497 | 44.3 | 186 | 1.8 | |
aNumber of classified deletions.
bPercentage of classified deletions.
cNumber of classified insertions.
dPercentage of classified insertions.
. 1Genomic distribution of INDELs and genic CNVs. The number of deletions (dark blue track, y axes range = 0–20), insertions (blue track, y axes range = 0–40), genic CNVs (yellow track, y axes range = 0–15), annotated genes (red track, y axes range = 0–80), and the repetitiveness of the genome (green track, y axes range = 0–10) are represented in windows of 250 kb along the 19 Populus trichocarpa chromosomes (outer gray bars). Predicted centromeric regions are highlighted in red.
. 2Frequencies of the Gene Ontology terms for which an over-representation has been observed when comparing the subsets of genes included in deletions (DEL), genes interrupted by insertions (INS) and genic CNVs (gCNV) in the studied samples with respect to the complete dataset of P. trichocarpa annotated genes (ALL). * P-value <0.05, ** P-value < 0.01.
. 3Genetic features associated with P. trichocarpa transcriptome (ALL), genes disrupted by deletions (DEL), genes disrupted by insertions (INS), and genes affected by genic CNVs (gCNV). Left panel: percentage of non-expressed genes. The percentage of non-expressed genes was significantly higher in genes affected by any SV (deletion, insertion or genic CNV) than the whole transcriptome. Central panel: expression levels in the whole transcriptome and in genes affected by SV. Right panel: rates of nonsynonymous to synonymous changes in whole transcriptome and in genes affected by SV. dN/dS values in genes affected by SV were significantly higher compared with whole transcriptome. *P-value < 0.05.
. 4Composition of poplar pan-genome constructed using the structural variants detected in the seven high-depth accessions.