| Literature DB >> 30158460 |
Flavia Mascagni1, Alberto Vangelisti2, Tommaso Giordani3, Andrea Cavallini4, Lucia Natali5.
Abstract
The relationship between variation of the repetitive component of the genome and domestication in plant species is not fully understood. In previous work, variations in the abundance and proximity to genes of long terminal repeats (LTR)-retrotransposons of sunflower (Helianthus annuus L.) were investigated by Illumina DNA sequencingtocompare cultivars and wild accessions. In this study, we annotated and characterized 22 specific retrotransposon families whose abundance varies between domesticated and wild genotypes. These families mostly belonged to the Chromovirus lineage of the Gypsy superfamily and were distributed overall chromosomes. They were also analyzed in respect to their proximity to genes. Genes close to retrotransposon were classified according to biochemical pathways, and differences between domesticated and wild genotypes are shown. These data suggest that structural variations related to retrotransposons might have occurred to produce phenotypic variation between wild and domesticated genotypes, possibly by affecting the expression of genes that lie close to inserted or deleted retrotransposons and belong to specific biochemical pathways as those involved in plant stress responses.Entities:
Keywords: Helianthus annuus; long terminal repeat retrotransposons; plant domestication; retrotransposon abundance; retrotransposon proximity to genes
Year: 2018 PMID: 30158460 PMCID: PMC6162735 DOI: 10.3390/genes9090433
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Sunflower genotypes used in this study. For each genotype, the United State Department of Agriculture (USDA) identification code, the area of cultivation for domesticated genotypes, and the number of reads sequenced by the Illumina technique are reported. Reads were trimmed at 90 nt and used in analyses as single ends for measuring LTR-retrotransposons (LTR-REs) abundance. For analyzing the proximity between LTR-REs and genes, paired ends were used and no specific length was defined.
| Type | Name | Id Code | Area of Cultivation | Raw Reads | Trimmed Reads (as Single Ends, 90 nt) | Trimmed Reads (as Paired Ends) |
|---|---|---|---|---|---|---|
| Domesticated | Hata | Ames 22503 | Argentina | 32,100,390 | 31,085,284 | 31,624,960 |
| Dussol | Ames 22499 | France | 25,678,406 | 24,988,640 | 25,375,912 | |
| Argentario | Ames 1842 | Italy | 10,759,866 | 10,134,402 | 10,566,652 | |
| Karlik | Ames 3454 | Spain | 23,499,752 | 22,938,364 | 23,087,458 | |
| Zelenka | Ames 22530 | Russia | 9,048,276 | 8,824,270 | 8,858,154 | |
| Roman “A” | PI531386 | Romania | 19,408,888 | 18,621,244 | 19,095,974 | |
| HOPI | PI369359 | USA | 15,768,198 | 15,254,502 | 15,437,790 | |
| Seneca | PI369360 | USA | 13,911,506 | 13,334,732 | 13,667,436 | |
| Wild | Arizona (AZ) | Ames14400 | - | 14,641,510 | 14,013,588 | 14,357,666 |
| Colorado (CO) | PI586840 | - | 23,335,576 | 21,965,694 | 22,916,284 | |
| Illinois (IL) | PI 435540 | - | 18,577,580 | 17,366,768 | 18,145,470 | |
| Kentucky (KY) | PI 435613 | - | 14,853,802 | 13,845,748 | 14,580,828 | |
| Mississippi (MS) | PI 435608 | - | 22,921,544 | 21,376,594 | 22,226,864 | |
| North Dakota (ND) | PI586811 | - | 51,681,332 | 47,906,352 | 49,574,892 | |
| Washington (WA) | PI 531018 | - | 6,996,658 | 6,479,624 | 6,724,410 |
Figure 1Principal component analysis (PCA) plots of abundance values of 22 LTR-RE families in domesticated (white dots) and wild genotypes (grey dots) of Heliantus annuus. The percentage of variation accounted by each axis is shown. Asterisks indicate permutational multivariate analysis of variance (PERMANOVA) significance between cultivars and wild accessions: *** p < 0.001; ** p < 0.01.
Figure 2Mean number (± standard error) of mapped reads (x million) of each of the 22 LTR-RE families in eight cultivars and seven wild accessions of H. annuus. All families were differentially abundant by PERMANOVA, at p < 0.01.
Figure 3Percentage of long terminal repeats-retrotransposons (LTR-RE) families showing significant differences in abundance between wild and cultivated genotypes, according to the lineage to which such families belong. For each lineage, the total number of families in the sunflower genome is reported in parentheses.
Figure 4Distribution of the Gypsy (in green) and Copia (in red) LTR-RE families along the 17 linkage groups (LGs) of the sunflower genome (line HanXRQ [42]). The distribution of a putative centromeric sequence ([58], in black) is also reported. Red arrows indicate the most probable centromere position of each LG, corresponding to the peaks of highest frequency of the putative centromeric sequence. The space of each LG is proportional to its length in nucleotides.
Mean number (per million reads) of paired reads where one read maps to a LTR-RE family and the other to a gene sequence. For each family, the lineage and the superfamily are reported. Statistical significance of differences between cultivars and wild accessions was assessed by PERMANOVA (*** p < 0.001; * p < 0.05).
| Superfamily | Lineage | Family | Mean nr. of Gene-RE Mapping Paired Reads per Million Reads | ||
|---|---|---|---|---|---|
| Cultivars | Wild accessions | PERMANOVA | |||
| Gypsy | Chromovirus | CL5 | 2.95 | 4.05 | |
| Gypsy | Chromovirus | CL18 | 1.61 | 1.64 | |
| Gypsy | Chromovirus | CL25 | 4.37 | 6.53 | * |
| Gypsy | Chromovirus | CL32 | 1.07 | 0.80 | |
| Gypsy | Chromovirus | CL35 | 0.80 | 0.90 | |
| Gypsy | Chromovirus | CL47 | 0.84 | 1.57 | *** |
| Gypsy | Chromovirus | CL57 | 0.73 | 1.04 | * |
| Gypsy | Chromovirus | CL64 | 0.47 | 0.65 | |
| Gypsy | Chromovirus | CL88 | 0.47 | 0.81 | * |
| Gypsy | Chromovirus | CL94 | 0.25 | 0.32 | |
| Gypsy | Chromovirus | CL96 | 0.76 | 0.82 | |
| Gypsy | Chromovirus | CL102 | 0.14 | 0.18 | |
| Gypsy | Chromovirus | CL138 | 0.17 | 0.14 | |
| Gypsy | Chromovirus | CL193 | 0.03 | 0.05 | |
| Gypsy | Chromovirus | CL232 | 0.03 | 0.01 | |
| Gypsy | Athila | CL29 | 1.16 | 1.34 | |
| Gypsy | Athila | CL43 | 1.31 | 1.90 | * |
| Gypsy | Athila | CL87 | 0.42 | 0.64 | * |
| Mean Gypsy | 0.98 | 1.38 | |||
| Copia | AleII | CL48 | 1.08 | 1.08 | |
| Copia | Maximus/SIRE | CL115 | 0.12 | 0.21 | |
| Copia | Angela | CL100 | 0.27 | 0.29 | |
| Copia | TAR/Tork | CL255 | 0.18 | 0.17 | |
| Mean Copia | 0.41 | 0.44 | |||
Figure 5Percentages of Kyoto encyclopedia of genes and genomes (KEGG) biochemical pathway terms associated with gene-RE pairs in cultivars and wild accessions of H. annuus. Asterisks indicate PERMANOVA significance between cultivars and wild accessions: *** p < 0.001; * p < 0.05.