| Literature DB >> 34946893 |
Shaoqi Shen1, Yuxian Li1, Jianyu Wang1, Chendan Wei1, Zhenyi Wang1, Weina Ge1, Min Yuan1, Lan Zhang1, Li Wang1, Sangrong Sun1, Jia Teng1, Qimeng Xiao1, Shoutong Bao1, Yishan Feng1, Yan Zhang1, Jiaqi Wang1, Yanan Hao1, Tianyu Lei1, Jinpeng Wang1,2,3.
Abstract
The peanut (Arachis hypogaea L.) is the leading oil and food crop among the legume family. Extensive duplicate gene pairs generated from recursive polyploidizations with high sequence similarity could result from gene conversion, caused by illegitimate DNA recombination. Here, through synteny-based comparisons of two diploid and three tetraploid peanut genomes, we identified the duplicated genes generated from legume common tetraploidy (LCT) and peanut recent allo-tetraploidy (PRT) within genomes. In each peanut genome (or subgenomes), we inferred that 6.8-13.1% of LCT-related and 11.3-16.5% of PRT-related duplicates were affected by gene conversion, in which the LCT-related duplicates were the most affected by partial gene conversion, whereas the PRT-related duplicates were the most affected by whole gene conversion. Notably, we observed the conversion between duplicates as the long-lasting contribution of polyploidizations accelerated the divergence of different Arachis genomes. Moreover, we found that the converted duplicates are unevenly distributed across the chromosomes and are more often near the ends of the chromosomes in each genome. We also confirmed that well-preserved homoeologous chromosome regions may facilitate duplicates' conversion. In addition, we found that these biological functions contain a higher number of preferentially converted genes, such as catalytic activity-related genes. We identified specific domains that are involved in converted genes, implying that conversions are associated with important traits of peanut growth and development.Entities:
Keywords: Arachis; duplicated genes; gene conversion; polyploidization; subgenome
Mesh:
Year: 2021 PMID: 34946893 PMCID: PMC8701993 DOI: 10.3390/genes12121944
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Homologous gene quartets and inference of conversion through phylogenetic analyses. (A) Colinear chromosomal segments from two genomes (A,B), represented by rectangles of different colors. Arrows show genes and homologous genes are coded by the same color. Homologous gene quartet formed by paralogous genes A1 and A2 in A and their respective orthologous genes B1 and B2 in B. (B) Squares indicate the WGD event in their common ancestral genome and circles symbolize species divergence. The expected phylogenetic relationship and potential conversion event of the homologous quartets: (I) the expected phylogenetic relationship of the homologous genes in quartet if no conversion occurs; (II) A2 (acceptor) is converted by A1 (donor); (III) B1 is converted by B2; (IV) both of the above conversions occurred. (C) Homoeologous chromosomal segments from A. duranensis, A. ipaensis, A. monticola and A. hypogaea. (D) The phylogenetic relationship of conversions between PRT-related duplicated genes.
Figure 2The phylogeny of the studied peanut genomes and inference of duplicated and orthologous gene pairs within and between genomes. (A) The phylogeny of the studied peanut genomes. (B) The genomic homologous dotplot within the A. ipaensis genome. Highlighted boxes indicate the paralogous blocks generated from the LCT event. The median Ks values of anchored gene pairs located in blocks are placed next to the highlighted boxes. (C) The genomic homologous dotplot between the genomes of A. ipaensis and A. duranensis. Highlighted boxes indicate the orthologous blocks produced by the divergence of A. ipaensis from A. duranensis. The median Ks values of anchored gene pairs located in blocks is placed next to the highlighted boxes. (D) Alignment of the peanut and relative genomes with A. duranensis as reference. The innermost circle represents the 10 chromosomes of the Ad genome and the gray lines linked paralogous genes generated from LCT. Genomic paralogy, orthology and outparalogy information within and among eight (sub) genomes, with the name abbreviations of A. duranensis (A), A. ipaensis (B), A. monticola A (MA), A. monticola B (MB), A. hypogaea A (Shitouqi) (SA), A. hypogaea B (Shitouqi) (SB), A. hypogaea A (Tifrunner) (TA) and A. hypogaea B (Tifrunner) (TB), displayed in 16 circles. The short line forming the innermost A. duranensis chromosome circle represents predicted genes, which have one set of paralogous regions, forming another circle. Each of the two sets of A. duranensis paralogous chromosomal regions has one orthologous region in other Arachis genomes.
Converted paralogues in peanut genomes.
| Species | Quartet Patterns | Paralogues in Quartets | WCV-I a | WCV-II b | PCV c | Total | Conversion Rate (%) |
|---|---|---|---|---|---|---|---|
|
| A1-B1-A2-B2 | 1871 | - | 4 | 216 | 220 | 11.8% |
|
| - | 2 | 241 | 242 | 13.0% | ||
| Ama1-Amb1-Ama2-Amb2 | 99 | - | 2 | 11 | 13 | 13.1% | |
| - | - | 14 | 14 | 14.1% | |||
| Aha1-Ahb1-Aha2-Ahb2 | 1314 | 3 | 3 | 121 | 126 | 9.6% | |
| 3 | 2 | 112 | 115 | 8.8% | |||
| Aha1-Ahb1-Aha2-Ahb2 | 953 | 2 | 2 | 129 | 132 | 6.8% | |
| 3 | 3 | 134 | 139 | 7.1% |
Note: WCV-I a, the similarity of homologous gene pairs measured by Ks; WCV-II b, the ratios of amino acid locus identity of sequences in each quartet to measure the similarity and examination of the topological tree changes; PCV c, a dynamic programming algorithm combined with phylogenetic analysis.
Figure 3Gene conversion in peanut genomes. In each panel, the outer circle shows the chromosomes in the considered peanut genome. Converted duplicated gene pairs are connected with curvy lines. (A) Gene conversion in A. duranensis and A. ipaensis. (B) Gene conversion in A. monticola. (C) Gene conversion in Shitouqi. (D) Gene conversion in Tifrunner.
Figure 4Gene conversions between PRT-related duplicated genes in two cultivated tetraploid peanuts genomes. (A) Twenty chromosomes of Shitouqi are divided into ten groups according to the corresponding relationship of the subgenomes in Shitouqi. Converted duplicated genes related to PRT are marked out with a short line according to their location on subgenome A and B, while the red lines indicate genes from converted duplicates as donors and the blue lines indicate acceptor genes; the highlighted region indicates that the number of converted genes is no less than 25% of duplicates in the region. (B) Gene conversions between PRT-related duplicated genes in Tifrunner genome. (C) Colinearity and conversion at the end of chromosome 3 in Shitouqi and Tifrunner; rectangles represent annotated genes with orientation on the same strand (blue) or reverse strand (green); the grey lines connect syntenic gene pairs and red lines connect conversion genes. (D) The top is the tree of nonconverted genes and the bottom is the converted gene tree.
Nucleotide substitution rates of quartets in peanut genomes.
| Orthologues | Converted Genes | Nonconverted Genes | ||
|---|---|---|---|---|
|
|
| 0.055 | 0.023 | 9.48 × 10−14 |
|
| 0.109 | 0.064 | 7.05 × 10−7 | |
| 0.505 | 0.359 | 3.68 × 10−9 | ||
|
| 0.114 | 0.0380 | 1.40 × 10−3 | |
|
| 0.220 | 0.0846 | 3.60 × 10−3 | |
| 0.523 | 0.449 | 1.30 × 10−3 | ||
|
| 0.0620 | 0.0483 | 2.10 × 10−3 | |
|
| 0.116 | 0.0900 | 9.00 × 10−3 | |
| 0.534 | 0.537 | 4.98 × 10−5 | ||
|
| 0.0704 | 0.0340 | 4.01 × 10−5 | |
|
| 0.133 | 0.0753 | 4.10 × 10−5 | |
| 0.529 | 0.452 | 9.33 × 10−7 | ||
Figure 5The association between the block length (colinear gene pairs) and the gene conversion rate. (A) The association between LCT-related block length and conversion rate among all eight peanut genomes. (B) The association between PRT-related block length and gene conversion among three tetraploid peanut genomes.
Figure 6Histogram of gene ontology (GO) statistics for converted and all duplicated genes. (A) GO statistics for converted genes and duplicated genes in A. hypogaea A (Shitouqi). x-axis shows user-selected GO terms; y-axis shows the percentages of genes (number of a particular gene divided by total gene number). (B) GO statistics for converted genes and duplicated genes in A. hypogaea B (Shitouqi). (C) GO statistics for converted genes and duplicated genes in A. hypogaea A (Tifrunner). (D) GO statistics for converted genes and duplicated genes in A. hypogaea B (Tifrunner).
Figure 7Bubble diagram of domain enrichment for converted genes. (A) Enrichment of duplicated gene domains in gene conversion in diploid peanut. The abscissa represents the total percentage of genes containing this domain, the ordinate represents the various domains; the size of the circle represents the number and the color represents the e-value. (B) Enrichment of duplicated gene domains in gene conversion in Shitouqi. (C) Enrichment of duplicated gene domains in gene conversion in Tifrunner. (D) Enrichment of duplicated gene domains in gene conversion in all the different ploidies of peanut.