Literature DB >> 32853350

A Lineage-Specific Paralog of Oma1 Evolved into a Gene Family from Which a Suppressor of Male Sterility-Inducing Mitochondria Emerged in Plants.

Takumi Arakawa1,2, Hiroyo Kagami1, Takaya Katsuyama1, Kazuyoshi Kitazaki1, Tomohiko Kubo1.   

Abstract

Cytoplasmic male sterility (MS) in plants is caused by MS-inducing mitochondria, which have emerged frequently during plant evolution. Nuclear restorer-of-fertility (Rf)genes can suppress their cognate MS-inducing mitochondria. Whereas many Rfs encode a class of RNA-binding protein, the sugar beet (Caryophyllales) Rf encodes a protein resembling Oma1, which is involved in the quality control of mitochondria. In this study, we investigated the molecular evolution of Oma1 homologs in plants. We analyzed 37 plant genomes and concluded that a single copy is the ancestral state in Caryophyllales. Among the sugar beet Oma1 homologs, the orthologous copy is located in a syntenic region that is preserved in Arabidopsis thaliana. The sugar beet Rf is a complex locus consisting of a small Oma1 homolog family (RF-Oma1 family) unique to sugar beet. The gene arrangement in the vicinity of the locus is seen in some but not all Caryophyllalean plants and is absent from Ar. thaliana. This suggests a segmental duplication rather than a whole-genome duplication as the mechanism of RF-Oma1 evolution. Of thirty-seven positively selected codons in RF-Oma1, twenty-six of these sites are located in predicted transmembrane helices. Phylogenetic network analysis indicated that homologous recombination among the RF-Oma1 members played an important role to generate protein activity related to suppression. Together, our data illustrate how an evolutionarily young Rf has emerged from a lineage-specific paralog. Interestingly, several evolutionary features are shared with the RNA-binding protein type Rfs. Hence, the evolution of the sugar beet Rf is representative of Rf evolution in general.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  zzm321990 restorer-of-fertilityzzm321990 ; cytoplasmic male sterility; nuclear–mitochondrial interaction; plant mitochondria; positive selection; sugar beet

Year:  2020        PMID: 32853350      PMCID: PMC7846149          DOI: 10.1093/gbe/evaa186

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

The endosymbiotic theory proposes that the mitochondrion originated as an endosymbiont resembling an alpha-proteobacterium (Scheffler 2007), and is supported by molecular phylogenetic analyses of mitochondrial DNA (mtDNA) (Lynch 2007). The mtDNA is, however, not only a remnant of the ancestral genome but contains genes important for the eukaryotic host. Although the number of mitochondrial genes is small (e.g., 37 in mammals and 50–60 in flowering plants) (Gualberto and Newton 2017), their impact on the host’s phenotype and fitness cannot be ignored (Wallace 2018). In many cases, the mtDNA is inherited maternally, unlike the nuclear DNA (Perlman et al. 2015). According to evolutionary theory, this situation invokes differential interests between the mitochondrial and nuclear DNAs: for the mtDNA, mutations that reduce the number of male gametes can be beneficial if they render some advantage to female fitness. However, such mutations are deleterious for inheritance of the nuclear DNA, and thus they drive the evolution of nuclear genes that suppress the male-harming mitochondrial mutations (Touzet 2012; Rice 2013). This situation is called mitonuclear conflict (Havird et al. 2019). Cytoplasmic male sterility (CMS) in flowering plants offers a good example of the mitonuclear conflict (Havird et al. 2019). Some flowering plant mitochondria are known to induce male sterility (MS) that inhibits pollen production but does not affect other organs (Schnable and Wise 1998). The MS-inducing mitochondria produce unique proteins that are absent in noninducing mitochondria (Hanson and Bentolila 2004; Kazama et al. 2019), and the genes encoding such unique proteins have been identified in multiple plant species (Chen and Liu 2014). These gene sequences are composed of fragments of duplicated mitochondrial genes and/or sequences with unknown origins, and their compositions vary, indicating that their occurrences are evolutionarily independent. Considering that CMS has been reported in more than 140 species (Laser and Lersten 1972), it appears that MS-inducing mitochondria have evolved independently and frequently in flowering plants. A nuclear-encoded suppressor of CMS is referred to a restorer-of-fertility (Rf) gene, and generally, a dominant Rf allele suppresses the MS phenotype (Chase 2007). Genetic analyses suggest that each Rf gene is a “specialist” that can suppress a specific MS-inducing mitochondrion but not others (Duvick 1965). The repeated evolution of MS-inducing mitochondria and their cognate Rf genes constitutes an intragenomic arms race (Touzet and Budar 2004), and their evolutionary mechanisms can provide key insights into the mitonuclear conflict. Rf genes have been identified and analyzed in many plant species, and they encode several different types of proteins (Kubo et al. 2020). The most predominant type is pentatrico peptide repeat (PPR) proteins, which bind RNA and recognize specific sequences using arrays of ∼35 amino acid motifs, each of which has high affinity for one of the four nucleotide residues (Barkan and Small 2014). RF proteins belonging to the PPR class are imported into mitochondria and participate in post-transcriptional mechanisms to repress the expression of MS-inducing genes (Dahan and Mireau 2013; Gaborieau et al. 2016). The PPR-encoding Rfs are members of the restorer-of-fertility-like (RFL) gene family, which is a subset of the larger PPR gene family (Fujii et al. 2011). The RFL genes appear to have diverged from a single origin that was established before the split of the major flowering plant lineages, and they are maintained in flowering plant genomes as clusters at one or several loci (Geddy and Brown 2007; Fujii et al. 2011). The organizational diversity of the RFL family within species is well known (Melonek et al. 2019). Molecular phylogenetic studies have revealed that members of the RFL family form species-specific clusters that are paralogous to each other and exhibit signatures of positive selection, features that are reminiscent of the evolutionary patterns of pathogen resistance (R) genes (Geddy and Brown 2007; Fujii et al. 2011). These observations support the concept of an arms race in the mitonuclear conflict (Fujii et al. 2011). However, the selection patterns of the other type of Rf genes (i.e., non-PPR Rfs) have not been examined in such detail, and it is not known if they also exhibit the signature of an arms race. In sugar beet (Beta vulgaris), several different types of MS-inducing mitochondria have been reported, and among these, the Owen, I-12CMS(3)/E, and G types have been characterized at the molecular level (Yamamoto et al. 2008; Kitazaki et al. 2015; Meyer et al. 2018). The molecular actions of their cognate Rfs are unclear except for that of the Owen type. A protein unique to the Owen-type mitochondria is encoded by the gene preSatp6 (whose origin is unknown), and the preSATP6 protein is the target of the cognate Rf (Kitazaki et al. 2015). Interestingly, a post-translational mechanism is involved in this interaction, in contrast with the post-transcriptional mechanisms associated with the PPR-encoding Rfs (Kitazaki et al. 2015) (see below). Oma1 was first identified in yeast as a gene encoding a protease (overlapping activity with m-AAA protease) (Käser et al. 2003), but now its various roles in the quality control of mitochondria are well known in animals and plants (Migdal et al. 2017; Guo et al. 2020). In sugar beet, a duplicated Oma1 copy evolved into an Rf, which was found by our positional cloning of a locus designated as Rf1 (Matsuhira et al. 2012; Arakawa et al. 2019a). The sugar beet Rf1 locus is in fact a complex locus with multiple clustered Oma1-like genes that participate in fertility restoration, with each gene contributing differently. Hence Rf1 (in terms of a genetic factor identified by a classical genetic approach) does not necessarily correspond to a single open reading frame (Arakawa T, et al., submitted). We therefore designated the clustered Oma1-like genes as the RF-Oma1 gene family, after the Oma1-like gene in the Rf1 locus. We have observed molecular diversity within the Rf1 locus among sugar beet lines: the copy number of clustered RF-Oma1 genes varies from one to four, and the nucleotide sequences of the genes have diverged (Moritani et al. 2013; Arakawa et al. 2019b). Our genetic analyses have identified different Rf1 alleles with variations in dominance, including dominant, semidominant, hypomorphic, and recessive (Arakawa et al. 2018, 2019b). These differences can be explained by the suite of RF-Oma1 genes clustered in each Rf1 allele. Namely, some RF-Oma1 genes encode proteins that can bind the preSATP6 protein of sugar beet to alter its higher order structure (Kitazaki et al. 2015), and the total amount of transcript from these genes is one of the determinants of the allele’s dominance (Arakawa T, et al., submitted). This molecular chaperone-like activity toward preSATP6 is unique to some RF-Oma1 genes and is absent from other Oma1 genes such as sugar beet bvOma1 and Arabidopsis atOma1 (Arakawa et al. 2019a). This suggests the evolution of a novel function within the RF-Oma1 gene family (Arakawa et al. 2019a). Some RF-Oma1 genes are transcribed but their protein products are unable to bind the preSATP6 protein (Kitazaki et al. 2015). If the locus consists of such genes, the allele is recessive. Interestingly, no sugar beet line examined to date has lost all its RF-Oma1 genes, even when the line is judged as rf1 homozygous recessive by a genetic analysis (Ohgami et al. 2016). If the locus has genes that encode both binding and nonbinding products, the allele exhibits (strong or weak, depending on the expression level) restoration ability. The strongest allele that we identified had four RF-Oma1 genes, each with the binding activity (Arakawa T, et al., submitted). We have identified a total of 12 RF-Oma1 genes that comprise six Rf1 alleles, and clarified their binding activity (Arakawa et al. 2018, 2019b; Arakawa T, et al., submitted). In addition to bvOma1 and the RF-Oma1 family, there are other Oma1 homologous genes, LOC104888056 and LOC104906603, but they exhibit presence/absence polymorphisms in the beet genetic resources (Arakawa et al. 2019a) and their functional significance is unknown. The objective of this study is to detail the molecular evolution of the RF-Oma1 genes. Here, we show that the RF-Oma1 genes are paralogs of Oma1 that have evolved relatively recently. There are similarities in the patterns of molecular diversity between the RF-Oma1 and RFL families, and in contrast, the evolutionary pattern of the RF-Oma1 genes differs from that of other Oma1 genes such as bvOma1. This suggests that similar mechanisms were involved in the evolution of the RF-Oma1 and RFL families, even though their gene products are different. The evolutionary pattern of the RF-Oma1 family suggests that paralog genes played significant roles in the foundation of Rf, and this has been less stressed in the case of the RFL family. We previously showed that RF-Oma1 gene expression is predominant in the anther, unlike the expression pattern of bvOma1 (Arakawa et al. 2019a). It is interesting to note that some features of the RF-Oma1 family (such as specific expression in male reproductive organs, paralogs encoding mitochondrial proteins, and many positively selected codons) seem to be shared with animal genes that counteract male-harming mitochondrial variants (Havird et al. 2019).

Materials and Methods

Nucleotide Sequence Analysis

Homology searches were performed using the databases at NCBI (https://www.ncbi.nlm.nih.gov/; last accessed September 07, 2020), Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html; last accessed September 07, 2020), miyakogusa.jp (http://www.kazusa.or.jp/lotus/index.html; last accessed September 07, 2020), and CuGenDB (http://cucurbitgenomics.org/organism/1; last accessed September 07, 2020). The transmembrane helix was predicted using the TMHMM Server v. 2.0 (http://www.cbs.dtu.dk/services/TMHMM/; last accessed September 07, 2020) (Emanuelsson et al. 2007). Reference sequences are summarized in supplementary table S1, Supplementary Material online. Gene maps were drawn using Easyfig (Sullivan et al. 2011) and Microsoft PowerPoint.

Phylogenetic Analysis

Phylogenetic trees were drawn using MEGA7 (Kumar et al. 2016) based on amino acid sequences aligned by ClustalW (supplementary data set 1, Supplementary Material online). The partial-deletion option was used with the site coverage cutoff set to 90%. Phylogeny was tested using 1,000 bootstrap replications. Evolutionary distances were determined using the neighbor-joining method and estimated by the Poisson correction distance, with the assumption that the amino acid substitution rate varied among sites according to the gamma distribution (shape number: 5). The maximum-likelihood tree was constructed under the Jones–Taylor–Thornton model assuming the gamma distribution for variation in the amino acid substitution rate (number of discrete parameters: 5). For the tree inference option, the initial tree was made automatically with default settings. The maximum-likelihood heuristic method was the Nearest-Neighbor-Interchange.

Selection Pattern Analysis

Codons were aligned using PAL2NAL v. 14 (Suyama et al. 2006). The ratios of synonymous (dS) and nonsynonymous (dN) nucleotide substitution rates were estimated using PAML v. 4.9h (Yang 2007). A comparison of one- and two-ratio models was performed using the branch model of codeml implemented in PAML. Variations in the dN/dS ratios among sites were examined by comparing five evolutionary models provided by the codeml: M0 (single dN/dS ratio for all sites), M1a (dN/dS is <1 in some sites and = 1 in the others), M2a (dN/dS of each site is either <1, = 1 or >1), M7 (dN/dS ratios of codons follow a beta distribution and are <1 or =1), and M8 (a certain portion of sites evolve as M7 but the dN/dS ratios of the others are >1). Posterior probabilities for the site classes were calculated using the Bayes Empirical Bayes method (Yang et al. 2005).

Nucleotide Sequencing

The B. vulgaris accessions used in this study are summarized in supplementary table S2, Supplementary Material online. Plants were grown in the field at the Field Science Center for Northern Biosphere, Hokkaido University. Total cellular RNA was isolated from fresh green leaves using the RNeasy Plant Mini Kit (Qiagen, Valencia, CA). After treating with RNase-free DNase I (Takara Bio, Kusatsu, Japan), RNA samples were reverse transcribed using the SuperScript III First-Strand Synthesis System (Thermo Fisher Scientific, Waltham, MA) and an oligo dT primer. PCR was then used to amplify the bvOma1 sequences with the primers 5′-GTAAAACGACGGCCAGTAGCATATCTTCTCCTTCAAATATCATG-3′ and 5′-GGAAACAGCTATGACCATGCGAAGTTGCAAAGAAGATGATGTTAATGGTCA-3′. The nucleotide sequences of the RT-PCR products were determined by Sanger sequencing in an ABI3130 Genetic Analyzer (Applied Bio systems, Foster City, CA). The sequence data are available at the DDBJ/EMBL/GenBank database with the accession numbers shown in supplementary table S2, Supplementary Material online.

Network Analysis

Nucleotide sequences were aligned using ClustalW, implemented in the MEGA7 software package (supplementary data set 2, Supplementary Material online). The phylogenetic network was constructed using the SplitsTree software v. 4.14.8 (Huson and Bryant 2006) with the Neighbor-Net method (Bryant 2003).

Results

Oma1 Homologs in Flowering Plants

The bvOma1 and RF-Oma1 genes are ubiquitous in sugar beet (Arakawa et al. 2019a), but it was unknown whether multiple Oma1 copies are usual in other flowering plants. Therefore, we searched for Oma1 homologs in other flowering plant genomes. The amino acid sequence of atOMA1 (TAIR accession At5g51740.1) was submitted as a query in TBlastN searches of the publicly available databases (see Materials and Methods section). The identified sequences were translated in silico according to the annotated gene models. The obtained amino acid sequences were then compared with that of atOMA1 using BlastP, to confirm their homology. Table 1 shows the results for 37 flowering plant genomes from 16 orders. Included are Amborella trichopoda (the basal lineage of flowering plants), 3 monocots, 11 superasterids including 4 Caryophyllales genomes, and 22 superrosids, including 1 Saxifragales genome (detailed in supplementary table S1, Supplementary Material online). In 24 genomes from 8 orders, the copy number of Oma1 homologs is 1, and the other 13 genomes have multiple copies (table 1). The highest number of copies is eight in Medicago truncatula, and the other two Fabales genomes have two copies (table 1). Five orders contain at least one single copy genome and at least one multicopy genome.
Table 1.

Copy Numbers of Oma1 Homologs in Flowering Plant Genomes

Taxon
Number of Oma1 Homologs
EudicotsType of Oma1
Total
ConservedDiverged
Superrosids
Brassicales Arabidopsis thaliana 101
Arabidopsis lyrata 101
Capsella rubella 101
Brassica rapa 101
Carica papaya 101
Malvales Gossypium raimondii a 202
Theobroma cacao 101
Sapindales Citrus sinensis 101
Myrtales Eucalyptus grandis 112
Cucurbitales Cucumis melo 101
Citrullus lanatus 101
Rosales Prunus persica 101
Malus domestica 101
Fragaria vesca 123
Fabales L. japonicus b 022
Medicago truncatula c 268
Glycine max d 202
Malpighiales Ricinus communis 101
Manihot esculenta 101
Populus trichocarpa 101
Vitales Vitis vinifera 101
Saxifragales Kalanchoe fedtschenkoi 101
Superasterids
Asterales Helianthus annuus 134
Lactuca sativa e 202
Cynara cardunculus 101
Solanales Capsicum annuum 101
Solanum lycopersicum 101
Solanum tuberosum 101
Lamiales Sesamum indicum 202
Caryophyllales Beta vulgaris f 134
Spinacia oleracea 101
Chenopodium quinoa g 202
Amaranthus hypochondriacus 101
Monocots
Poales Zea mays 101
Oryza sativa 112
Hordeum vulgare 1h12
Basal Magnoliophyta
Amborellales Amborella trichopoda 101

In these cases, taxon-specific whole genome duplication is suggested (Sato et al. 2008; Schmutz et al. 2010; Young et al. 2011; Wang et al. 2012; Reyes-Chin-Wo et al. 2017).

Data adopted from Arakawa et al. (2019a).

In this case, taxon-specific whole genome duplication is suggested (Jarvis et al. 2017).

Part of the coding region is missing in the gene model.

Copy Numbers of Oma1 Homologs in Flowering Plant Genomes In these cases, taxon-specific whole genome duplication is suggested (Sato et al. 2008; Schmutz et al. 2010; Young et al. 2011; Wang et al. 2012; Reyes-Chin-Wo et al. 2017). Data adopted from Arakawa et al. (2019a). In this case, taxon-specific whole genome duplication is suggested (Jarvis et al. 2017). Part of the coding region is missing in the gene model. In the genomes with multiple Oma1 copies, the sequence homology with atOMA1 varies among the copies. For example, for the three Oma1 homologs in Fragaria vesca, their E values (BlastP) are 1.00E−175, 3.00E−43, and 2.00E−74 (supplementary table S1, Supplementary Material online). In rice, a second Oma1 copy that was overlooked by Matsuhira et al. (2012) has an E value of 2.00E−19, which is higher than the E value for the other copy (5.00E−141) (supplementary table S1, Supplementary Material online). We divided the plant Oma1 homologs into conserved and diverged types based on their homology with atOMA1, using 5.00E−141 as the threshold to discriminate between the two types. Accordingly, all but one genome contains at least one conserved Oma1, and 29 of the 36 genomes have only one copy of the conserved type. Six genomes (Gossypium raimondii, M. truncatula, Glycine max, Lactuca sativa, Sesamum indicum, and Chenopodium quinoa) have two conserved types (table 1). However, both of the two copies in Lotus japonicus are divergent based on our threshold criterion (1.00E−135 and 1.00E−107) (supplementary table S1, Supplementary Material online). Among the four groups of Oma1 homologs in sugar beet (bvOma1, the RF-Oma1 family, LOC104888056, and LOC104906603), only bvOma1 is of the conserved type based on our present threshold (1.00E−179), and the remaining three groups are diverged (4.00E−109 to 3.00E−33) (supplementary table S1, Supplementary Material online). All the other Oma1 homologs in the Caryophyllalean genomes that we investigated are conserved, including the two Oma1 copies in C. quinoa (2.00E−177 for each copy). Our data suggest that the conserved type of Oma1 is nearly ubiquitous in flowering plant genomes, and that the diverged copies sporadically occur in a lineage-specific manner.

Phylogenetic Relationships between bvOma1 and the RF-Oma1 Genes

From the results in the previous section, we inferred that the RF-Oma1 genes evolved from a lineage-specific duplicate of Oma1 rather than from an earlier duplicate, such as in the common ancestor of flowering plants (note that the RFL genes evolved before the split of monocot and dicot lineages; Fujii et al. 2011). We next investigated the phylogenetic relationships between the RF-Oma1 genes and other Oma1 homologs from flowering plants. From the 12 RF-Oma1 genes identified to date, we selected two, one from a dominant and one from a recessive allele (orf20 and orf20, respectively). The protein product of orf20 can bind the preSATP6 protein, whereas the product of orf20 does not (Kitazaki et al. 2015). Our analysis also included bvOma1, LOC104888056, and LOC104906603. A total of 46 amino acid sequences, including the five sequences from sugar beet, 40 Oma1 copies of the conserved type from the other genomes, and the Marchantia polymorpha Oma1 (as an outgroup) were subjected to phylogenetic analysis. In both the neighbor-joining and maximum-likelihood trees, the Oma1 homologs form order-specific clusters (fig. 1), although the superrosids and superasterids are not as clearly separated in the neighbor-joining tree. The bootstrap values tend to be lower in nodes closer to the root.
. 1.

Phylogeny of the flowering plant Oma1 genes, inferred from their amino acid sequences. Trees were drawn using the neighbor-joining (A) and maximum-likelihood (B) methods. Scale bars indicate evolutionary distances. Superrosids, superasterids, monocots, and basal magnoliophyta are indicated by R, A, M, and BM, respectively, in parentheses. Bootstrap values are shown near the nodes. Abbreviations of scientific names are: Ah, Amaranthus hypochondriacus; Al, Arabidopsis lyrata; At, Arabidopsis thaliana; Br, Brassica rapa; Bv, Beta vulgaris; Ca, Capsicum annuum; Cc, Cynara cardunculus; Cl, Citrullus lanatus; Cm, Cucumis melo; Cp, Carica papaya; Cq, Chenopodium quinoa; Cr, Capsella rubella; Cs, Citrus sinensis; Eg, Eucalyptus grandis; Fv, Fragaria vesca; Gm, Glycine max; Gr, Gossypium raimondii; Ha, Helianthus annuus; Kf, Kalanchoe fedtschenkoi; Ls, Lactuca sativa; Md, Malus domestica; Me, Manihot esculenta; Mp, Marchantia polymorpha; Mt, Medicago truncatula; Os, Oryza sativa; Ppe, Prunus persica; Pt, Populus trichocarpa; Rc, Ricinus communis; Si, Sesamum indicum; Sl, Solanum lycopersicum; So, Spinacia oleracea; St, Solanum tuberosum; Tc, Theobroma cacao; Vv, Vitis vinifera; Zm, Zea mays.

Phylogeny of the flowering plant Oma1 genes, inferred from their amino acid sequences. Trees were drawn using the neighbor-joining (A) and maximum-likelihood (B) methods. Scale bars indicate evolutionary distances. Superrosids, superasterids, monocots, and basal magnoliophyta are indicated by R, A, M, and BM, respectively, in parentheses. Bootstrap values are shown near the nodes. Abbreviations of scientific names are: Ah, Amaranthus hypochondriacus; Al, Arabidopsis lyrata; At, Arabidopsis thaliana; Br, Brassica rapa; Bv, Beta vulgaris; Ca, Capsicum annuum; Cc, Cynara cardunculus; Cl, Citrullus lanatus; Cm, Cucumis melo; Cp, Carica papaya; Cq, Chenopodium quinoa; Cr, Capsella rubella; Cs, Citrus sinensis; Eg, Eucalyptus grandis; Fv, Fragaria vesca; Gm, Glycine max; Gr, Gossypium raimondii; Ha, Helianthus annuus; Kf, Kalanchoe fedtschenkoi; Ls, Lactuca sativa; Md, Malus domestica; Me, Manihot esculenta; Mp, Marchantia polymorpha; Mt, Medicago truncatula; Os, Oryza sativa; Ppe, Prunus persica; Pt, Populus trichocarpa; Rc, Ricinus communis; Si, Sesamum indicum; Sl, Solanum lycopersicum; So, Spinacia oleracea; St, Solanum tuberosum; Tc, Theobroma cacao; Vv, Vitis vinifera; Zm, Zea mays. The orf20 and orf20 genes are clustered (hereafter referred to as the RF-Oma1 clade) in both trees within the Caryophyllales clade, suggesting that they are duplicates specific to this order (or the lower taxonomic class). In both trees, the internal branches of the RF-Oma1 clade are longer than the branches separating the other Oma1 genes (fig. 1). In the neighbor-joining tree, the sugar beet Oma1 homologs form a species-specific clade with the genes and RF-Oma1 clade branching in the following order: first bvOma1, then LOC104906603, LOC104888056, and the RF-Oma1 clade (fig. 1). In the maximum-likelihood tree, LOC104888056 and the RF-Oma1 clade form a sister group within a larger group containing all the other Caryophyllalean Oma1 genes except that of Amaranthus hypochondriacus (fig. 1).

Gene Arrangements Associated with bvOma1 and the RF-Oma1 Family

Among the B. vulgaris Oma1 homologs, bvOma1 is the likely ortholog of atOma1. In such cases, genes linked with the ortholog would exhibit synteny. We investigated whether the genomic region surrounding bvOma1 is conserved in the Arabidopsis thaliana genome. In the sugar beet reference genome, a total of 30 genes, including bvOma1, span a chromosomal region of ∼487 kbp (supplementary table S3, Supplementary Material online). This region, between LOC104906573 and LOC104906596, is illustrated in figure 2. We used the amino acid sequences of the 30 genes as queries to find homologous genes in the reference genome of Ar. thaliana, and found 12 homologs, including atOma1, clustered on chromosome 5 (supplementary table S3 and fig. 2, Supplementary Material online). The order and orientations of the homologs in Ar. thaliana are consistent with those in B. vulgaris except for atOma1, which is inversed (fig. 2).
. 2.

Microsynteny of chromosomal regions associated with Oma1 homologs. Annotated genes are shown as horizontal arrows. Genes homologous to the B. vulgaris queries are linked by thin lines. A 50 kbp scale bar is shown on the bottom right. The Oma1 homolog in each segment is shown as a red arrow. Note that the Amaranthus Scaffold_7 has no Oma1 homolog. The chromosomal regions with NCBI accession numbers are: Arabidopsis, A. thaliana (NC_003076, from 20979520 to 21080594); Beta, B. vulgaris (NW_017567367, from 71170 to 557770); Spinacia, S. oleracea (NW_018931419, from 994825 to 1414068); Chenopodium NW_018744460, C. quinoa (NW_018744460, from 1428475 to 1649963); Chenopodium NW_018742987, C. quinoa (NW_018744460, from 2410957 to 2638297); Amaranthus Scaffold_7, Amaranthus hypochondriacus (Scaffold_7, from 22668158 to 23111136); and Amaranthus Scaffold_13, A. hypochondriacus (Scaffold_13, from 4279078 to 4673615).

Microsynteny of chromosomal regions associated with Oma1 homologs. Annotated genes are shown as horizontal arrows. Genes homologous to the B. vulgaris queries are linked by thin lines. A 50 kbp scale bar is shown on the bottom right. The Oma1 homolog in each segment is shown as a red arrow. Note that the Amaranthus Scaffold_7 has no Oma1 homolog. The chromosomal regions with NCBI accession numbers are: Arabidopsis, A. thaliana (NC_003076, from 20979520 to 21080594); Beta, B. vulgaris (NW_017567367, from 71170 to 557770); Spinacia, S. oleracea (NW_018931419, from 994825 to 1414068); Chenopodium NW_018744460, C. quinoa (NW_018744460, from 1428475 to 1649963); Chenopodium NW_018742987, C. quinoa (NW_018744460, from 2410957 to 2638297); Amaranthus Scaffold_7, Amaranthus hypochondriacus (Scaffold_7, from 22668158 to 23111136); and Amaranthus Scaffold_13, A. hypochondriacus (Scaffold_13, from 4279078 to 4673615). We conducted the same analysis on other Caryophyllalean genomes (fig. 2). The gene arrangement found in B. vulgaris is well conserved in Spinacia oleracea. We found two syntenic regions in the C. quinoa genome, which may be associated with the polyploidization of this species (see Discussion). Amaranthus hypochondriacus has a different arrangement, with a similar syntenic region on Scaffold_7 but with no Oma1 homolog (fig. 2). In Ama. hypochondriacus, the Oma1 homolog is located on Scaffold_13, where only two genes are common to the syntenic region (fig. 2). We think this gene arrangement is a derived morph (see Discussion). Although the RF-Oma1 genes are unique to B. vulgaris, it is possible that other plants have DNA regions with similar gene arrangements to that of the RF-Oma1 region. We selected 22 sugar beet genes linked to the RF-Oma1 genes as queries (from LOC104888042 to LOC104888100 in supplementary table S4, Supplementary Material online and fig. 3), and used them to investigate the S. oleracea genome. In B. vulgaris strain KWS2320, which has only one RF-Oma1 copy, the genes LOC104888049, LOC104888050, RF-Oma1, and LOC104888089 are clustered together. A similar gene arrangement is also seen in an S. oleracea contig, but the RF-Oma1 (or Oma1-like) gene is missing (supplementary table S4, Supplementary Material online and fig. 3). Instead, three other genes (marked by an asterisk in fig. 3) are located between the LOC104888050 and LOC104888089 homologs in S. oleracea, but their gene products are annotated as abscisic stress-ripening protein 3-like, long noncoding (lnc) RNA, and lysine-rich arabinogalactan protein 19-like, respectively. None of these genes are related to Oma1.
. 3.

Microsynteny of chromosomal segments associated with the RF-Oma1 genes. Annotated genes within the segments are shown by horizontal arrows. Genes homologous to the B. vulgaris queries are linked by thin lines. A 20 kbp scale bar is shown at the bottom right. The RF-Oma1 gene is shown as a purple arrow. We refer to this copy as orf20. An asterisk denotes the S. oleracea genes LOC110796815, LOC110796811, and LOC110797175 (from left to right) that are annotated as abscisic stress-ripening protein 3-like, long noncoding (lnc) RNA, and lysine-rich arabinogalactan protein 19-like, respectively. A plus sign denotes C. quinoa LOC110725390, which is annotated as an lncRNA but is not related to S. oleracea LOC110796811. The chromosomal regions with NCBI accession numbers are: Beta, B. vulgaris (NC_025814, from 2300755 to 2586046); Spinacia, S. oleracea (NW_018931398, from 1613247 to 1362360); Chenopodium NW_018742205, C. quinoa (NW_018742205, from 265231 to 482238); and Chenopodium NW_018743021, C. quinoa (NW_018743021, from 3318973 to 3346398).

Microsynteny of chromosomal segments associated with the RF-Oma1 genes. Annotated genes within the segments are shown by horizontal arrows. Genes homologous to the B. vulgaris queries are linked by thin lines. A 20 kbp scale bar is shown at the bottom right. The RF-Oma1 gene is shown as a purple arrow. We refer to this copy as orf20. An asterisk denotes the S. oleracea genes LOC110796815, LOC110796811, and LOC110797175 (from left to right) that are annotated as abscisic stress-ripening protein 3-like, long noncoding (lnc) RNA, and lysine-rich arabinogalactan protein 19-like, respectively. A plus sign denotes C. quinoa LOC110725390, which is annotated as an lncRNA but is not related to S. oleracea LOC110796811. The chromosomal regions with NCBI accession numbers are: Beta, B. vulgaris (NC_025814, from 2300755 to 2586046); Spinacia, S. oleracea (NW_018931398, from 1613247 to 1362360); Chenopodium NW_018742205, C. quinoa (NW_018742205, from 265231 to 482238); and Chenopodium NW_018743021, C. quinoa (NW_018743021, from 3318973 to 3346398). In C. quinoa, the contig NW_018742205 contains a similar gene arrangement to the RF-Oma1 region (supplementary table S4, Supplementary Material online and fig. 3). As in S. oleracea, the genes homologous to B. vulgaris LOC104888049, LOC104888050, and LOC104888089 are clustered together but a different gene is situated in the position corresponding to that of the RF-Oma1 gene (marked by a plus sign in fig. 3). Although it is annotated as an lncRNA, its nucleotide sequence is not closely related to the S. oleracea counterpart in the same position. Another C. quinoa contig, NW_018743021, contains a much shorter syntenic region containing five genes homologous to genes located near RF-Oma1 in B. vulgaris (supplementary table S4, Supplementary Material online and fig. 3). We found no gene arrangement homologous to the B. vulgaris region LOC104888049-LOC104888050-LOC104888089 in Ama. hypochondriacus or Ar. thaliana (supplementary table S5, Supplementary Material online). Both the bvOma1 and RF-Oma1 regions of B. vulgaris contained members of the PPR protein gene family and the R gene family (supplementary tables S3 and S4, Supplementary Material online). However, no other homologous genes were shared.

Differences in the selection patterns between the RF-Oma1 genes and the conserved type of Oma1

The RF-Oma1 genes appear to have evolved faster than the conserved type of Oma1, as indicated by the longer internal branches within the RF-Oma1 clade in both phylogenetic trees (fig. 1). We therefore investigated the selection patterns for the RF-Oma1 genes and the conserved type of Oma1. The ratio of nonsynonymous to synonymous substitutions in codons (dN/dS) was used as an index of selection. We selected orf20 (whose product had the binding activity) and orf20 (whose product had no binding activity) to represent the RF-Oma1 family, and their codons were aligned with those of the 42 conserved Oma1 genes used in the phylogenetic analysis (supplementary data set 3, Supplementary Material online). We compared two evolutionary models: one assumes that dN/dS is constant (ω0) among all the branches in a phylogenetic tree (supplementary data set 3, Supplementary Material online) (one-ratio model), and the other assumes dN/dS is different between the RF-Oma1 lineage (ω) and the conserved type Oma1 lineage (ωcons-) (two-ratio model). As shown in table 2, the values for ω0, ω and ωcons- are 0.24, 1.23 and 0.22, respectively. Our likelihood ratio test indicated that the difference between the models is significant (P = 2.0E−35).
Table 2.

Summary of Estimates under Different Evolutionary Models

Lineage of GeneType of ModeldN/dS (ω)lnL2ΔlnL
Conserved type Oma1 and RF-Oma1Branch (one ratio)
ω 0 = 0.24
−36,059.34
154.28(P = 2.0E−35)
Branch (two ratios) ωRF1 = 1.23ωcons-Oma1 = 0.22−35,982.20
RF-Oma1 M02.45−2,870.16
M1a0.44−2,856.59

27.14 [M0–M1a]

(P =2.6E−33)

M70.40−2,856.82

26.68 [M0–M7]

(P = 2.4E−07)

M2a3.57−2,781.58

150.02 [M1a–M2a]

(P = 2.6E−33)

M83.58−2,781.58

150.48 [M7–M8]

(P = 2.1E−33)

bvOma1 M00.096−2,159.93
M1a0.071−2,153.13

13.6 [M0–M1a]

(P = 0.0002)

M70.10−2,153.50

12.86 [M0–M7]

(P = 0.0003)

M2a0.10−2,151.25

3.76 [M1a–M2a]

(P = 0.153)

M80.10−2,151.30

4.4 [M7–M8]

(P = 0.111)

Summary of Estimates under Different Evolutionary Models 27.14 [M0–M1a] (P =2.6E−33) 26.68 [M0–M7] (P = 2.4E−07) 150.02 [M1a–M2a] (P = 2.6E−33) 150.48 [M7–M8] (P = 2.1E−33) 13.6 [M0–M1a] (P = 0.0002) 12.86 [M0–M7] (P = 0.0003) 3.76 [M1a–M2a] (P = 0.153) 4.4 [M7–M8] (P = 0.111) In table 2, the ω value is larger than the ωcons- value, suggesting that the RF-Oma1 genes contain sites that undergo positive selection. Therefore, we investigated whether such positively selected sites (with dN/dS > 1) are found in the RF-Oma1 genes. Among the 12 RF-Oma1 genes identified in our previous studies, one is an apparent pseudogene (orf20), and it was excluded from our analysis. The remaining 11 genes were subjected to further study. Their codons were aligned according to the alignment of the amino acid sequences, from which a phylogenetic tree was constructed (supplementary data set 4, Supplementary Material online). We tested five evolutionary models: Model M0 assumes constant dN/dS ratios at all sites, and under this model, the average dN/dS is calculated as 2.45. Models M1a and M7 assume variation in the dN/dS ratios among sites but no positive selection. Under these models, the average dN/dS ratios are 0.44 and 0.40, respectively (table 2). Our likelihood ratio test between M0 and M1a rejected M0 at P = 2.6E−33, and the test between M0 and M7 also rejected M0 (P = 2.4E−07). Models M2a and M8 allow positive selection, and the calculated dN/dS ratios are 3.57 and 3.58, respectively. Our likelihood tests comparing M1a with M2a and M7 with M8 resulted in the rejection of both M1a and M7 (P = 2.6E−33 and P = 2.1E−33, respectively). Therefore, we found evidence that the RF-Oma1 genes include some sites that have undergone positive selection. Our Bayes Empirical Bayes analysis (Yang et al. 2005) revealed a total of 37 amino acid residues that are potential positively selected sites with high posterior probability (>0.95) (supplementary fig. S1, Supplementary Material online and fig. 4). Twenty-six of these sites reside in the predicted transmembrane helices (supplementary fig. S1, Supplementary Material online and fig. 4). Note that RF-Oma1 translation products were detected in the mitochondrial membrane fraction (Kitazaki et al. 2015). We compared the transmembrane helices of all the RF-Oma1 gene products. Of the three transmembrane helices, the N-terminal one is less clear in ORF20NK-198-3, ORF20PI 615522, ORF20NK-305-2, and ORF20NK-219-2, but this does not appear to affect the proteins’ ability to alter the higher order structure of preSATP6 (supplementary fig. S2, Supplementary Material online).
. 4.

Positional relationships between positively selected amino acid residues and transmembrane helices in the RF-Oma1 genes. Vertical and horizontal axes show probability and residue number, respectively. (A) Positively selected codons revealed by Bayes Empirical Bayes analysis. Each vertical line indicates the posterior probability of positive selection on each residue, and is orange if the posterior probability is more than 0.95. (B) Plot showing the probability that each amino acid residue sits within a transmembrane helix (red vertical line), inside the membrane (blue graph line) or outside the membrane (purple graph line).

Positional relationships between positively selected amino acid residues and transmembrane helices in the RF-Oma1 genes. Vertical and horizontal axes show probability and residue number, respectively. (A) Positively selected codons revealed by Bayes Empirical Bayes analysis. Each vertical line indicates the posterior probability of positive selection on each residue, and is orange if the posterior probability is more than 0.95. (B) Plot showing the probability that each amino acid residue sits within a transmembrane helix (red vertical line), inside the membrane (blue graph line) or outside the membrane (purple graph line). We also compared the selection pattern of bvOma1 with that of the RF-Oma1 genes. Because our phylogenetic trees suggested bvOma1 evolution is slower than RF-Oma1, we thought identification of its polymorphic sites would require larger sample size. We chose cultivated B. vulgaris species, which are intercrossable (Biancardi et al. 2020). We amplified cDNA copies of bvOma1 from 57 plants, representing 50 accessions of B. vulgaris (supplementary table S2, Supplementary Material online), and subjected them to direct sequencing. Among the 57 sequences, 19 contained one to several heterozygous sites and 38 were entirely homozygous. We found several groups of identical sequences within these sequences so that the total number of different bvOma1 sequences was 11. Unlike the 11 RF-Oma1 genes, the 11 bvOma1 sequences can be aligned with no indels (supplementary data set 5, Supplementary Material online). The codons were aligned to test the evolutionary models (supplementary data set 5, Supplementary Material online), and the statistics are shown in table 2. Our likelihood tests of M0–M1a and M0–M7 rejected M0 (P = 0.0002 and P = 0.0003, respectively). However, in the tests of M1a–M2a and M7–M8, the null hypotheses (i.e., M1a and M7, respectively) were not rejected (P = 0.153 and P = 0.111, respectively). This suggests differences in the numbers of positively selected codons between bvOma1 and the RF-Oma1 genes.

Phylogenetic Network of the RF-Oma1 Gene Copies

As mentioned previously, the number of clustered RF-Oma1 gene copies varies among sugar beet lines (Moritani et al. 2013; Arakawa et al. 2019b). Unequal crossing over was suggested to explain this polymorphism (Arakawa et al. 2019b). In this study, we adopted a phylogenetic network analysis (Huson and Bryant 2006) to analyze the relationships among the 11 functional RF-Oma1 genes. The results showed a reticulate network, in which five RF-Oma1 genes clustered together while the remaining genes were separated from each other (fig. 5). Huson and Bryant (2006) pointed out that recombination can be one of the mechanisms to explain such topology.
. 5.

Phylogenetic network of 11 RF-Oma1 genes in B. vulgaris. Genes capable of altering the higher order structure of the cognate MS-inducing mitochondrial protein are highlighted in green or red. A scale bar indicates evolutionary distances. The origins of the RF-Oma1 genes are as follows: orf20, orf20, orf20, and orf20 are from sugar beet line NK-198; orf20 and orf20 are from sugar beet line NK-305; orf20 and orf20 are from sugar beet line NK-219mm-O; and orf20, orf20, and orf20 are from sugar beet line TK-81mm-O, sugar beet line PI 615522, and leaf beet accession “Fukkoku-ouba,” respectively (Matsuhira et al. 2012; Ohgami et al. 2016; Arakawa et al. 2018, 2019b).

Phylogenetic network of 11 RF-Oma1 genes in B. vulgaris. Genes capable of altering the higher order structure of the cognate MS-inducing mitochondrial protein are highlighted in green or red. A scale bar indicates evolutionary distances. The origins of the RF-Oma1 genes are as follows: orf20, orf20, orf20, and orf20 are from sugar beet line NK-198; orf20 and orf20 are from sugar beet line NK-305; orf20 and orf20 are from sugar beet line NK-219mm-O; and orf20, orf20, and orf20 are from sugar beet line TK-81mm-O, sugar beet line PI 615522, and leaf beet accession “Fukkoku-ouba,” respectively (Matsuhira et al. 2012; Ohgami et al. 2016; Arakawa et al. 2018, 2019b). The 11 RF-Oma1 genes have been classified into two classes based the properties of their translation products: those capable of altering the higher order structure of preSATP6 protein via a molecular chaperone-like activity (six copies), and those not capable (five copies) (Arakawa T, et al., submitted). The former class is separated into two groups that are distantly positioned in the phylogenetic network: one includes orf20, orf20, orf20, orf20, and orf20, and the other includes only orf20 (fig. 5). The amino acid sequence homology between ORF20NK-198-3 and the proteins encoded by former group is 88–89% (Arakawa T, et al., submitted).

Discussion

Unlike in model eukaryotes such as yeast and mouse, the flowering plant Oma1 does not always occur as a single copy gene. We found that about 35% of the plant genomes listed in table 1 contain multiple Oma1 genes. Whole-genome duplication can partly explain these multiple copies, because polyploidization is common in flowering plants (Soltis et al. 2015). For example, several rounds of whole-genome duplication are known to have occurred in the Fabales (Sato et al. 2008; Schmutz et al. 2010; Young et al. 2011), and this seems to be consistent with the results shown in table 1. Whole-genome duplication may also explain the occurrence of two Oma1 copies in C. quinoa, an allotetraploid species (Jarvis et al. 2017), because two chromosomal segments containing conserved type Oma1 homologs have very similar gene contents and arrangements (fig. 2). Amaranthus hypochondriacus is a paleoallotetraploid species but has only one Oma1 homolog. Although Scaffold_7 and the Scaffold_13 in Ama. hypochondriacus (depicted in fig. 2) are derived from homologous chromosomes, it is possible that a redundant Oma1 gene was lost from the Scaffold_7. However, we speculate that whole-genome duplication does not explain the evolution of the RF-Oma1 genes, because no recent whole-genome duplication event is evident in B. vulgaris (Dohm et al. 2014), but our phylogenetic data suggest that the RF-Oma1 genes evolved relatively recently. Interestingly, bvOma1, the S. oleracea Oma1, and atOma1 are located on chromosomal segments that are syntenic with one another (fig. 2). Therefore, this gene arrangement seems to represent the ancestral organization, and bvOma1 is the likely ortholog of atOma1. In contrast, the gene arrangement in the neighborhood of the RF-Oma1 genes is dissimilar to that of bvOma1. Although both regions contain members of the PPR protein and R gene families, these families are known to be very large, with hundreds of copies per plant genome (Fujii et al. 2011). Therefore, the occurrence of such genes does not necessarily indicate a syntenic relationship. We favor the notion that the first RF-Oma1 gene was generated via a segmental duplication of a small chromosomal region. Considering that all the RF-Oma1 gene copies identified to date are interrupted by two introns whose number and position are similar to those of bvOma1, retrotransposition (Hurles 2004) as the mechanism of duplication is unlikely. Our phylogenetic analysis suggests that the RF-Oma1 lineage diverged after the establishment of the Caryophyllales. Although we have no phylogenetic data to further delimit when the chromosomal segment was duplicated, it may be possible to gain further information from the gene arrangement surrounding the RF-Oma1 genes. We found that the gene arrangement near the locus is conserved in S. oleracea and C. quinoa but not in Ama. hypochondriacus. A phylogenetic study of the Amaranthaceae suggests that Beta, Spinacia, and Chenopodium are closely related to one another and more distantly related to Amaranthus (Kadereit et al. 2006; Fuentes-Bazan et al. 2012). Given this, it seems unlikely that the duplication occurred in the common ancestor of the four species, because this scenario requires a series of events including multiple losses of the duplicated Oma1. The most parsimonious explanation may be that the framework of the gene arrangement surrounding the RF-Oma1 genes had been gradually established before the duplication occurred. However, this is not conclusive because the resolution of our phylogenetic analysis is relatively low due to the lack of genome information for other Caryophyllalean plants, and further investigations are needed. In either of the scenarios, the original RF-Oma1 gene appears to be a lineage-specific paralog. The evolutionary patterns of bvOma1 and the RF-Oma1 family are very different: whereas bvOma1 is a conserved, single copy gene (Arakawa et al. 2019a; this study), the RF-Oma1 genes have diverged in terms of copy number and nucleotide sequence (Arakawa et al. 2019b). Our network analysis provides additional data suggesting the involvement of recombination in the molecular diversity of the RF-Oma1 family. This implies that unequal crossing over between alleles has produced various RF-Oma1 genes. For an RF-Oma1 gene to suppress Owen-type mitochondria, its translation product should bind to the preSATP6 protein (Kitazaki et al. 2015). Our previous study revealed that this activity is an acquired trait of some RF-Oma1 copies because neither bvOma1 nor atOma1 has such activity (Arakawa et al. 2019a). Therefore, the role of recombination in this acquisition is interesting. Our network analysis suggests that the RF-Oma1 genes whose translation products lack the binding activity are more various (fig. 5), raising the possibility that the majority of recombinants are nonsuppressive. This may suggest that the number of nonsuppressive RF-Oma1 genes would increase with each recombination event. As a result, nonsuppressive RF-Oma1 genes might be generated and stacked within alleles, leading to many novel nonrestoring alleles (i.e., recessive rf1 alleles). However, molecular variation among recessive rf1 alleles in sugar beet is very small (Ohgami et al. 2016). In addition, previous genetic studies have shown that the restoring genotype (presumably containing dominant Rf1) is prevalent in the B. vulgaris genetic resources (Touzet 2012). This might indicate selection to maintain suppressive RF-Oma1 copies in the allele, but the details of this have not been investigated. Furthermore, we found two independent groups of suppressive RF-Oma1 genes (fig. 5), and it is unknown whether other types of suppressive RF-Oma1 genes exist in the B. vulgaris genetic resources. Further studies are necessary to clarify the entire molecular diversity of the RF-Oma1 gene family, and to understand what selection pressures have shaped the molecular variation within the family. The molecular evolution of the Rf gene families has not been studied in detail except for the RFL genes, which form a subset of the PPR gene family (Fujii et al. 2011). Therefore, it is valuable to compare the evolutionary patterns of the RFL and RF-Oma1 families. Although their gene products and functions are completely different, they share the features of gene clustering and copy number variation (Kubo et al. 2020). Our study adds another shared feature: the ratios of nonsynonymous to synonymous substitutions are high in both families. As is the case for RFL genes, the RF-Oma1 genes include codons that showed high probabilities of positive selection. In the RFL genes, such codons correspond to amino acid residues that are directly associated with substrate recognition: they are involved in recognizing specific RNA sequences (Fujii et al. 2011). Many of the positively selected codons in the RF-Oma1 alleles are confined to the transmembrane helices. Yeast YTA12 is an m-AAA protease that is also involved in the quality control of mitochondria, and one of its transmembrane helices plays an important role in substrate recognition for dislocating the substrate from the membrane (Lee et al. 2017). Similarly, it may be that the positively selected codons in the RF-Oma1 genes are associated with recognition of the preSATP6 protein to exert the molecular chaperone-like activity. The alteration of amino acid residues in the N-terminal transmembrane helix may change the topology of the protein, but this alteration appears not to affect the suppression function (supplementary fig. S2, Supplementary Material online). Hence, the functional significance of the amino acid substitutions is unclear. Further studies are needed to analyze the relationship between the positively selected codons and substrate specificity, and this will contribute to our understanding of how the dominant Rf1 allele emerged. Knowledge of the OMA1 protein structure would benefit such studies. Positive selection in the RFL family is considered to be adaptive, because variations in substrate specificity resulting from changes in the recognition motifs would be beneficial for coping with novel MS-inducing mitochondria that might emerge over time (Gaborieau et al. 2016). This notion predicts a PPR-type Rf locus with multiple alleles, each counteracting a different MS-inducing mitochondrion. The role of positive selection in the case of the RF-Oma1 family is currently obscure because little is known about how substrate specificity is determined (see above). It is possible that various RF-Oma1 copies could be beneficial to cope with various types of MS-inducing mitochondrial proteins, but to date, Rf1 alleles for different MS-inducing mitochondria have not been found in B. vulgaris. Other Rfs that counteract different kinds of MS-inducing mitochondria have been located on different chromosomes (Laporte et al. 1998; Touzet et al. 2004). The RF-Oma1 and RFL families are sources of Rf genes. Their evolutionary patterns are very similar, although the former appears to have evolved more recently than the latter. It is difficult to infer how the RFL genes originated, and there are questions about what type of duplication was involved, which PPR gene evolved into the first RFL gene (the PPR genes constitute a large family), and what expression pattern was exhibited by the original RFL gene. These questions are related to the initial stages of Rf evolution (i.e., how Rf emerged), and studies of evolutionarily young Rfs may be valuable because remnants of their initial evolutionary processes may be preserved in the genome. We think the RF-Oma1 family represents an early stage of Rf evolution, and the evolutionary pattern of this family could provide some answers to these questions. Considering that RF-Oma1 expression in anthers is more than 480 times higher than in vegetative organs (Arakawa et al. 2019a), the evolution of the RF-Oma1 family recalls that of certain Drosophila genes: paralogs that are expressed specifically in the testes, that undergo positive selection, and encode mitochondrial proteins that possibly counteract male-harming mitochondrial variants (reviewed in Havird et al. 2019). We find it very interesting that paralogs specifically expressed in male-reproductive organs are the source of Rf, and play key roles in the mitonuclear conflict in flowering plants.

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online. Click here for additional data file.
  48 in total

Review 1.  Unveiling the molecular arms race between two conflicting genomes in cytoplasmic male sterility?

Authors:  Pascal Touzet; Françoise Budar
Journal:  Trends Plant Sci       Date:  2004-12       Impact factor: 18.313

2.  PAML 4: phylogenetic analysis by maximum likelihood.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-04       Impact factor: 16.240

Review 3.  The Rf and Rf-like PPR in higher plants, a fast-evolving subclass of PPR genes.

Authors:  Jennifer Dahan; Hakim Mireau
Journal:  RNA Biol       Date:  2013-07-09       Impact factor: 4.652

4.  Selection patterns on restorer-like genes reveal a conflict between nuclear and mitochondrial genomes throughout angiosperm evolution.

Authors:  Sota Fujii; Charles S Bond; Ian D Small
Journal:  Proc Natl Acad Sci U S A       Date:  2011-01-10       Impact factor: 11.205

5.  Curing cytoplasmic male sterility via TALEN-mediated mitochondrial genome editing.

Authors:  Tomohiko Kazama; Miki Okuno; Yuta Watari; Shungo Yanase; Chie Koizuka; Yu Tsuruta; Hajime Sugaya; Atsushi Toyoda; Takehiko Itoh; Nobuhiro Tsutsumi; Kinya Toriyama; Nobuya Koizuka; Shin-Ichi Arimura
Journal:  Nat Plants       Date:  2019-07-08       Impact factor: 15.793

Review 6.  Polyploidy and genome evolution in plants.

Authors:  Pamela S Soltis; D Blaine Marchant; Yves Van de Peer; Douglas E Soltis
Journal:  Curr Opin Genet Dev       Date:  2015-12-02       Impact factor: 5.578

7.  A male sterility-associated mitochondrial protein in wild beets causes pollen disruption in transgenic plants.

Authors:  Masayuki P Yamamoto; Hiroshi Shinada; Yasuyuki Onodera; Chihiro Komaki; Tetsuo Mikami; Tomohiko Kubo
Journal:  Plant J       Date:  2008-03-01       Impact factor: 6.417

8.  PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.

Authors:  Mikita Suyama; David Torrents; Peer Bork
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

9.  A fertility-restoring genotype of beet (Beta vulgaris L.) is composed of a weak restorer-of-fertility gene and a modifier gene tightly linked to the Rf1 locus.

Authors:  Takumi Arakawa; Daisuke Uchiyama; Takashi Ohgami; Ryo Ohgami; Tomoki Murata; Yujiro Honma; Hiroyuki Hamada; Yosuke Kuroda; Kazunori Taguchi; Kazuyoshi Kitazaki; Tomohiko Kubo
Journal:  PLoS One       Date:  2018-06-01       Impact factor: 3.240

10.  Mitochondrial stress is relayed to the cytosol by an OMA1-DELE1-HRI pathway.

Authors:  Xiaoyan Guo; Giovanni Aviles; Yi Liu; Ruilin Tian; Bret A Unger; Yu-Hsiu T Lin; Arun P Wiita; Ke Xu; M Almira Correia; Martin Kampmann
Journal:  Nature       Date:  2020-03-04       Impact factor: 49.962

View more
  1 in total

1.  Migration restores hybrid incompatibility driven by mitochondrial-nuclear sexual conflict.

Authors:  Manisha Munasinghe; Benjamin C Haller; Andrew G Clark
Journal:  Proc Biol Sci       Date:  2022-01-26       Impact factor: 5.349

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.