| Literature DB >> 35764640 |
José Cerca1, Bent Petersen2,3, José Miguel Lazaro-Guevara4, Angel Rivera-Colón5, Siri Birkeland6,7, Joel Vizueta8, Siyu Li9, Qionghou Li4, João Loureiro10, Chatchai Kosawang11, Patricia Jaramillo Díaz12,13, Gonzalo Rivas-Torres14,15,16,17, Mario Fernández-Mazuecos18, Pablo Vargas19, Ross A McCauley20, Gitte Petersen21, Luisa Santos-Bay2, Nathan Wales22, Julian M Catchen5, Daniel Machado23, Michael D Nowak7, Alexander Suh24,25, Neelima R Sinha9, Lene R Nielsen11, Ole Seberg26, M Thomas P Gilbert27,2, James H Leebens-Mack28, Loren H Rieseberg4, Michael D Martin29.
Abstract
The repeated, rapid and often pronounced patterns of evolutionary divergence observed in insular plants, or the 'plant island syndrome', include changes in leaf phenotypes, growth, as well as the acquisition of a perennial lifestyle. Here, we sequence and describe the genome of the critically endangered, Galápagos-endemic species Scalesia atractyloides Arnot., obtaining a chromosome-resolved, 3.2-Gbp assembly containing 43,093 candidate gene models. Using a combination of fossil transposable elements, k-mer spectra analyses and orthologue assignment, we identify the two ancestral genomes, and date their divergence and the polyploidization event, concluding that the ancestor of all extant Scalesia species was an allotetraploid. There are a comparable number of genes and transposable elements across the two subgenomes, and while their synteny has been mostly conserved, we find multiple inversions that may have facilitated adaptation. We identify clear signatures of selection across genes associated with vascular development, growth, adaptation to salinity and flowering time, thus finding compelling evidence for a genomic basis of the island syndrome in one of Darwin's giant daisies.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35764640 PMCID: PMC9240058 DOI: 10.1038/s41467-022-31280-w
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Chromosome-resolved assembly of the Scalesia atractyloides nuclear genome.
A Link density histogram, with 34 linkage groups (chromosome models) identified by contiguity ligation sequencing. The x and y axes show mapping positions of the first and second read in read pairs. B Viridiplantae BUSCO set, which offers a characterisation of conserved orthologue genes.
Fig. 2Subgenomes and evolutionary history of Scalesia.
A Circos plots displaying the 34 chromosome models in the assembly. Pairs are organised to the left and right from the top, and have the same colour coding. B Selected families of transposable elements (TEs) that are differently represented on each subgenome (17 pairwise comparisons; each TE is labelled after the assembly (scaAtr), round and family number (2–95) after RepeatMasker). These TE families were likely active while the two subgenomes were separated, being thus unevenly represented. These highlight subgenome identification. Each data point corresponds to a chromosome in a subgenome (subgenome A in blue and B in orange). Chromosome pairs are linked by grey lines. C Single-copy ortholog phylogeny of the studied Asteraceae genome assemblies. Node ages are provided to the right of each node, as well as the predicted time for the polyploidization event. D Pairwise sequentially Markovian coalescent (PSMC) estimation of the demographic history of Scalesia atractyloides, two other Scalesia species, and two members of the Pappobolus genus, which is the sister taxon to Scalesia using bootstrapping (100 replicates). PSMC runs for the whole genome and subgenomes yielded similar results (Supplementary Fig. 03).
Fig. 3Subgenome evolution and characterisation.
A Ideogram with gene and transposable element distribution in 25-kbp bins. Gene density is plotted in chromosome representations and transposable element distribution is plotted to the side of each chromosome in black. Chromosomes are arranged in homoelogous pairs. B The number of isoforms detected for each subgenome. Each data point corresponds to a chromosome in a subgenome (subgenome A in blue and B in orange; 17 pairwise comparisons; subgenome A: min = 5176; max = 8817, avg = 7175; Subgenome B: min = 4277; max = 7982, avg = 6103). Chromosome pairs are linked by grey lines. C Number of genes detected for each subgenome (17 pairwise comparisons; subgenome A: min = 1571; max = 1571, avg = 1327; Subgenome B: min = 944; max = 1474, avg = 1181). D The number of pseudogenes detected for each subgenome (17 pairwise comparisons; Subgenome A: min = 196; max = 315, avg = 270; subgenome B: min = 186; max = 312, avg = 230). E Length of transposable elements detected for each subgenome (17 pairwise comparisons; subgenome A: min = 59,440,194; max = 97,142,619, avg = 78,545,717; Subgenome B: min = 57,631,669; max = 91,646,014, avg = 76,870,839).
Fig. 4Chromosome stability plots.
These reveal the role of inversions and translocations in the differential in each subgenome. A Chromosome stability plot between the two Scalesia subgenomes and the Helianthus annuus genome. Each line connects a pair of orthologous genes, colour-coded by chromosome pair. B Chromosome stability between the two Scalesia subgenomes. Each line connects orthologous genes in subgenomes A and B, colour-coded by chromosome pair.
Fig. 5Positive selection and gene-family expansion across the Scalesia atractyloides genome.
A GO term enrichment of the genes under selection across the genome. GO terms assigned to at least four genes are labelled. Size refers to the number of genes associated with a particular GO term. B GO term enrichment of the genes belonging to expanded gene families across the genome according to a CAFE analysis. Only GOs within a group of three or more overlapping circles are included. Uniqueness measures the degree to which a particular GO term is distinct relative to the whole list.