| Literature DB >> 30692677 |
Miaomiao Li1,2, Dongfen Zhang1, Qiang Gao3, Yingfeng Luo4, Hui Zhang1,2, Bin Ma3, Chunhai Chen5, Annabel Whibley6, Yu'e Zhang1, Yinghao Cao3, Qun Li1, Han Guo1,2, Junhui Li1,2, Yanzhai Song1,2, Yue Zhang1,2, Lucy Copsey6, Yan Li3, Xiuxiu Li2,3, Ming Qi3, Jiawei Wang7, Yan Chen5, Dan Wang5, Jinyang Zhao5, Guocheng Liu5, Bin Wu5, Lili Yu5, Chunyan Xu5, Jiang Li5, Shancen Zhao5, Yijing Zhang7, Songnian Hu2,4, Chengzhi Liang8,9, Ye Yin10, Enrico Coen11, Yongbiao Xue12,13,14.
Abstract
Snapdragon (Antirrhinum majus L.), a member of the Plantaginaceae family, is an important model for plant genetics and molecular studies on plant growth and development, transposon biology and self-incompatibility. Here we report a near-complete genome assembly of A. majus cultivar JI7 (A. majus cv.JI7) comprising 510 Megabases (Mb) of genomic sequence and containing 37,714 annotated protein-coding genes. Scaffolds covering 97.12% of the assembled genome were anchored on eight chromosomes. Comparative and evolutionary analyses revealed that a whole-genome duplication event occurred in the Plantaginaceae around 46-49 million years ago (Ma). We also uncovered the genetic architectures associated with complex traits such as flower asymmetry and self-incompatibility, identifying a unique duplication of TCP family genes dated to around 46-49 Ma and reconstructing a near-complete ψS-locus of roughly 2 Mb. The genome sequence obtained in this study not only provides a representative genome sequenced from the Plantaginaceae but also brings the popular plant model system of Antirrhinum into the genomic age.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30692677 PMCID: PMC6784882 DOI: 10.1038/s41477-018-0349-9
Source DB: PubMed Journal: Nat Plants ISSN: 2055-0278 Impact factor: 15.793
Fig. 1An overview of the genomic features of A. majus JI7.
Roman numerals refer to: I, duplications of genomic paralogous sequences; II, guanine-cytosine (GC) content; III, simple sequence repeats; IV, gene density; V, retroelement density; VI, recombination rate (deep colour shows high recombination rates) and VII, eight Antirrhinum chromosomes with physical distances including low copy number repetitive elements: telomere repeat TTTAGGG (green), 5S recombinant DNA (orange) and pericentromeric repeats CentA1 and CentA2 (purple). A ruler with marks every 1 Mb is drawn on each chromosome.
Statistics for the Antirrhinum genome and gene annotation
| Estimate of genome size | 520 Mb |
| GC content | 35.50% |
| N50 length (contig) | 0.73 Mb |
| Longest contig | 3.74 Mb |
| Total size of assembled contigs | 510.20 Mb |
| N50 length (scaffold) | 2.62 Mb |
| Longest scaffold | 9.90 Mb |
| Total size of assembled scaffolds | 511.70 Mb |
| Number of genes | 37,714 |
| Average gene length | 3,166 bp |
| Gene density | 73.95 Mb−1 |
| Transcripts number | 52,780 |
| Average coding sequence length | 1,036 bp |
| Average protein length | 344 amino acids |
| Average exon length | 245 bp |
| Average intron length | 314 bp |
| Tandem repeat | 13.03 Mb |
Fig. 2Genome evolution of A. majus.
a, Phylogenetic tree of angiosperm species including their divergence time on the basis of orthologues of single-gene families. The red star highlights the genome duplication in the A. majus lineage. The number in each node indicates Ma between two divergent branches. Green/red numbers indicate expansion and contraction gene families. A. trichopoda was used as an outgroup. Bootstrap values for each node are above 100%. b, Synteny blocks among chromosomes of A. majus, V. vinifera and S. lycopersicum. The numbers represent individual chromosomes. The selected syntenic gene numbers are more than 50 in each block. c, Density distributions of Ks for paralogous genes among A. majus, V. vinifera and S. lycopersicum. d, A Venn diagram of shared orthologues among four species of A. majus, S. lycopersicum, A. thaliana and O. sativa. Each number represents a gene family number shared among the genomes.
Fig. 3Evolution of flower symmetry and TCP gene family.
Left, a phylogenetic tree of the flowering plants derived from their divergence time based on orthologues of single-gene families. Three red stars show the whole-genome triplication and two green stars the duplication events (http://chibba.agtec.uga.edu/duplication/). Right, B represents bilateral flower symmetry and R radial flower symmetry; Asterid, Rosid, Basal rosid, Basal eudicot, Monocot and Basal angiosperm represent the clade names, respectively. Total numbers of TCP family genes, Class I PCF, Class II CIN and Class II CYC/TB1 are shown from left to right. The asterisk indicates the sequenced genome of species of Medicago truncatula with flower radial symmetry, but flowers of most Medicago species have bilateral symmetry. The double asterisk indicates Lotus corniculatus in which three TCP genes were identified but a functional TCP gene was not detected in its genome. Four-pointed stars denote flowers with bilateral symmetry with their photos from PPBC (http://www.plantphoto.cn).
Fig. 4Genomic features of the ψS-locus of A. majus and its synteny with the S-locus regions of A. hispanicum.
a, Chromosomal locations of three scaffolds covering the ψS-locus region of A. majus. A genetic recombination map of chromosome 8 is shown on the top panel. The x axis shows its physical distance (Mb) and the y axis its genetic distance (cM). A schematic representation of chromosome 8 is shown in the middle panel with a red dot indicating its centromere. The ψS-locus is depicted as a blue box on its short arm. A vertical red line in the chromosome indicates the RAD gene. The lower panel shows three scaffolds of Sc29, Sc276 and Sc184 covering the ψS-locus region. b, Transcriptional profiles of the ψS-locus and its flanking regions of A. majus. The light orange shadow denotes the predicted ψS-locus region (SLF1–SLF37). This region between RAD and SLF37 contains a total number of 102 annotated genes. The bottom panel is a schematic representation of the SLF genes. Orange squares indicate the ψSLF genes and green arrows the other annotated genes (I: a putative MYB family transcription factor; II and III, putative RNA-binding proteins and IV, a putative phosphate-dependent transferase). c, The synteny of the S-locus regions between A. majus and S, S, S and S haplotypes of A. hispanicum. Different colours indicate syntenic and inversion regions between the ψS-locus and S (S, S and S), S, S (S, S and S) or S (S and S) haplotypes of A. hispanicum.