Yasin Kaya1, Zübeyde Uğurlu Aydın1, Xu Cai2, Xiaowu Wang2, Ali A Dönmez1.
Abstract
The Aubrieta canescens complex is divided into two subspecies, Au. canescens subsp. canescens and Au. canescens subsp. cilicica, and a distinct species, Au. macrostyla, based on molecular phylogeny. We generated draft assemblies of Au. canescens subsp. canescens and Au. macrostyla using paired-end shotgun sequencing. This is the first attempt at genome characterization for the genus. In the present study, ~165 and ~157 Mbp of the genomes of Au. canescens subsp. canescens and Au. macrostyla were assembled, respectively, and 32 425 and 31 372 gene models were predicted in the two genomes, respectively. We corroborated the phylogenomic affinity of the taxa with some core Brassicaceae species (Clades A and B), including Arabis alpina. The orthology-based tree suggested that the Aubrieta species differentiated from A. alpina 1.3-2.0 million years ago (mya). The genome-wide syntenic comparison of the two Aubrieta taxa revealed that Au. canescens subsp. canescens (46 %) and Au. macrostyla (45 %) have an almost identical syntenic gene pair ratio. These novel genome assemblies are the first steps towards a chromosome-level assembly of Au. canescens and towards understanding genome diversity within the genus.
Keywords: Arabideae; Arabis; Aubrieta; Brassicaceae; genome evolution; whole-genome sequencing
Year: 2022 PMID: 36196394 PMCID: PMC9521481 DOI: 10.1093/aobpla/plac035
Source DB: PubMed Journal: AoB Plants Impact factor: 3.138
Figure 1. General habit and comparative gene density, repeat composition and polymorphism diversity across the genomes of Aubrieta canescens subsp. canescens and Au. macrostyla. (A) Flower and fruit morphology of the Au. canescens complex. (B) Gene density across the chromosomes. (C) Repeat composition. (D) Single nucleotide polymorphisms, multiple nucleotide polymorphisms, insertion, deletion and indel mutations across the genome.
Assembly statistics of Aubrieta canescens subsp. canescens and Au. macrostyla.
| Assembly statistics | Au. canescens subsp. canescens | Au. macrostyla |
|---|---|---|
| Assembly strategies | Iterative and | Iterative and |
| Number of scaffolds | 98 | 105 |
| Longest scaffold (kb) | 29 | 28 |
| Guanine-cytosine content (%) | 34.63 | 34.85 |
| Mapping accuracy (%) | 98.4 | 97.7 |
| Complete BUSCOs percentage (%) | 90 | 88 |
| N50 (kb) | 19.7 | 18.9 |
| N75 (kb) | 17.8 | 16.6 |
| Assembled genome size (Mb) | 165.0 | 156.9 |
| Assembled contig numbers | 818 200 | 647 738 |
| Number of nucleotides per 100 kb | 1862.51 | 1821.26 |
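The N50 and N75 rows above are contiguity summaries. As a minimal sketch of the usual definition (the length L such that sequences of length L or longer cover at least 50 % or 75 % of the assembly), and not necessarily the tool the authors used, the function below computes such a statistic from a list of sequence lengths. The function name `nx` and the toy lengths are illustrative assumptions, not data from the study.

```python
def nx(lengths, fraction):
    """Return the Nx statistic: the length L such that sequences of
    length >= L together cover at least `fraction` of the total length."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered length.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= fraction * total:
            return length


# Hypothetical contig lengths (arbitrary units), total = 30.
toy_lengths = [10, 8, 6, 4, 2]
n50 = nx(toy_lengths, 0.50)  # threshold 15 is reached at the second contig
n75 = nx(toy_lengths, 0.75)  # threshold 22.5 is reached at the third contig
print(n50, n75)
```

Under this definition N75 can never exceed N50, which is consistent with the values reported for both assemblies.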
Chromosomal organization of gene models in Aubrieta canescens subsp. canescens and Au. macrostyla.
| Chromosome | Gene model numbers of Au. canescens subsp. canescens | Gene model numbers of Au. macrostyla |
|---|---|---|
| Chr 1 | 3 989 | 3 875 |
| Chr 2 | 2 892 | 2 698 |
| Chr 3 | 3 960 | 4 033 |
| Chr 4 | 4 503 | 4 295 |
| Chr 5 | 3 441 | 3 306 |
| Chr 6 | 3 400 | 3 242 |
| Chr 7 | 3 971 | 3 686 |
| Chr 8 | 5 852 | 5 844 |
Repetitive sequences in the Aubrieta canescens subsp. canescens and Au. macrostyla genome assemblies.
| Repeat class | Repeat subclass | Repeat size of Au. canescens subsp. canescens (bp) | Repeat size of Au. macrostyla (bp) |
|---|---|---|---|
| Retrotransposons | | 11 644 448 | 9 586 050 |
| | SINEs | 446 655 | 440 701 |
| | LINEs | 2 915 226 | 2 741 343 |
| | L1/CIN4 (LINE) | 2 910 180 | 2 735 455 |
| | Long terminal repeat elements | 8 282 567 | 6 404 006 |
| | Ty1/Copia (LTR) | 3 096 604 | 2 691 881 |
| | Gypsy/DIRS1 (LTR) | 4 957 195 | 3 510 223 |
| DNA transposons | | 5 110 621 | 4 343 654 |
| | hobo-Activator | 1 240 611 | 1 130 175 |
| | Tc1-IS630-Pogo | 1 229 977 | 964 432 |
| | Tourist/Harbinger | 526 370 | 469 081 |
| Small RNA | | 485 668 | 481 167 |
| Satellite DNA | | 10 493 | 10 493 |
| Simple sequence repeat (microsatellite) | | 2 825 779 | 2 602 091 |
| Low-complexity DNA | | 965 076 | 894 604 |
| Unclassified | | 529 526 | 428 768 |
| Interspersed repeats | | 17 284 595 | 14 358 472 |
| Total masked TE | | 21 112 636 bp (12.79 %) | 17 896 883 bp (11.41 %) |
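The repeat table is internally hierarchical, and the arithmetic can be checked directly. The sketch below is an illustration built only from the tabulated values, not the authors' pipeline. It verifies that SINEs, LINEs and long terminal repeat elements sum to the retrotransposon total, that retrotransposons, DNA transposons and unclassified repeats sum to the interspersed-repeat total, and that the masked percentage follows from the assembled genome sizes (165.0 and 156.9 Mb). The dictionary layout and variable names are assumptions made for the example.

```python
# Repeat sizes in bp, copied from the table above.
# Tuple order: (Au. canescens subsp. canescens, Au. macrostyla).
repeats = {
    "SINEs": (446_655, 440_701),
    "LINEs": (2_915_226, 2_741_343),
    "LTR elements": (8_282_567, 6_404_006),
    "Retrotransposons": (11_644_448, 9_586_050),
    "DNA transposons": (5_110_621, 4_343_654),
    "Unclassified": (529_526, 428_768),
    "Interspersed repeats": (17_284_595, 14_358_472),
    "Total masked": (21_112_636, 17_896_883),
}
# Assembled genome sizes in Mb, from the assembly statistics table.
assembly_mb = (165.0, 156.9)
taxa = ("Au. canescens subsp. canescens", "Au. macrostyla")

checks = []
for i, taxon in enumerate(taxa):
    r = {name: sizes[i] for name, sizes in repeats.items()}
    # Subclass totals should add up to the class total.
    retro_ok = (r["SINEs"] + r["LINEs"] + r["LTR elements"]
                == r["Retrotransposons"])
    # Interspersed repeats = retrotransposons + DNA transposons + unclassified.
    inter_ok = (r["Retrotransposons"] + r["DNA transposons"]
                + r["Unclassified"] == r["Interspersed repeats"])
    # Masked share of the assembly, in percent.
    masked_pct = 100 * r["Total masked"] / (assembly_mb[i] * 1e6)
    checks.append((retro_ok, inter_ok, masked_pct))
    print(f"{taxon}: retro sum ok={retro_ok}, "
          f"interspersed sum ok={inter_ok}, masked={masked_pct:.2f} %")
```

Because the genome sizes are rounded to 0.1 Mb, the recomputed percentages may differ from the reported 12.79 % and 11.41 % in the last decimal place.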
Figure 2. Genome evolution and comparative genomic analyses. (A) Homology-based phylogenetic tree of the Au. canescens complex and other Brassicaceae taxa. Gene numbers in nodes represent gene duplication events, and decimals in blue indicate node age. (B) Venn diagram showing the number of orthologous genes in Au. canescens subsp. canescens along with Arabis alpina, Brassica rapa, Camelina sativa, Arabidopsis thaliana, Arabidopsis lyrata and Au. macrostyla. (C) Relationships of syntenic genes between A. alpina and Au. canescens subsp. canescens, and Au. macrostyla. According to the identity scale, highly matched DNA sequences are indicated by dark green to light green dots (50–100 %), moderately matched sequences by orange dots (25–50 %) and poorly matched sequences by yellow dots (0–25 %). Sequences that did not match are shown in white.
Syntenic orthologous genes between Arabis alpina and Aubrieta canescens, and between Arabis alpina and Au. macrostyla. *Number of overlapped sequences.
| Syntenic orthologs to A. alpina | Au. canescens arrays* | Au. canescens bp | Au. canescens genes | Au. macrostyla arrays* | Au. macrostyla bp | Au. macrostyla genes |
|---|---|---|---|---|---|---|
| Chr1 | 377 339 | 7 194 247 | 1 926 | 424 768 | 7 540 313 | 1 788 |
| Chr2 | 485 449 | 5 466 368 | 1 414 | 423 693 | 4 799 368 | 1 230 |
| Chr3 | 296 393 | 7 323 310 | 1 853 | 274 384 | 6 922 344 | 1 797 |
| Chr4 | 628 093 | 7 719 562 | 1 544 | 610 820 | 7 587 290 | 1 563 |
| Chr5 | 416 086 | 5 828 293 | 1 470 | 390 924 | 5 661 399 | 1 599 |
| Chr6 | 303 688 | 5 863 068 | 1 707 | 275 829 | 5 454 359 | 1 597 |
| Chr7 | 459 382 | 6 696 666 | 1 524 | 443 276 | 6 433 374 | 1 436 |
| Chr8 | 676 307 | 10 542 223 | 1 717 | 647 019 | 10 509 024 | 1 815 |
Comparative homology between Arabis alpina and the studied taxa, Aubrieta canescens subsp. canescens and Au. macrostyla.
| Taxa | Number of genes | Number of removed tandem genes | Orthologous genes to A. alpina | Non-orthologs to A. alpina | Orthologous-based syntenic genes to A. alpina |
|---|---|---|---|---|---|
| Au. canescens subsp. canescens | 32 425 | 216 | 19 373 | 13 052 | 13 155 |
| Au. macrostyla | 31 372 | 2 390 | 20 177 | 11 195 | 12 825 |
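The homology figures can be cross-checked against the per-chromosome synteny table. The sketch below is a consistency check written for illustration, not part of the published analysis. It reads the dotted figures in the tables as thousands separators (an assumption), then confirms that orthologs plus non-orthologs equal the gene total and that the per-chromosome syntenic gene counts sum to the syntenic total for each taxon. The variable names are hypothetical.

```python
# Values copied from the comparative homology table.
homology = {
    "Au. canescens subsp. canescens": {
        "genes": 32_425, "orthologs": 19_373,
        "non_orthologs": 13_052, "syntenic": 13_155,
    },
    "Au. macrostyla": {
        "genes": 31_372, "orthologs": 20_177,
        "non_orthologs": 11_195, "syntenic": 12_825,
    },
}
# Syntenic gene counts for A. alpina Chr1..Chr8, from the synteny table.
per_chr_syntenic = {
    "Au. canescens subsp. canescens":
        [1926, 1414, 1853, 1544, 1470, 1707, 1524, 1717],
    "Au. macrostyla":
        [1788, 1230, 1797, 1563, 1599, 1597, 1436, 1815],
}

results = {}
for taxon, row in homology.items():
    # Every predicted gene is either orthologous to A. alpina or not.
    partition_ok = row["orthologs"] + row["non_orthologs"] == row["genes"]
    # Chromosome-level syntenic counts should add up to the table total.
    synteny_ok = sum(per_chr_syntenic[taxon]) == row["syntenic"]
    # Share of gene models with an A. alpina ortholog, in percent.
    ortholog_pct = 100 * row["orthologs"] / row["genes"]
    results[taxon] = (partition_ok, synteny_ok, ortholog_pct)
    print(f"{taxon}: partition ok={partition_ok}, "
          f"synteny sum ok={synteny_ok}, orthologs={ortholog_pct:.1f} %")
```

Both checks pass for both taxa, which supports the column order used when reading the synteny table: the first three value columns belong to Au. canescens and the last three to Au. macrostyla.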