| Literature DB >> 35820796 |
Zhenfei Yan1,2, Lijun Sang1,2, Yue Ma1,2, Yong He1,2, Juan Sun1,2, Lichao Ma1,2, Shuo Li1,2, Fuhong Miao1,2, Zixin Zhang1, Jianwei Huang3, Zengyu Wang4,5, Guofeng Yang6,7.
Abstract
BACKGROUND: Red clover (Trifolium pratense L.) is a diploid perennial temperate legume with 14 chromosomes (2n = 14) native to Europe and West Asia, with high nutritional and economic value. It is a very important forage grass and is widely grown in marine climates, such as the United States and Sweden. Genetic research and molecular breeding are limited by the lack of high-quality reference genomes. In this study, we used Illumina, PacBio HiFi, and Hi-C to obtain a high-quality chromosome-scale red clover genome and used genome annotation results to analyze evolutionary relationships among related species.Entities:
Keywords: De novo assembly; Genome; Genome annotation; PacBio HiFi; Trifolium pretense
Mesh:
Year: 2022 PMID: 35820796 PMCID: PMC9277957 DOI: 10.1186/s12870-022-03707-5
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 5.260
Fig. 1Plant morphology and Hi-C-assisted genome assembly of red clover. a Phenotype of the sequenced red clover plant. b Hi-C interaction heatmap showing 100-kb resolution super scaffolds
Summary statistic for the Trifolium pratense genome
| Assembly | ||
|---|---|---|
| Genome assembly | Estimated genome size | 456Mbp |
| Total length of assembly | 423Mbp | |
| Number of contigs | 194 | |
| Contig N50 | 13Mbp | |
| Largest contig | 34Mbp | |
| Number of scaffolds | 96 | |
| Scaffold N50 | 55Mbp | |
| Chromosome coverage (%) | 97.32% | |
| GC content of genome | 33.6% | |
| Annotation | ||
| Total length | ||
| Transposable elements | Total | 224Mbp (52.81%) |
| Retrotransposon | 140Mbp (33.04%) | |
| DNA Transposon | 37Mbp (8.69%) | |
| Copies | ||
| Noncoding RNAs | rRNAs | 13,053 |
| tRNAs | 1281 | |
| miRNAs | 477 | |
| snRNAs | 990 | |
| Gene models | Number of genes | 44,588 |
| Mean gene length | 3620 bp | |
| Mean coding sequence length | 1585 bp |
The information of annotated gene models per species for all the species
| Organism | Number | Mean CDS | Exons per | Mean exon | Mean intro |
|---|---|---|---|---|---|
| 29,006 | 1430 | 7.6 | 293 | 449 | |
| 54,881 | 1391 | 8 | 295 | 413 | |
| 119,102 | 306 | 1.4 | 219 | 172 | |
| 28,772 | 1393 | 7.7 | 291 | 418 | |
| 36,079 | 1428 | 6.9 | 324 | 393 | |
| 44,588 | 1585 | 5 | 348 | 440 |
Fig. 2Venn analysis of five major databases (NR, Swiss-Prot, eggNOG, GO, KEGG) containing gene function annotation information
Fig. 3Gene family and phylogenetic tree analyses of red clover and other representative plant genomes. a Venn diagram of the number of shared gene families. b A phylogenetic tree based on shared single-copy gene families (left), gene family expansions and contractions among red clover and seven other species (middle), and Gene family clustering in red clover and seven other plant genomes (right). c Genome-wide replication Ks distribution map of red clover and its related species. d Genome-wide replication Ks analysis of red clover
Fig. 4Features of the red clover and M.truncatula genome. a Length of each pseudochromosome (Mb). b Distribution of repetitive sequence. c Distribution of gene density. d Distribution of the GC content (e) Red clover and M.truncatula synteny analysis; the beginning of NC represents the chromosome of M.truncatula, while the beginning of CHR represents the chromosome of red clover