| Literature DB >> 30683940 |
Junpeng Shi1, Xuxu Ma1, Jihong Zhang1, Yingsi Zhou1, Minxuan Liu2, Liangliang Huang1, Silong Sun1, Xiangbo Zhang1, Xiang Gao1, Wei Zhan3, Pinghua Li4, Lun Wang5, Ping Lu2, Haiming Zhao1, Weibin Song6, Jinsheng Lai7,8.
Abstract
Broomcorn millet (Panicum miliaceum L.) has strong tolerance to abiotic stresses, and is probably one of the oldest crops, with its earliest cultivation that dated back to ca. ~10,000 years. We report here its genome assembly through a combination of PacBio sequencing, BioNano, and Hi-C (in vivo) mapping. The 18 super scaffolds cover ~95.6% of the estimated genome (~887.8 Mb). There are 63,671 protein-coding genes annotated in this tetraploid genome. About ~86.2% of the syntenic genes in foxtail millet have two homologous copies in broomcorn millet, indicating rare gene loss after tetraploidization in broomcorn millet. Phylogenetic analysis reveals that broomcorn millet and foxtail millet diverged around ~13.1 Million years ago (Mya), while the lineage specific tetraploidization of broomcorn millet may be happened within ~5.91 million years. The genome is not only beneficial for the genome assisted breeding of broomcorn millet, but also an important resource for other Panicum species.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30683940 PMCID: PMC6347627 DOI: 10.1038/s41467-018-07876-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
The statistics of genome assembly and protein-coding genes in broomcorn millet
| Contigs | Scaffolds | Super scaffolds | |
|---|---|---|---|
|
| |||
| Numbers | 1308 | 905 | 475 |
| Total length | ~838.9 Mb | ~848.4 Mb | ~848.4 Mb |
| N50 | ~2.55 Mb | ~8.24 Mb | ~48.26 Mb |
| Longest | 19,200,716 bp | 22,633,379 bp | 69,183,459 bp |
| Coverage | ~94.5% | ~95.6% | ~95.6% |
| GC content | ~46.80% | ||
|
| |||
| Numbers of genes | 63,671 | ||
| Gene space (coverage) | 214,072,228 bp (~24.1%) | ||
| Number of transcripts | 86,387 | ||
| Mean transcript length | ~2883 bp | ||
| Mean CDS length | ~1023 bp | ||
| Mean intron length | ~1270 bp |
Fig. 1The Hi-C assisted assembly of Longmi4 pseudomolecules. a Heatmap showing Hi-C interactions under the resolution of 200 kb, and the anti-diagonal pattern for the intrachromosomal interactions may reflect the Rabl configuration of chromatins. b Genome comparison between broomcorn millet (pm1–pm18) and foxtail millet (si1–si9). Each dot represented a homologous sequence reported by Mummer, and the two orange circles referred to two intrachromosomal inversions. The green circles represented the quartet of interchromosomal exchanges that may be happened before tetraploidization. Source Data of Fig. 1b are provided as a Source Data file
Fig. 2Gene loss and retentions in broomcorn millet. a sliding window approach with window size of 100 syntenic genes and step size of 10 syntenic genes was used to show the percentage of retained genes in subgenome1 (red), subgenome2 (blue), and both (yellow) in broomcorn millet using foxtail millet (a~i) as a reference. In total, there were 19,609 genes in foxtail millet that were syntenic with at least one subgenome in broomcorn millet, among which 16,884 (~86.2%) syntenic genes have two homologous copies retained in broomcorn millet. Source Data are provided as a Source Data file
Fig. 3The activity of LTR retrotransposons in Paniceae. The insertion time of Gypsy (a) and Copia (b) superfamilies were calculated in broomcorn millet (P. miliaceum), foxtail millet (S. italica), pearl millet (P. glaucum), sorghum (S. bicolor), and maize (Z. mays), respectively
Fig. 4The phylogeny of Paniceae. a Ks distribution of orthologous and paralogous genes in Paniceae. We further confirmed two WGD events in foxtail millet (b) and broomcorn millet (c), with the paralogous genes in green circles referred to the WGD shared by all grass, and the paralogous genes in orange circles referred to lineage-specific tetraploidization in broomcorn millet. d The phylogeny of Paniceae inferred from Ks distributions. The timeline was calculated based on the divergence between sorghum and maize (Ks ~ 0.152, 11.9 Mya). The WGD events were marked by stars. Source Data of Fig. 4a are provided as a Source Data file