| Literature DB >> 32385314 |
Xingquan Zeng1,2,3, Tong Xu4, Zhihao Ling4, Yulin Wang1,2,3, Xiangfeng Li4, Shuqing Xu5, Qijun Xu1,2,3, Sang Zha1,2,3, Wangmu Qimei1,2,3, Yuzhen Basang1,2,3, Jiabu Dunzhu1,2,3, Mingzhai Yu1,2,3, Hongjun Yuan1,2,3, Tashi Nyima6,7.
Abstract
Hulless barley (Hordeum vulgare L. var. nudum) is a barley variety that has loose husk cover of the caryopses. Because of the ease in processing and edibility, hulless barley has been locally cultivated and used as human food. For example, in Tibetan Plateau, hulless barley is the staple food for human and essential livestock feed. Although the draft genome of hulless barley has been sequenced, the assembly remains fragmented. Here, we reported an improved high-quality assembly and annotation of the Tibetan hulless barley genome using more than 67X PacBio long-reads. The N50 contig length of the new assembly is at least more than 19 times larger than other available barley assemblies. The new genome assembly also showed high gene completeness and high collinearity of genome synteny with the previously reported barley genome. The new genome assembly and annotation will not only remove major hurdles in genetic analysis and breeding of hulless barley, but will also serve as a key resource for studying barley genomics and genetics.Entities:
Mesh:
Year: 2020 PMID: 32385314 PMCID: PMC7210891 DOI: 10.1038/s41597-020-0480-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Morphology of a Tibetan hulless barley. These pictures show (a) seedling; (b) heading stage; (c) mature stage; (d) filling stage spike; (e) grain of a hulless barley cultivated in Tibet.
Titetan hulless barley genome size estimation and assembly statistics in previous study.
| H. vulgare L. var. nudum | |
|---|---|
| Estimated genome size | 4.48 Gb |
| Total size of assembled scaffolds, >200 bp | 3.89 Gb |
| Total sequence length anchored to chromosomes | 3.48 Gb |
| Percent of chromosomal sequences | 89.41% |
| N50 length, scaffolds | 242Kb |
| Longest scaffold | 3.07 Mb |
| Total size of assembled contigs | 3.64 Gb |
| Longest contig | 276.95Kb |
| N50 length, contig | 18.07Kb |
| GC content | 44.00% |
| Repeat content | 81.39% |
| Number of gene models | 36,151 |
Iso-Seq library information and sequencing results.
| Sample | SRA Accession Number | Platform | Library type | Tissue | Insert size | Polymerase bases (Gb) | Subreads bases (Gb) |
|---|---|---|---|---|---|---|---|
| m170209_043156_42199 | SRR9112621 | PacBio Sequel II | SMRT Bell | root, stem and leaf mixed | 1–2 kb | 1.01 | 0.94 |
| m170209_085125_42199 | SRR9112620 | root, stem and leaf mixed | 1–2 kb | 1.23 | 1.15 | ||
| m170210_073652_42199 | SRR9112625 | root, stem and leaf mixed | 1–2 kb | 1.31 | 1.23 | ||
| m170210_115552_42199 | SRR9112624 | root, stem and leaf mixed | 1–2 kb | 1.41 | 1.32 | ||
| m170209_131022_42199 | SRR9112619 | root, stem and leaf mixed | 2–3 kb | 1.57 | 1.48 | ||
| m170209_173334_42199 | SRR9112618 | root, stem and leaf mixed | 2–3 kb | 1.48 | 1.4 | ||
| m170210_161505_42199 | SRR9112623 | root, stem and leaf mixed | 2–3 kb | 1.40 | 1.32 | ||
| m170210_203418_42199 | SRR9112622 | root, stem and leaf mixed | 2–3 kb | 1.26 | 1.19 | ||
| m170211_005331_42199 | SRR9112627 | root, stem and leaf mixed | 3–6 kb | 1.59 | 1.5 | ||
| m170211_051244_42199 | SRR9112626 | root, stem and leaf mixed | 3–6 kb | 1.60 | 1.52 | ||
| m170214_153228_42199 | SRR9112612 | root, stem and leaf mixed | 3–6 kb | 0.36 | 0.34 | ||
| m170214_195529_42199 | SRR9112611 | root, stem and leaf mixed | 3–6 kb | 0.36 | 0.34 | ||
| m170216_130225_42199 | SRR9112614 | root, stem and leaf mixed | 3–6 kb | 1.39 | 1.31 | ||
| m170228_215450_42199 | SRR9112613 | root, stem and leaf mixed | 5–10 kb | 1.43 | 1.36 | ||
| m170301_021333_42199 | SRR9112616 | root, stem and leaf mixed | 5–10 kb | 1.12 | 1.06 | ||
| m170304_235542_42199 | SRR9112615 | root, stem and leaf mixed | 5–10 kb | 1.25 | 1.19 | ||
| m170305_041614_42199 | SRR9112617 | root, stem and leaf mixed | 5–10 kb | 1.10 | 1.04 | ||
| 20.87 | 19.67 | ||||||
Comparison of the new genome with previously published assemblies of the Tibetan hulless barley genome.
| This study | Zeng | Dai | cv. Morex[ | |
|---|---|---|---|---|
| Sequenced genome size (Gb) | 4.00 | 3.89 | 3.73 | 4.58 |
| Contig N50 (Kb) | 1,563.00 | 18.07 | 5.94 | 79.0 |
| Scaffold N50 (Kb) | 4,006.00 | 242.00 | 171.1 | 1,900.00 |
| Repeat proportion (%) | 87.48 | 81.39 | NA | 80.80 |
| Annotated protein-coding genes | 40,457* | 36,151 | 46,787 | 39,734* |
*Only refer to high confident protein-coding genes.
Fig. 2The workflows of genome assembly and annotation used in this study. (a) Genome assembly pipeline; (b) Protein-coding gene annotation pipeline. Software and tools were indicated at lines, data and database information were shown in rectangles.
Fig. 3Genome comparison between Tibetan hulless barley and barley cv. Morex. (a) The plot shows the LAST alignments of predicted protein-coding genes in barley cv. Morex assembly and Tibetan hulless barley assembly. (b–d) Dot plots show the MUMMER alignments of Tibetan hulless barley scaffolds and assembled barley cv. Morex bacterial artificial chromosome sequences. Different coverages were shown in B (65%), C (85%) and D (99.9%). The differences can be due to either true sequence divergences or assembly errors.
Repetitive element annotation statistics.
| TE class | TE order | TE family | cv. Morex | Tibetan hulless barley (this study) |
|---|---|---|---|---|
| Class I: retrotransposon | LTR retrotransposon | Copia | 22.3 | 23.3 |
| Gypsy | 44.2 | 46.6 | ||
| unclassified LTR | 0.2 | 0.3 | ||
| non-LTR retrotransposon | LINE | 1.3 | 1.3 | |
| SINE | 0 | 0 | ||
| Sum | ||||
| Class II: DNA Transposon | DNA Transposon super-families | 6.5 | 6.1 | |
| Helitron | 0.03 | 0.04 | ||
| other DNA transposon | 1.6 | 1.7 | ||
| Sum | ||||
| Unclassified | 8 | 8.1 | ||
| Total | ||||
| Measurement(s) | RNA • transcriptome • DNA • genome • sequence_assembly • sequence feature annotation |
| Technology Type(s) | RNA sequencing • DNA sequencing • sequence assembly process • sequence annotation |
| Sample Characteristic - Organism | Hordeum vulgare |