| Literature DB >> 35022669 |
Areej Sakkour1, Martin Mascher2,3, Axel Himmelbach2, Georg Haberer4, Thomas Lux4, Manuel Spannagl4, Nils Stein2,5, Shoko Kawamoto6, Kazuhiro Sato1.
Abstract
Cultivated barley (Hordeum vulgare ssp. vulgare) is used for food, animal feed, and alcoholic beverages and is widely grown in temperate regions. Both barley and its wild progenitor (H. vulgare ssp. spontaneum) have large 5.1-Gb genomes. High-quality chromosome-scale assemblies for several representative barley genotypes, both wild and domesticated, have been constructed recently to populate the nascent barley pan-genome infrastructure. Here, we release a chromosome-scale assembly of the Japanese elite malting barley cultivar 'Haruna Nijo' using a similar methodology as in the barley pan-genome project. The 4.28-Gb assembly had a scaffold N50 size of 18.9 Mb. The assembly showed high collinearity with the barley reference genome 'Morex' cultivar, with some inversions. The pseudomolecule assembly was characterized using transcript evidence of gene projection derived from the reference genome and de novo gene annotation achieved using published full-length cDNA sequences and RNA-Seq data for 'Haruna Nijo'. We found good concordance between our whole-genome assembly and the publicly available BAC clone sequence of 'Haruna Nijo'. Interesting phenotypes have since been identified in Haruna Nijo; its genome sequence assembly will facilitate the identification of the underlying genes.Entities:
Keywords: zzm321990 Hordeum vulgarezzm321990 ; RNA-Seq; full-length cDNA; genome sequencing; pseudomolecules
Mesh:
Year: 2022 PMID: 35022669 PMCID: PMC8798153 DOI: 10.1093/dnares/dsac001
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.477
Statistics of ‘Haruna Nijo’ and two versions of ‘Morex’ assemblies
| Parameter | ‘Haruna Nijo’ | ‘Morex’V2 | ‘Morex’V3 |
|---|---|---|---|
| Number of scaffolds in pseudomolecules | 552 | 273 | 103 |
| Pseudomolecule size (Gb) | 4.28 | 4.34 | 4.20 |
| Scaffold N50 | 18.9 | 43.7 | 118.9 |
| Scaffold N90 [Mb] | 2.6 | 5.9 | 21.8 |
| Cumulative size of unanchored scaffold (Mb) | 154.3 | 82.9 | 29.1 |
‘Scaffold’ refers to top-level entities that constitute the pseudomolecules. In ‘Morex’V3, these are Bionano scaffolds of PacBio HiFi contigs; in the other assemblies, superscaffolds were constructed from PE, MP, and 10X data.
Figure 1Alignment of pseudomolecules of ‘Haruna Nijo’ to ‘Morex’V3 individual chromosomes.
BUSCO statistics of ‘Haruna Nijo’
| Factor | Scaffolds | Pseudomolecule |
|---|---|---|
| Complete BUSCOs | 1,403 (97.5%) | 1,396 (96.9%) |
| Complete BUSCOs: single copy | 1,382 (96.0%) | 1,378 (95.7%) |
| Complete BUSCOS: duplicated | 21 (1.3%) | 18 (1.2%) |
| Fragmented BUSCOs | 14 (1.0%) | 14 (1.0%) |
| Missing BUSCOs | 23 (1.5%) | 30 (2.1%) |
| Total BUSCO groups searched | 1,440 | 1,440 |
De novo gene annotation statistics
| Statistics | Complete sequences | High confidence | Low confidence |
|---|---|---|---|
| Number of genes | 161,721 | 49,524 | 112,197 |
| Number of monoexonic genes | 67,724 | 12,645 | 55,079 |
| Number of transcripts | 181,980 | 68,751 | 113,229 |
| Transcripts per gene | 1.13 | 1.39 | 1.01 |
| cDNA lengths (mRNAs) | 1,294 | 1,696 | 1,050 |
| CDS lengths (mRNAs) | 1,154 | 1,377 | 1,018 |
| Exons per transcript (mRNAs) | 3.45 | 5.21 | 2.38 |
| Exon lengths (mRNAs) | 375 | 326 | 441 |
| Intron lengths (mRNAs) | 675 | 623 | 770 |
| CDS exons per transcript (mRNAs) | 3.33 | 4.95 | 2.35 |
| CDS exon lengths | 346 | 278 | 434 |
| 5' UTR exon number | 54,193 | 48,584 | 5,609 |
| 3' UTR exon number | 52,989 | 44,690 | 8,299 |
Figure 2BUSCO assessment results of ‘Haruna Nijo’ fl-cDNA sequences (upper), high-confidence genes (middle), and low-confidence genes (lower).
BLASTN hits (
| Target | Query | ||
|---|---|---|---|
| Full-length cDNA | Gene projection |
| |
| Full-length cDNA | 22,651 | 25,977 | 28,415 |
| Gene projection | 19,711 | 47,367 | 43,087 |
|
| 19,636 | 42,336 | 49,524 |
| Total hits | 19,937 | 42,753 | 44,387 |
| Ratio (total hits/number of queries) | 0.880 | 0.903 | 0.896 |
Figure 3Alignment of ‘Haruna Nijo’ BAC sequences of Btr, Qsd1, and Vrs1 regions to pseudomolecules of ‘Haruna Nijo’ and ‘Morex’V3.