| Literature DB >> 32241919 |
Miriam Schreiber1, Martin Mascher2,3, Jonathan Wright4, Sudharasan Padmarasu2, Axel Himmelbach2, Darren Heavens4, Linda Milne5, Bernardo J Clavijo4, Nils Stein2,6, Robbie Waugh7,8,9.
Abstract
Barley (Hordeum vulgare) is one of the most important crops worldwide and is also considered a research model for the large-genome small grain temperate cereals. Despite genomic resources improving all the time, they are limited for the cv Golden Promise, the most efficient genotype for genetic transformation. We have developed a barley cv Golden Promise reference assembly integrating Illumina paired-end reads, long mate-pair reads, Dovetail Chicago in vitro proximity ligation libraries and chromosome conformation capture sequencing (Hi-C) libraries into a contiguous reference assembly. The assembled genome of 7 chromosomes and 4.13Gb in size, has a super-scaffold N50 after Chicago libraries of 4.14Mb and contains only 2.2% gaps. Using BUSCO (benchmarking universal single copy orthologous genes) as evaluation the genome assembly contains 95.2% of complete and single copy genes from the plant database. A high-quality Golden Promise reference assembly will be useful and utilized by the whole barley research community but will prove particularly useful for CRISPR-Cas9 experiments.Entities:
Keywords: Barley; Golden Promise; reference assembly
Mesh:
Year: 2020 PMID: 32241919 PMCID: PMC7263683 DOI: 10.1534/g3.119.401010
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Statistics for the different stages of the assembly process
| Contigs | Scaffolds | Dovetail | Hi-C | |
|---|---|---|---|---|
| N50 | 22.4kb | 86.67kb | 4.14Mb | / |
| Number | 786,696 | 245,820 | 128,283 | 8 |
| Longest | 352,153bp | 1,540,019bp | 22,832,123bp | 612,216,794bp |
| Size | 4.02Gb | 4.11Gb | 4.12Gb | 4.13Gb |
Figure 1Spectra cn plots comparing k-mers from the paired-end reads to kmers in (a) the contig assembly and (b) the scaffold assembly.
Identified repetitive elements in the Golden Promise assembly. Values represent percentage coverage of the genome
| Golden Promise | MorexV1 | MorexV2 | |
|---|---|---|---|
| 72.88 | 70.65 | 74.93 | |
| LTR Retrotransposon | 63.16 | 62.25 | 64.25 |
| LTR/Copia | 19.87 | 21 | 20.94 |
| LTR/Gypsy | 42.97 | 40.93 | 42.99 |
| Unclassified LTR | 0.32 | 0.31 | 0.32 |
| Non-LTR Retrotransposon | |||
| LINE | 0.25 | 0.24 | 0.24 |
| SINE | 0.03 | 0.03 | 0.03 |
| DNA Transposon Superfamily | 8.25 | 7.39 | 8.97 |
| CACTA superfamily (DTC) | 7.77 | 6.92 | 8.49 |
| hAT superfamily (DTA) | 0.004 | 0.004 | 0.004 |
| Mutator superfamily (DTM) | 0.13 | 0.13 | 0.13 |
| Tc1/Mariner superfamily (DTT) | 0.2 | 0.19 | 0.2 |
| Harbinger superfamily (DTH) | 0.13 | 0.12 | 0.13 |
| Unclassified (DTX) | 0.02 | 0.02 | 0.02 |
| MITE (DXX) | 0.01 | 0.01 | 0.01 |
| Helitron (DHH) | 0.08 | 0.09 | 0.09 |
| Unclassified Element (XXX) | 0.46 | 0.3 | 0.74 |
| Simple Sequence Repeats | 0.63 | 0.36 | 0.59 |
Figure 2Completeness assessment of the Golden Promise assembly in comparison to the previous steps of the assembly process and the published barley references MorexV1 and MorexV2 for both the BUSCO analysis (a) and the flcDNA mapping analysis (b).