| Literature DB >> 29167271 |
Nathan A Wyatt1,2, Jonathan K Richards2, Robert S Brueggeman1,2, Timothy L Friesen3,2,4.
Abstract
Pyrenophora teres f. teres, the causal agent of net form net blotch (NFNB) of barley, is a destructive pathogen in barley-growing regions throughout the world. Typical yield losses due to NFNB range from 10 to 40%; however, complete loss has been observed on highly susceptible barley lines where environmental conditions favor the pathogen. Currently, genomic resources for this economically important pathogen are limited to a fragmented draft genome assembly and annotation, with limited RNA support of the P. teres f. teres isolate 0-1. This research presents an updated 0-1 reference assembly facilitated by long-read sequencing and scaffolding with the assistance of genetic linkage maps. Additionally, genome annotation was mediated by RNAseq analysis using three infection time points and a pure culture sample, resulting in 11,541 high-confidence gene models. The 0-1 genome assembly and annotation presented here now contains the majority of the repetitive content of the genome. Analysis of the 0-1 genome revealed classic characteristics of a "two-speed" genome, being compartmentalized into GC-equilibrated and AT-rich compartments. The assembly of repetitive AT-rich regions will be important for future investigation of genes known as effectors, which often reside in close proximity to repetitive regions. These effectors are responsible for manipulation of the host defense during infection. This updated P. teres f. teres isolate 0-1 reference genome assembly and annotation provides a robust resource for the examination of the barley-P. teres f. teres host-pathogen coevolution.Entities:
Keywords: PacBio; Pyrenophora teres f. teres; RNAseq; barley; genome report; genome sequencing
Mesh:
Year: 2018 PMID: 29167271 PMCID: PMC5765338 DOI: 10.1534/g3.117.300196
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Updated 0-1 assembly summary statistics compared to previous 0-1 assembly
| Feature | Updated 0-1 Assembly | Previous 0-1 Assembly |
|---|---|---|
| Genome size | 46,508,966 | 41,957,260 |
| Total contigs | 86 | 6,412 |
| Largest contig | 3,573,185 | 300,442 |
| Smallest contig | 27,932 | 200 |
| Prescaffolding L50 | 11 | 408 |
| Prescaffolding N50 | 1,730,401 | 26,790 |
| Contigs <100 kb | 33 | 6,389 |
| GC% | 46 | 48 |
| Telomeres | 9 | 0 |
Ellwood .
Smallest number of contigs whose length equals 50% of the prescaffolding genome assembly.
Length of the smallest contig in an ordered set of contigs corresponding to 50% of the prescaffolding assembly length.
Overall GC% content of the 0-1 genome assembly.
Number of telomeres identified in the 0-1 assembly on the end of contigs.
ALLMAPS genome scaffolding statistics
| Feature | Scaffolded 0-1 Assembly |
|---|---|
| Markers | 652 |
| Markers per Mb | 15.3 |
| Scaffolded contigs | 43 |
| Scaffolded bases | 42,704,288 |
| Unscaffolded contigs | 42 |
| Unscaffolded bases | 3,804,678 |
| Scaffold N50 | 4,379,536 |
| Scaffold L50 | 5 |
| Genome % scaffolded | 91.8% |
| Total scaffolds | 12 |
Contigs lacking a marker from either linkage map used when scaffolding.
Total bases in the unscaffolded contigs.
Smallest number of scaffolds whose length equals 50% of genome assembly.
Length of the smallest scaffold in an ordered set of scaffolds corresponding to 50% of the assembly length.
Gene annotation summary statistics
| Parameter | Value |
|---|---|
| Genes | 11,541 |
| Mean gene length | 1,470 |
| Max gene length | 43,584 |
| Min gene length | 60 |
| Mean exons/gene | 3 |
| Predicted secreted proteins | 1,002 |
| Predicted effectors | 282 |
Proteins harboring predicted signal sequence via SignalP (Petersen ).
Secreted proteins predicted to be effectors via EffectorP (Sperschneider ).
BUSCO analysis on assembly and annotations
| Species | Isolate | BUSCO Library | Complete | Fragmented | Missing |
|---|---|---|---|---|---|
| Ascomycota | 1285 | 10 | 20 | ||
| Ascomycota | 1283 | 15 | 17 | ||
| Ascomycota | 1280 | 17 | 18 | ||
| Ascomycota | 1284 | 10 | 21 |
Busco contains custom curated libraries for different taxa. The Ascomycota library consists of 1315 genes and was used in this analysis.
Complete Busco Ascomycota genes identified.
Partially identified Busco Ascomycota genes.
Missing Busco Ascomycota genes.
Figure 1OcculterCut v1 GC% plot of the P. teres f. teres 0-1 genome. A bimodal distribution is observed in which genome segments fall into the high GC% category (41–100%) or the low GC% category (0–41%). The OcculterCut calculated cut off for distinguishing low and high GC% content in 0-1 is 41%.
OcculterCut analysis of P. teres f. teres
| Feature | High GC Content (>41–100%) | Low GC Content (0–41%) |
|---|---|---|
| Peak GC content | 51.6% | 32.2% |
| Percentage of genome | 75.1% | 24.9% |
| Average region length (kb) | 58.3 | 18.3 |
| Number of genes in regions | 10,501 | 797 |
| Gene density (genes/Mb) | 306 | 72.6 |
Peak GC content in the elevated GC regions of the genome.
Proportion of the genome containing a high or low GC content.
Average length of identified GC-rich and GC-poor regions in the genome.
Number of genes residing in GC-rich and GC-poor regions of the genome.
Density of annotated genes within the GC-rich and GC-poor regions of the genome.