| Literature DB >> 28243575 |
Judith Turner1, Paul O'Neill2, Murray Grant2, Rick A Mumford1, Richard Thwaites1, David J Studholme2.
Abstract
Here we present genome sequences for twelve isolates of the invasive pathogen Phytophthora ramorum EU1. The assembled genome sequences and raw sequence data are available via BioProject accession number PRJNA177509. These data will be useful in developing molecular tools for specific detection and identification of this pathogen.Entities:
Year: 2017 PMID: 28243575 PMCID: PMC5320048 DOI: 10.1016/j.gdata.2017.02.006
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Isolates and raw sequence data. All samples were collected by The Plant Health and Seeds Inspectorate except for CC12475 and CC14654, which were collected by Fera.
| Isolate | Year | Source | County | BioSample | SRA | Read 1 (bp) | Read 2 (bp) | Read pairs | Platform |
|---|---|---|---|---|---|---|---|---|---|
| CC12475 | 2007 | Soil | Cornwall | SAMN01797768 | SRX202256 | 73 | 73 | 10,929,957 | HiSeq 2000 |
| CC14654 | 2009 | Leaf detritus | Cornwall | SAMN01797770 | SRX202259 | 73 | 73 | 32,069,494 | HiSeq 2000 |
| CC2184 | 2004 | Cornwall | SAMN01797769 | SRX202257 | 73 | 73 | 26,720,144 | HiSeq 2000 | |
| CC1008 | 2002 | West Sussex | SAMN05823577 | SRX2190141 | 100 | 80 | 35,377,462 | HiSeq 2000 | |
| CC1033 | 2002 | Dorset | SAMN05823579 | SRX2190142 | 100 | 80 | 29,037,290 | HiSeq 2000 | |
| CC1048 | 2002 | Gloucestershire | SAMN01797771 | SRX202261 | 73 | 73 | 19,206,505 | HiSeq 2000 | |
| CC2168 | 2009 | Cornwall | SAMN05823583 | SRX2190143 | 100 | 80 | 31,267,535 | HiSeq 2000 | |
| CC2176 | 2009 | Cornwall | SAMN05823584 | SRX2190144 | 100 | 80 | 32,285,291 | HiSeq 2000 | |
| CC2186 | 2009 | Devon | SAMN05823586 | SRX2190145 | 100 | 80 | 22,816,927 | HiSeq 2000 | |
| CC2187 | 2009 | Cornwall | SAMN05823587 | SRX2190146 | 100 | 80 | 28,439,345 | HiSeq 2000 | |
| CC2275 | 2004 | Cornwall | SAMN01797766 | SRX202258 | 100 | 100 | 94,913,722 | GA IIx | |
| CC1011 | 2002 | Cheshire | SAMN01797767 | SRX202260 | 100 | 100 | 191,582,259 | GA IIx |
Assembly statistics.
| Isolate | GenBank accession number | Total length (bp) | Genomic coverage | Number of contigs | Number of scaffolds | Contig N50 (bp) | Scaffold N50 (bp) |
|---|---|---|---|---|---|---|---|
| CC1008 | MLJA00000000 | 39,143,618 | 53.4 × | 5099 | 4625 | 20,824 | 24,587 |
| CC1033 | MLJB00000000 | 39,265,443 | 41.4 × | 5112 | 4666 | 21,410 | 25,459 |
| CC1048 | MLJC00000000 | 39,184,277 | 28.3 × | 4887 | 4562 | 22,104 | 25,016 |
| CC12475 | MLIX00000000 | 38,872,797 | 17.0 × | 4749 | 4449 | 22,493 | 24,333 |
| CC14654 | MLIY00000000 | 38,990,779 | 47.0 × | 5403 | 4802 | 18,355 | 23,539 |
| CC2168 | MLJD00000000 | 39,189,198 | 47.6 × | 5208 | 4707 | 19,710 | 24,940 |
| CC2176 | MLJE00000000 | 39,297,625 | 46.6 × | 5050 | 4635 | 20,983 | 24,953 |
| CC2184 | MLIZ00000000 | 39,039,515 | 39.6 × | 5237 | 4706 | 19,158 | 24,300 |
| CC2186 | MLJF00000000 | 40,428,501 | 32.9 × | 6372 | 5378 | 19,249 | 23,332 |
| CC2187 | MLJG00000000 | 39,213,677 | 42.0 × | 5054 | 4612 | 21,714 | 25,338 |
| CC2275 | AMZZ00000000 | 38,865,974 | 162.0 × | 5507 | 2445 | 17,358 | 23,300 |
| CC1011 | MRWH00000000 | 40,766,767 | 310.2 × | 6920 | 5959 | 15,332 | 21,758 |
Completeness of assemblies assessed using BUSCO [28].
| Assembly | Complete single copy | Complete duplicated | Fragmented | Missing | Total |
|---|---|---|---|---|---|
| EU1 CC2168 (this study) | 361 (84.15%) | 58 | 9 | 59 | 429 |
| EU1 CC2184 (this study) | 361 (84.15%) | 62 | 10 | 58 | 429 |
| EU1 CC2187 (this study) | 361 (84.15%) | 62 | 10 | 58 | 429 |
| EU2 SOD158 | 361 (84.15%) | 62 | 11 | 57 | 429 |
| EU1 CC2176 (this study) | 360 (83.92%) | 62 | 10 | 59 | 429 |
| EU2 SOD136 | 360 (83.92%) | 61 | 11 | 58 | 429 |
| EU1 CC14654 (this study) | 359 (83.68%) | 58 | 11 | 59 | 429 |
| EU2 996/3 (6) | 359 (83.68%) | 78 | 13 | 57 | 429 |
| EU2 SOD22 | 359 (83.68%) | 64 | 12 | 58 | 429 |
| EU1 CC1008 (this study) | 358 (83.45%) | 62 | 12 | 59 | 429 |
| EU1 CC1033 (this study) | 358 (83.45%) | 60 | 12 | 59 | 429 |
| EU1 CC2186 (this study) | 358 (83.45%) | 66 | 11 | 60 | 429 |
| EU2 SOD58 | 358 (83.45%) | 59 | 13 | 58 | 429 |
| EU1 CC12475 (this study) | 357 (83.22%) | 62 | 12 | 60 | 429 |
| EU1 CC2275 (this study) | 357 (83.22%) | 56 | 12 | 60 | 429 |
| EU2 SOD69 | 357 (83.22%) | 61 | 14 | 58 | 429 |
| EU2 SODL51 | 357 (83.22%) | 64 | 13 | 59 | 429 |
| EU1 CC1048 (This study) | 356 (82.98%) | 57 | 12 | 61 | 429 |
| NA1 Pr102 | 351 (81.82%) | 65 | 16 | 62 | 429 |
Fig. 1Heterozygosity profiles of twelve Phytophthora ramorum EU1 isolates over scaffold 7. The previously published P. ramorum NA1 genome sequence [14] was downloaded from the Joint Genome Institute at http://genome.jgi.doe.gov/ramorum1/ramorum1.download.ftp.html and used as a reference sequence, against which genomic sequence reads from each of the 12 isolates were aligned with BWA-mem [31], [32]. Panel A: we used a sliding window of 1000 nucleotides to calculate the rate of heterozygosity. Proportion of single-nucleotide positions at which 45–55% of the aligned reads contain the second-most abundant nucleotide was expressed as a percentage; that is the vertical axis represents percentage heterozygosity. Panel B: we used a sliding window of 1000 nucleotides to calculate average depth of coverage by aligned reads. The vertical axis represents depth of coverage, normalized so that the median depth over the whole genome is one. In both panels, the horizontal axis represents position on the scaffold and regions of zero heterozygosity are highlighted in yellow.
Fig. 2Heterozygosity profiles of twelve Phytophthora ramorum EU1 isolates over scaffold 24. The previously published P. ramorum NA1 genome sequence [14] was downloaded from the Joint Genome Institute at http://genome.jgi.doe.gov/ramorum1/ramorum1.download.ftp.html and used as a reference sequence, against which genomic sequence reads from each of the 12 isolates were aligned with BWA-mem [31], [32]. Panel A: we used a sliding window of 1000 nucleotides to calculate the rate of heterozygosity. Proportion of single-nucleotide positions at which 45–55% of the aligned reads contain the second-most abundant nucleotide was expressed as a percentage; that is the vertical axis represents percentage heterozygosity. Panel B: we used a sliding window of 1000 nucleotides to calculate average depth of coverage by aligned reads. The vertical axis represents depth of coverage, normalized so that the median depth over the whole genome is one. In both panels, the horizontal axis represents position on the scaffold and regions of zero heterozygosity are highlighted in yellow.
| Specifications [standardized info for the reader] | |
|---|---|
| Organism/cell line/tissue | Twelve isolates of the EU2 lineage of |
| Sex | Not applicable |
| Sequencer or array type | Illumina HiSeq 2000 and Illumina GA II |
| Data format | Analysed; |
| Experimental factors | Genomic sequences of pure microbial cultures |
| Experimental features | Genomic sequences of pure microbial cultures |
| Consent | Not applicable; data are available without restriction |
| Sample source location | All isolates were collected in the United Kingdom |