| Literature DB >> 28770027 |
Kurt C Showmaker1,2, Mark A Arick1, Chuan-Yu Hsu1, Brigitte E Martin3, Xiaoqiang Wang2, Jiayuan Jia2, Martin J Wubben4, Robert L Nichols5, Tom W Allen6, Daniel G Peterson1,7, Shi-En Lu2.
Abstract
Xanthomonas citri pv. malvacearum is a major pathogen of cotton, Gossypium hirsutum L.. In this study we report the complete genome of the X. citri pv. malvacearum strain MSCT1 assembled from long read DNA sequencing technology. The MSCT1 genome is the first X. citri pv. malvacearum genome with complete coding regions for X. citri pv. malvacearum transcriptional activator-like effectors. In addition functional and structural annotations are presented in this study that will provide a foundation for future pathogenesis studies with MSCT1.Entities:
Keywords: Bacterial blight; Cotton; Long read DNA sequencing; TAL effectors; Xanthomonas citri pv. malvacearum
Year: 2017 PMID: 28770027 PMCID: PMC5525278 DOI: 10.1186/s40793-017-0253-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Xanthomonas citri pv. malvacearum strain: MSCT1 [75]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Pathovar | |||
| Gram stain |
| TAS [ | |
| Cell shape |
| TAS [ | |
| Motility |
| TAS [ | |
| Sporulation |
| ||
| Temperature range |
| TAS [ | |
| Optimum temperature |
| TAS [ | |
| pH range; Optimum |
| TAS [ | |
| Carbon source |
| TAS [ | |
| MIGS-6 | Habitat |
| TAS [ |
| MIGS-6.3 | Salinity |
| |
| MIGS-22 | Oxygen requirement |
| |
| MIGS-15 | Biotic relationship |
| TAS [ |
| MIGS-14 | Pathogenicity |
| IDA |
| MIGS-4 | Geographic location |
| IDA |
| MIGS-5 | Sample collection |
| IDA |
| MIGS-4.1 | Latitude |
| |
| MIGS-4.2 | Longitude |
| |
| MIGS-4.4 | Altitude |
|
aEvidence codes - IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [82]
Fig. 1Top (a) and bottom (b) of a cotton leaf displaying the bacterial blight disease symptom caused by Xanthomonas citri pv. malvacearum
Fig. 2The phylogenetic tree indicating current placement of the source organism. The phylogenetic tree was constructed based on the sequences of genes coding for ATP synthase β chain (atpD), heat shock protein 70 (dnaK), and gyrase subunit β (gyrB) for Xanthomonas species. MAFFT (version 7) [85] was used to align the sequences; the evolutionary history was inferred by using the Maximum Likelihood, with 100 bootstraps, method based on the Tamura-Nei model [86] with MEGA6 [87] software. Sequence identifiers of each subunit are as reported by Ah-You et al. 2009 [17]. Type (T) and Pathovar Type (PT) strains are noted in superscript
Fig. 3Transmission election micrograph of Xanthomonas citri pv. malvacearum strain MSCT1
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Complete genome |
| MIGS-28 | Libraries used | Paired-end (Illumina), Pacbio 20 kb, Nanopore |
| MIGS 29 | Sequencing platforms | Illumina MiSeq, PacBio, Nanopore |
| MIGS 31.2 | Fold coverage | 2378.74X Total, 1820.26X Illumina, 516.58 PacBio, 41.90 Nanopore |
| MIGS 30 | Assemblers | Canu v1.3, Pilon v1.17, Quiver v2.3.0 |
| MIGS 32 | Gene calling method | NCBI Prokaryotic Genome Annotation Pipeline |
| Locus Tag | BGK55 | |
| Genbank ID | GCA001719145.1 | |
| GenBank Date of Release | 06-SEP-2016 | |
| GOLD ID | Gp0177725 | |
| BIOPROJECT | PRJNA299817 | |
| MIGS 13 | Source Material Identifier | MSCT1 |
| Project relevance | Agricultural |
Fig. 4The genomic map of MSCT1 Chromosome 1. The outer and inner dark blue rings represents protein coding genes on the (+) and (−) strands, respectively. The light red, green and blue rings represent blastn alignments to MSCT1 against X. citri pv. malvacearum strains; R18 from Nicaragua (GCF_000309905.1), R18 from Burkina Faso (GCF_000454505.1), R20 from Burkina Faso (GCF_000454525.1), respectively. The black ring represents the gc content, while the inner green and purple ring represents the gc skew. The genomic map was created with cgview [88]
Summary of genome: one chromosome and 3 plasmids
| Label | Size (Mb) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 5.0 | Circular | CP017020.1 | NZ_CP017020.1 |
| pMSCT15kb | 15,263 (bp) | Circular | CP017021.1 | NZ_CP017021.1 |
| pMSCT44kb | 43,946 (bp) | Circular | CP017022.1 | NZ_CP017022.1 |
| pMSCT60kb | 60,533 (bp) | Circular | CP017023.1 | NZ_CP017023.1 |
Fig. 5Dot plot of X. citri pv. malvacearum strain MSCT1 chromosome (NZ_CP017020.1) (X-Axis) compared to X. citri pv. malvacearum strain X18 (NZ_CM002136.1) (left, Y-Axis) and X. citri pv. malvacearum strain X20 (NZ_CM002029.1) (right, Y-Axis) Chromosomes. Dot plot produced with YASS web server using default settings [89]
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 5,123,946 | 100.00 |
| DNA coding (bp) | 4,365,468 | 85.20 |
| DNA G + C (bp) | 3,313,791 | 64.67 |
| DNA scaffolds | 4 | 100.00 |
| Total genes | 4410 | 100.00 |
| Protein coding genes | 4102 | 93.02 |
| RNA genes | 95 | 2.15 |
| Pseudo genes | 213 | 4.83 |
| Genes in internal clusters | - | - |
| Genes with function prediction | 3375 | 76.53 |
| Genes assigned to COGs | 3228 | 73.20 |
| Genes with Pfam domains | 3302 | 74.88 |
| Genes with signal peptides | 553 | 12.54 |
| Genes with transmembrane helices | 911 | 20.66 |
| CRISPR repeats | 1 | - |
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 349 | 8.51 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 348 | 8.48 | Transcription |
| L | 208 | 5.017 | Replication, recombination and repair |
| B | 3 | 0.07 | Chromatin structure and dynamics |
| D | 129 | 3.14 | Cell cycle control, Cell division, chromosome partitioning |
| V | 168 | 4.10 | Defense mechanisms |
| T | 344 | 8.39 | Signal transduction mechanisms |
| M | 425 | 10.36 | Cell wall/membrane biogenesis |
| N | 241 | 5.88 | Cell motility |
| U | 198 | 4.83 | Intracellular trafficking and secretion |
| O | 295 | 7.19 | Posttranslational modification, protein turnover, chaperones |
| C | 369 | 9.00 | Energy production and conversion |
| G | 366 | 8.92 | Carbohydrate transport and metabolism |
| E | 479 | 11.68 | Amino acid transport and metabolism |
| F | 129 | 3.14 | Nucleotide transport and metabolism |
| H | 397 | 9.68 | Coenzyme transport and metabolism |
| I | 266 | 6.48 | Lipid transport and metabolism |
| P | 358 | 8.72 | Inorganic ion transport and metabolism |
| Q | 243 | 5.92 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 669 | 16.31 | General function prediction only |
| S | 263 | 6.41 | Function unknown |
| - | 874 | 21.31 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Ten most represented functional annotations
| Annotation | Count |
|---|---|
| Membrane protein | 64 |
| TonB-dependent receptor | 42 |
| MFS transporter | 33 |
| Transposase | 26 |
| Transcriptional regulator | 25 |
| ABC transporter ATP-binding protein | 23 |
| Oxidoreductase | 19 |
| LysR family transcriptional regulator | 19 |
| GGDEF domain-containing protein | 16 |
| Two-component sensor histidine kinase | 15 |
Xanthomad Non-TAL Effector families found in MSCT1
| Family | Refseq_ID | BlastP HIT | Notes | REF |
|---|---|---|---|---|
| XopAG | WP_033479491.1 | CAP49915.1 | HR response in Grapefruit | [ |
| XopK | WP_005915119.1 | CAP50604.1 | Unclear role in virulence | [ |
| XopP | WP_069288200.1 | CAJ22867.1 | Suppresses immune response in rice | [ |
| XopR | WP_005923840.1 | BAE70889.1 | Suppression of MAMP-triggered immune responses | [ |
| XopT | WP_069288215.1 | BAE68965.1 | - | [ |
| XopZ1 | WP_005914471.1 | BAE69157.1 | Contributes to virulence in rice | [ |
| AvrBs2 | WP_033481547.1 | CAJ21683.1 | Suppresses rice immunity | [ |
MSCT1 Potential TAL Effectors
| TAL | Molecule | Refseq_ID | Starta | Stopa | Stranda |
|---|---|---|---|---|---|
| MSCT1-TAL1 | pMSCT44kb | WP_069288206.1 | 36,431 | 40,333 | 1 |
| MSCT1-TAL2 | pMSCT60kb | WP_069288209.1 | 16,043 | 19,127 | 1 |
| MSCT1-TAL3 | Chromosome | WP_069288181.1b | 2,568,181 | 2,571,268 | -1 |
| MSCT1-TAL4 | pMSCT44kb | WP_069288204.1 | 15,111 | 19,626 | -1 |
| MSCT1-TAL5 | pMSCT60kb | WP_069288212.1 | 41,404 | 44,689 | -1 |
| MSCT1-TAL6 | pMSCT60kb | WP_069288211.1 | 44,689 | 34,259 | -1 |
| MSCT1-TAL7 | pMSCT60kb | WP_069288210.1 | 21,870 | 21,870 | -1 |
| MSCT1-TAL8 | pMSCT60kb | WP_069288208.1 | 3549 | 8064 | -1 |
aStart, Stop, and Strand annotations by AnnoTAL
bNCBI Annotation differs from AnnoTAL prediction, the MSCT1-TAL3 NCBI Start Codon begins at 2,570,908
Repeat Variable Diresidues of MSCT1 TAL effectors
| TAL | Repeat Variable Diresidues |
|---|---|
| MSCT1-TAL1 | HD-NI-NG-NI-NI-NS-NG-NG-NI-NG-NS-HD-NS-HD-NS-NG-NS-NG-HD-NG-NG-NG |
| MSCT1-TAL2 | NI-NI-NI-NN-NI-NS-HD-NG-HD-NS-NG-HD-HD-NG |
| MSCT1-TAL3 | NI-NG-NI-HD-NG-NG-NG-NG-HD-NS-HD-HD-NG-NG |
| MSCT1-TAL4 | NI-NG-NI-NG-NS-NS-NS-NG-HD-NS-HD-HD-HD-HD-HD-NG-NI-NG-NS-NG-NS-HD-HD-HD-HD-NG-NG-NG |
| MSCT1-TAL5 | NI-NI-NI-NN-NI-NS-HD-NG-NN-NS-NN-NN-HD-NG-N*-NN |
| MSCT1-TAL6 | NI-NG-NI-NI-NI-NG-NG-NS-NG-NS-NS-NG-NS-NG-HD-NS-HD-HD-NG-NS-NG-NG-NG-NG-NG-NG |
| MSCT1-TAL7 | HD-NI-NG-NI-NI-NI-HD-HD-HD-NS-NS-HD-HD-NS-NS-NG-NS-NG-NG |
| MSCT1-TAL8 | NI-NG-NI-NI-NI-NG-HD-HD-NS-NI-HD-NI-HD-HD-NI-NS-NG-HD-NS-NS-NS-NG-NS-NG-NG-NG-NG-NG |