| Literature DB >> 32783805 |
Longxin Chen1,2,3, Xingke Gao3, Runting Li4,2, Limeng Zhang4,2, Rui Huang5,6, Linqing Wang2, Yue Song2, Zhenzhen Xing2, Ting Liu2, Xiaoning Nie2, Fangyuan Nie5,6, Shuang Hua7, Zihan Zhang2, Feng Wang1, Runlin Z Ma5,6,2, Long Zhang3.
Abstract
Microsporidia are a large group of unicellular parasites that infect insects and mammals. The simpler life cycle of microsporidia in insects provides a model system for understanding their evolution and molecular interactions with their hosts. However, no complete genome is available for insect-parasitic microsporidian species. The complete genome of Antonospora locustae, a microsporidian parasite that obligately infects insects, is reported here. The genome size of A. locustae is 3 170 203 nucleotides, composed of 17 chromosomes onto which a total of 1857 annotated genes have been mapped and detailed. A unique feature of the A. locustae genome is the presence of an ultra-low GC region of approximately 25 kb on 16 of the 17 chromosomes, in which the average GC content is only 20 %. Transcription profiling indicated that the ultra-low GC region of the parasite could be associated with differential regulation of host defences in the fat body to promote the parasite's survival and propagation. Phylogenetic gene analysis showed that A. locustae, and the microsporidian family in general, is likely at an evolutionarily transitional position between prokaryotes and eukaryotes, and that it evolved independently. Transcriptomic analysis showed that A. locustae can systematically inhibit the locust phenoloxidase PPO, TCA and glyoxylate cycles, and PPAR pathways to escape melanization, and can activate host energy transfer pathways to support its reproduction in the fat body, which is an insect energy-producing organ. Our study provides a platform and model for studies of the molecular mechanisms of microsporidium-host interactions in an energy-producing organ and for understanding the evolution of microsporidia.Entities:
Keywords: Microsporidia; genome; host–pathogen interaction; locust; transcription
Year: 2020 PMID: 32783805 PMCID: PMC7643970 DOI: 10.1099/mgen.0.000421
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Comparison of genome information between A. locustae and other microsporidia
|
Genomic features |
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
|
Chromosome no. |
17 |
18 |
|
11 |
11 |
≥6 |
|
15 |
6 |
|
Genome size (Mbp) |
3.2 |
|
7.7 |
2.9 |
2.3 |
6 |
≤24.2 |
6.2–7.3 |
63–75 |
|
Assembled (Mbp) |
3.2 |
15.7 |
7.9 |
2.5 |
2.2 |
3.9 |
13.3 |
4.98 |
74.6 |
|
No. of scaffolds/contigs |
17 |
1605 |
5465 |
11 |
137 |
1646 |
41 804 |
1392 |
1231 |
|
N50 (bp) |
183 675 |
57 394 |
2902 |
|
|
2349 |
|
5923 |
1158 |
|
Genome coverage (%) |
100 |
100 |
90 |
86 |
96 |
64 |
55 |
70–80 |
|
|
G+C content (%) |
41.6 |
31 |
27 |
48 |
41.4 |
26 |
26 |
20 |
40.4 |
|
Predicted ORFs |
1857 |
4458 |
2614 |
1999 |
1833 |
3804 |
2174 |
2543 |
18 074 |
nd, no data.
Fig. 1.A circular representation of the complete genome of A. locustae. DNA sequencing revealed that the genome size of A. locustae is 3 170 203 base pairs, with a total of 1857 predicted coding genes distributed on 17 chromosomes. The outermost circle shows chromosome size (kb) and the distribution of KEGG pathways, as indicated with colour-coded bars (see the colour bar for key) for each chromosome. The second circle from the outside shows the variation in GC content for each of the 17 chromosomes, characterized by a sharp decrease in GC content near the centre of each chromosome. The third circle from the outside represents the distribution of coding genes on the positive strand (red) and negative strand (green) of DNA, respectively. The non-coding RNA (ncRNA) detected is shown in the fourth circle. Information about long-fragment repeat sequences is represented in the fifth circle, and genomic long-fragment repeat sequences are indicated on the innermost circle.
Fig. 2.Distribution of GC content and variations in transcriptional density along each chromosome of A. locustae in the fat body and midgut of the host. (a) The distribution of GC content on each chromosome of A. locustae. (b) The density distribution of mRNA transcripts of A. locustae in the locust fat body. The ordinate shows the value of log2 for the depth distribution of sequences on the chromosomes; the abscissa indicates the length of the chromosome. (c) The density distributions of transcripts of A. locustae in the midgut of the locust.
Fig. 3.Statistical analysis of transcripts. (a) The ratio of reads mapped to the genome of transcripts between diseased and healthy locusts. Comparison of the percentage of reads in midgut tissues (P=0.3604) and fat body tissues (P=0.0182) of diseased and healthy locusts to the locust genome. (b) Comparison of the percentage of the reads mapped to A. locustae genomes in the fat body and midgut tissues of diseased and healthy locusts (P=0.0142). (c) Principal component analysis (PCA) of A. locustae (left) and locust (right) transcripts. (d) A. locustae spore was detected in the midgut/fat body of the locust after inoculation. The t-test was used for determination of significance, *P<0.05.
Fig. 4.Homology and collinearity analysis of genomes based on all predicted genomic protein sequences and a maximum-likelihood (ML) phylogenetic tree of the frataxin gene. (a) Homology and collinearity analysis between microsporidia and yeast. A. locustae with S. cerevisiae (GenBank ID: GCF_000146045.2) (left); S. cerevisiae with K. naganishii (GenBank ID: GCF_000348985.1) (right). (b) Homology and collinearity analysis between microsporidia and protozoa. A. locustae with B. bigemina (GenBank ID: GCF_000981445.1) (left); B. bigemina with B. motasi (GenBank ID: GCF_000691945.2) (right). (c) Homology and collinearity analysis between A. locustae and E. cuniculi (GenBank ID: GCF_000091225.1) in the microsporidia family (left). Homology and collinearity analysis between E. cuniculi and E. hellem (GenBank ID: GCF_000277815.2) in the family Encephalitozoon (right). (d) The order of genes from partial genomes among several representative species of microsporidia. Arrows of the same colour represent homologous genes and the direction of the arrow indicates the direction of the encoded protein. (e) An ML phylogenetic tree of frataxin was constructed with phyML 3.0 using the best model of amino acid substitutions as determined with mega 6.0 and the Regrafting (SPR) branch-swapping algorithm. Numbers indicate the corresponding levels of bootstrap support; values below 70 are hidden. Branch lengths are drawn to scale as noted below. Details: Trichuris trichiura (GenBank ID: CDW51770.1); Trichuris suis (GenBank ID: KHJ45876.1); Danio rerio (GenBank ID: NP_001076485.1); Mus musculus (GenBank ID: NP_032070.1); Homo sapiens (GenBank ID: NP_000135.2); Ochotona princeps (GenBank ID: XP_012784301.1); Apis cerana (GenBank ID: PBC30404.1); Pararge aegeria (GenBank ID: JAA83326.1); Culex quinquefasciatus (GenBank ID: XP_001864042.1); S. cerevisiae (GenBank ID: ONH78464.1); (GenBank ID: ALV05461.1); (GenBank ID: AKX59165.1); Haemophilus influenza (GenBank ID: KIS34558.1); (Aphis glycines) (GenBank ID: ALD15510.1); K-12 (GenBank ID: CQR83218.1); Spraguea lophii 42_110 (GenBank ID: EPR79861.1); A. locustae CLX; N. bombycis (GenBank ID: ABW91182.1); Nosema apis BRL 01 (GenBank ID: EQB60682.1); E. hellem ATCC 50504 (GenBank ID: XP_003886726.1); Encephalitozoon romaleae SJ-2008 (GenBank ID: XP_009263955.1); E. cuniculi GB-M1 (GenBank ID: XP_965969.1); B. bigemina (GenBank ID: XP_012769760.1); Cryptosporidium parvum Iowa II (GenBank ID: XP_625594.1); Leishmania major strain Friedlin (GenBank ID: XP_001683860.1); Leishmania infantum JPCM5 (GenBank ID: XP_001466138.1); Trypanosoma brucei (GenBank ID: AAX69885.1); Trypanosoma cruzi (GenBank ID: RNC58480.1); K. naganishii CBS 8797 (GenBank ID: XP_022462458.1); Eremothecium gossypii FDAG1 (GenBank ID: AEY99209.1).
Fig. 5.Simplified life cycle of infection by A. locustae and critical interactions with its locust host at the level of gene transcription.