Lucas L Maldonado, Juliana Assis, Flávio M Gomes Araújo, Anna C M Salim, Natalia Macchiaroli, Marcela Cucher, Federico Camicia, Adolfo Fox, Mara Rosenzvit, Guilherme Oliveira, Laura Kamenetzky.
Abstract
BACKGROUND: The parasite Echinococcus canadensis (G7) (phylum Platyhelminthes, class Cestoda) is one of the causative agents of echinococcosis. Echinococcosis is a worldwide chronic zoonosis affecting humans as well as domestic and wild mammals, which has been reported as a prioritized neglected disease by the World Health Organisation. No genomic data, comparative genomic analyses or efficient therapeutic and diagnostic tools are available for this severe disease. The information presented in this study will help to understand the peculiar biological characters and to design species-specific control tools.
Keywords: Comparative genomics; Drug targets; Echinococcus genome; Helminth parasites; SNPs
Year: 2017 PMID: 28241794 PMCID: PMC5327563 DOI: 10.1186/s12864-017-3574-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1 Statistical measures of assembly quality. a Cumulative length distribution using the E. multilocularis assembly as reference genome. b Echinococcus canadensis (G7) contig length distribution. The histogram represents the frequency of contigs per log contig length (bp). Lines indicate the normal distribution of log contig length. c Identity, coverage (bars) and depth of coverage (dashed line) of E. canadensis (G7) contigs on E. multilocularis chromosomes
Genome-wide statistics for the Echinococcus canadensis (G7) assembly and gene predictions
| Genome statistics | |
|---|---|
| Size of genome (Mb) | 115 |
| GC content (%) | 42 |
| Number of contigs | 9,326 |
| N50 (Kb) | 75 |
| Largest contig (Kb) | 574 |
| Depth of coverage | 55× |
| Number of predicted genes | 11,449 |
| Gene density per Mb | 13 |
| Length of proteome (amino acids) | 4,915,068 |
| Maximum protein length (amino acids) | 7,886 |
| Average protein length (amino acids) | 440 |
| Average exon length (bp) | 219 |
| Median exon length (bp) | 159 |
| Average exons per transcript | 6 |
| Median exons per transcript | 4 |
| Total length of contained introns (Kb) | 40,117 |
| Average intron length (bp) | 714 |
| Median intron length (bp) | 273 |
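
The N50 reported in the table is a standard assembly statistic: with contigs sorted from longest to shortest, it is the length of the contig at which the running total first reaches half of the assembly length. A minimal sketch of that calculation follows; the function name and the toy contig lengths are illustrative assumptions, not data from the E. canadensis (G7) assembly.

```python
def n50(contig_lengths):
    """Return the N50 of a list of contig lengths (in bp)."""
    if not contig_lengths:
        raise ValueError("no contigs given")
    half_total = sum(contig_lengths) / 2
    running = 0
    # Walk contigs from longest to shortest until half the assembly is covered.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Toy example (hypothetical lengths, not from the study).
toy_contigs = [100, 200, 300, 400, 500]
print(n50(toy_contigs))
```

A larger N50 for the same total length means the assembly is concentrated in fewer, longer contigs.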
Fig. 2 Circos plot of the genomes of E. canadensis (G7), E. granulosus and E. multilocularis. One-to-one orthologs are connected according to their distribution on the corresponding chromosomes
Fig. 3 Orthologous genes present exclusively in Echinococcus species. Venn diagram illustrating the number of gene clusters in each analysed group
Fig. 4 Proportion of Echinococcus species-specific proteins with functional information according to different Gene Ontology (GO) categories. Gene models from the E. canadensis (G7) genome were classified according to molecular function GO terms
New E. canadensis (G7) drug target proteins found in cestodes but absent or highly divergent in humans
| Category | Product | Gene ID |
|---|---|---|
| Antigens | Taeniidae antigen (Antigen B) | ECANG7_07838 |
| | immunogenic protein ts11 | ECANG7_01678 |
| Defense | Antimicrobial peptide tachystatin A | ECANG7_00862 |
| Signalling | neuropeptide-like protein | ECANG7_03703 |
| | neuropeptide spp-like | ECANG7_10139 |
| | Pancreatic hormone | ECANG7_09023 |
| | Pancreatic hormone | ECANG7_05886 |
| Transport | Vacuolar (H+) ATPase G subunit | ECANG7_02132 |
| Metabolic process | Dolichol phosphate mannosyltransferase subunit 3 | ECANG7_01023 |
| | EF-hand domain containing protein | ECANG7_02884 |
| Transcription processes | CREB binding protein | ECANG7_05946 |
| | zinc finger, C2H2 type | ECANG7_07928 |
OG distance less than 0.8 and present in all cestode species analysed (strict criteria)
Fig. 5 Correlations between CpG island (CGI) density and genomic features in the genomes of the three Echinococcus species. a CGI density (per Mb) versus contig GC content (%). b CGI density (per Mb) versus log (contig size). c CGI density (per Mb) versus contig Obs.CpG/Exp.CpG. d CGI density (per Mb) by Echinococcus species
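
Panel c of Fig. 5 uses the observed-to-expected CpG ratio of each contig. The caption does not say how the ratio was computed; one widely used definition is (number of CpG dinucleotides × sequence length) / (number of C × number of G). The sketch below assumes that definition. The function names and toy sequences are illustrative only and are not taken from the study.

```python
def gc_content(seq):
    """Return GC content of a DNA sequence as a percentage."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def cpg_obs_exp(seq):
    """Return observed/expected CpG ratio (assumed definition, see lead-in)."""
    seq = seq.upper()
    n_c = seq.count("C")
    n_g = seq.count("G")
    n_cpg = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    if n_c == 0 or n_g == 0:
        return 0.0  # ratio is undefined without both bases; report 0.0
    return n_cpg * len(seq) / (n_c * n_g)


# Toy sequences (hypothetical, not from any Echinococcus contig).
for toy in ("CGCG", "ACGT", "AAAA"):
    print(toy, gc_content(toy), cpg_obs_exp(toy))
```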
Fig. 6 Genomic context of CpG islands associated with genes in Echinococcus. a ECANG7_03687, which codes for a protein kinase. b ECANG7_06911, which codes for the ribonucleotide reductase small subunit.
Fig. 7 SNPs and phylogeny based on genome-wide SNP analysis. a Homozygous SNP sites between E. canadensis (G7), E. multilocularis and E. granulosus (G1). The numbers in the overlap regions indicate the number of SNPs between the species. The numbers in the triple overlap indicate the number of triallelic loci. b Phylogenetic tree based on genome-wide SNP analysis by the Maximum Likelihood method