| Literature DB >> 26384787 |
Hugo O Valdivia1,2, João L Reis-Cunha3, Gabriela F Rodrigues-Luiz4, Rodrigo P Baptista5, G Christian Baldeviano6, Robert V Gerbasi7, Deborah E Dobson8, Francine Pratlong9, Patrick Bastien10, Andrés G Lescano11,12, Stephen M Beverley13, Daniella C Bartholomeu14.
Abstract
BACKGROUND: The Leishmania (Viannia) braziliensis complex is responsible for most cases of New World tegumentary leishmaniasis. This complex includes two closely related species but with different geographic distribution and disease phenotypes, L. (V.) peruviana and L. (V.) braziliensis. However, the genetic basis of these differences is not well understood and the status of L. (V.) peruviana as distinct species has been questioned by some. Here we sequenced the genomes of two L. (V.) peruviana isolates (LEM1537 and PAB-4377) using Illumina high throughput sequencing and performed comparative analyses against the L. (V.) braziliensis M2904 reference genome. Comparisons were focused on the detection of Single Nucleotide Polymorphisms (SNPs), insertions and deletions (INDELs), aneuploidy and gene copy number variations.Entities:
Mesh:
Year: 2015 PMID: 26384787 PMCID: PMC4575464 DOI: 10.1186/s12864-015-1928-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Top ten high SNP count genes in two L. (V.) peruviana isolates
| Gene ID | Annotation | Number of SNP | Gene length | CN PAB LEM | |
|---|---|---|---|---|---|
| LbrM.33.3060 | Hypothetical proteins | 135 | 14,943 | 0.97 | 0.87 |
| LbrM.30.2340 | 83 | 11,340 | 0.98 | 0.75 | |
| LbrM.34.5330 | 82 | 19,875 | 1.07 | 1.25 | |
| LbrM.16.0180 | 69 | 13,302 | 0.82 | 0.79 | |
| LbrM.35.1580 | 68 | 16,767 | 1.01 | 0.76 | |
| LbrM.14.0770 | 63 | 12,570 | 0.68 | 0.39 | |
| LbrM.35.3160 | 43 | 12,582 | 1.05 | 0.98 | |
| LbrM.30.2160 | Endosomal trafficking protein RME-8, putative | 40 | 7335 | 1.20 | 1.19 |
| LbrM.02.0130 | Phosphatidylinositol kinase related protein, putative | 39 | 14,775 | 0.51 | 0.47 |
| LbrM.30.1620 | protein kinase, putative | 38 | 5112 | 1.30 | 1.23 |
Top ten genes showing high SNP differences in L. (V.) peruviana compared with L. (V.) braziliensis orthologs. Number of SNP and gene length are presented in nucleotides. Copy number (CN) estimated for the haploid genome of PAB-4377 (PAB) and LEM-1537 (LEM)
Top ten high INDEL count genes in L. (V.) peruviana
| Gene ID | Annotation | Affected nucleotides | Gene length | CN PAB LEM | |
|---|---|---|---|---|---|
| LbrM.17.0390 | Hypothetical proteins | 57 | 3480 | 0.63 | 0.52 |
| LbrM.21.1080 | 42 | 2895 | 0.76 | 0.56 | |
| LbrM.15.1180 | Nucleoside transporter 1, putative | 28 | 1848 | 1.34 | 1.33 |
| LbrM.34.2710 | Hypothetical protein | 24 | 2133 | 1.43 | 1.6 |
| LbrM.14.0785 | Kinesin, putative | 21 | 957 | 0.75 | 1.02 |
| LbrM.31.1470 | Hypothetical proteins | 21 | 4089 | 0.82 | 0.66 |
| LbrM.32.3450 | 21 | 2469 | 0.74 | 0.54 | |
| LbrM.33.2950 | 21 | 3582 | 0.91 | 0.77 | |
| LbrM.07.1050 | RNA binding protein-like protein | 19 | 1377 | 1.02 | 1.21 |
| LbrM.25.1000 | Hypothetical proteins | 18 | 19,518 | 0.56 | 0.33 |
Top ten high indel count genes in L. (V.) peruviana compared with L. (V.) braziliensis orthologs. Gene length is presented in nucleotides. Copy number (CN) estimated for the haploid genome of PAB-4377 (PAB) and LEM-1537 (LEM)
Fig. 1Chromosome copy number in L. (V). peruviana. Chromosome copy number variation in L. (V.) peruviana. a PAB-4377; b LEM-1537. Boxes represent the estimated copy number for each chromosome and standard deviation from the three methods. Mean genome ploidy is indicated by a dotted red line
Fig. 2Chromosome 31 normalized read depth. Normalized read depth for supernumerary chromosome 31. a PAB-4377; b LEM-1537. Estimated ploidy indicated by a dotted red line. Blue lines represent normalized read depth for each position at the chromosome
Fig. 3Normalized allele frequency distribution. Normalized allele frequency counts for L. (V.) peruviana. a PAB-4377; b LEM-1537. Blue dots show normalized counts at heterozygous positions for all disomic chromosomes. Mean count at each allele frequency is indicated by a red line. Cumulative percentage between 0.4 and 06 heterozygous frequencies support disomic tendency of most chromosomes
Ontology analysis for chromosome 31
| Go ID | Description | Corrected |
|---|---|---|
| 51,537 | 2 iron, 2 sulfur cluster binding | 1.08E-03 |
| 9055 | electron carrier activity | 1.85E-02 |
| 4198 | calcium-dependent cysteine-type endopeptidase activity | 1.85E-02 |
| 51,536 | iron-sulfur cluster binding | 1.85E-02 |
| 51,540 | metal cluster binding | 1.85E-02 |
| 4148 | dihydrolipoyl dehydrogenase activity | 1.85E-02 |
| 4197 | cysteine-type endopeptidase activity | 3.81E-02 |
| 8234 | cysteine-type peptidase activity | 4.94E-02 |
Fig. 4Gene copy number variations in L. (V.) peruviana. Mapping of expanded genes in both L. (V.) peruviana isolates. a, b Tandem duplicated gene arrays in PAB-4377 and LEM-1537, respectively. Outer circle shows gene arrays as red with numbers indicating the calculated array expansion. Disomic chromosomes are shown in blue and supernumerary chromosomes in orange with outer numbers describing each chromosome. Colored lines map the location of duplicated arrays in their respective chromosomes. c, d dispersed duplicated genes in PAB-4377 and LEM-1537, respectively. Histogram and numbers represents the total number of gene expansions in each chromosome