| Literature DB >> 28256610 |
Simone Coughlan1, Peter Mulhair2, Mandy Sanders3, Gabriele Schonian4, James A Cotton3, Tim Downing1,2.
Abstract
Control of pathogens arising from humans, livestock and wild animals can be enhanced by genome-based investigation. Phylogenetically classifying and optimal construction of these genomes using short sequence reads are key to this process. We examined the mammal-infecting unicellular parasite Leishmania adleri belonging to the lizard-infecting Sauroleishmania subgenus. L. adleri has been associated with cutaneous disease in humans, but can be asymptomatic in wild animals. We sequenced, assembled and investigated the L. adleri genome isolated from an asymptomatic Ethiopian rodent (MARV/ET/75/HO174) and verified it as L. adleri by comparison with other Sauroleishmania species. Chromosome-level scaffolding was achieved by combining reference-guided with de novo assembly followed by extensive improvement steps to produce a final draft genome with contiguity comparable with other references. L. tarentolae and L. major genome annotation was transferred and these gene models were manually verified and improved. This first high-quality draft Leishmania adleri reference genome is also the first Sauroleishmania genome from a non-reptilian host. Comparison of the L. adleri HO174 genome with those of L. tarentolae Parrot-TarII and lizard-infecting L. adleri RLAT/KE/1957/SKINK-7 showed extensive gene amplifications, pervasive aneuploidy, and fission of chromosomes 30 and 36. There was little genetic differentiation between L. adleri extracted from mammals and reptiles, highlighting challenges for leishmaniasis surveillance.Entities:
Mesh:
Year: 2017 PMID: 28256610 PMCID: PMC5335649 DOI: 10.1038/srep43747
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary statistics for the L. adleri and L. tarentolae genomes, including unassigned (bin) contigs for both.
| Genome statistics | ||
|---|---|---|
| Number of chromosomes | 38 | 36 |
| All genes | 7,959 | 8,530 |
| Protein coding genes | 7,849 | 8,454 |
| Genes on chromosomes | 7,570 | 8,282 |
| Genes on bin contigs | 389 | 248 |
| Number of gaps | 4,350 | 4,568 |
| N content (%) | 0.64 | 3.77 |
| Chromosomes total length (bp) | 28,686,960 | 31,056,039 |
| Bin sequence total length (bp) | 1,664,372 | 578,687 |
| Genome length (bp) | 30,351,332 | 31,634,726 |
| GC content (%) | 56.76 | 56.66 |
| Median Coverage | 69 | 30 |
L. adleri has two additional chromosomes due to the fission of chromosome 30 in to 30.1 and 30.2 and chromosome 36 into chromosome 36.1 and chromosome 36.2.
Figure 1L. adleri HO174 was a Sauroleishmania isolate based on neighbornet network of the uncorrected p-distances of an alignment of: (a) seven concatenated genes with 4,677 sites for 225 strains. The scale bar indicates the number of substitutions per site. The L. adleri RLAT/KE/1957/SKINK-7 and MARV/ET/1975/HO174 nodes partially obscure each other. Compared to HO174, there are only two substitutions with SKINK-7, 177 substitutions with L. tarentolae RTAR/DZ/1939/Parrot-TarII, 635 with L. major MRHO/SU/1959/P-STRAIN, 627 with L. infantum MHOM/IT/1985/ISS175, 630 with L. donovani MHOM/YE/1993/LEM2677, and 599 substitutions with L. tropica MHOM/JO/1996/JH-88; (b) two concatenated genes (encoding DNA polymerase α catalytic polypeptide and RNA polymerase II largest subunit) with 2,192 sites of six samples. The scale bar indicates the number of substitutions per site. The SKINK-7 and HO174 nodes partially obscure each other. Compared to HO174, there are two substitutions with SKINK-7, 21 with L. adleri RLIZ/KE/1954/1433, 49 with L. tarentolae RTAR/DZ/1939/TarVI (from a Tarentola wall gecko) and L. tarentolae Parrot-TarII, 51 with L. hoogstraali RHEM/SD/1963/NG-26 (LV31), 55 with L. gymnodactyli RGYM/SU/1964/Ag (LV247) and 203 with L. major MHOM/SU/1973/5-ASKH. L. adleri SKINK-7 had the same number of substitutions as HO174 with each of these isolates.
Figure 2Divergence of L. adleri HO174 and L. adleri SKINK-7 from L. tarentolae as the number of homozygous SNPs per 10 Kb window.
Loci with high divergence in both genomes are at the top right. Density plots of divergence per 10 Kb of HO174 and SKINK-7 indicated that HO174 was more divergent from L. tarentolae than SKINK-7.
Figure 3Evidence of chromosome fission of (a) L. adleri HO174 chromosome 36 into 36.1 and 36.2; and (b) L. adleri SKINK-7 chromosome 30 into 30.1 and 30.2. Median read coverage (blue) and GC content (pink) were measured in 10 Kb blocks. Black horizontal lines indicate median coverage of that chromosome. The dashed line indicates the fission breakpoints on the original chromosomes: at 989,697 for chromosome 36 and 230,911 for chromosome 30. Genes transcribed from left to right (green) and from right to left (red) are homologous to L. major polycistronic transcriptional units from Thomas et al.67 with their transcription SSRs shown as arrows and origins of replication shown as black crosses.
Figure 4Chromosome copy numbers based on haploid median read coverage for L.
adleri HO174 reads mapped to the L. adleri HO174 reference (top); L. adleri SKINK-7 reads mapped to the L. adleri HO174 reference (middle); and L. tarentolae mapped to itself (bottom).
Figure 5Read depth allele frequency (RDAF) distributions of normalised SNP counts for: (a) L. adleri HO174 disomic chromosome 36.1 and trisomic chromosome 36.2; (b) L. adleri SKINK-7 tetrasomic chromosome 30.1 and disomic chromosome 30.2; (c) chromosome 12 trisomic in HO174 and disomic in SKINK-7; (d) chromosome 16 disomic in HO174 and trisomic in SKINK-7; (e) tetrasomic chromosome 31 in HO174 and SKINK-7.
Figure 6Numbers of genes either unique to or with orthologs in each L. adleri HO174, L. major Friedlin and L. tarentaole Parrot-TarII determined using OrthoMCL v5 orthologous groups (OGs).
The OGs are in parentheses. The number of genes in the L. major OGs are denoted by LmjF, L. tarentolae OGs by LtaP, and L. adleri OGs by LaHO174.
Amplified genes in L. adleri HO174 (top) and L. adleri SKINK-7 (bottom).
| Copy number variant information | Gene information | ||||||
|---|---|---|---|---|---|---|---|
| Chr | Copy number | Start | End | Length (bp) | Gene number | Gene ID | Gene product |
| 10 | 2 | 490,001 | 500,889 | 10,888 | 3 | LaHO174_101360 | Phosphate-repressible phosphate permease-like protein |
| LaHO174_101370 | Pteridine transporter (folate/biopterin transporter) | ||||||
| LaHO174_101380 | Delta-12 fatty acid desaturase | ||||||
| 17 | 5.5 | 22,673 | 42,500 | 19,827 | 4 | LaHO174_170090 | Elongation factor 1 -alpha |
| LaHO174_170100 | Receptor-type adenylate cyclase a | ||||||
| LaHO174_170110 | Receptor-type adenylate cyclase b | ||||||
| LaHO174_170120 | Receptor-type adenylate cyclase | ||||||
| 26 | 3.5 | 931,670 | 941,785 | 10,115 | 3 | LaHO174_262430 | Protein kinase |
| LaHO174_262440 | Conserved hypothetical protein | ||||||
| LaHO174_262450 | Paraquat-inducible protein-A (PqiA) | ||||||
| 27* | 2.4 | 437,425 | 453,344 | 15,919 | 3 | LaHO174_271110 | ATP-binding cassette subfamily A, member 8 (ABCA8) |
| LaHO174_271120 | ATP-binding cassette subfamily A, member 9 (ABCA9) | ||||||
| LaHO174_271130 | Cysteine peptidase, Clan CA, family C2 | ||||||
| 31 | 3.9 | 1,181,426 | 1,187,096 | 5,670 | 2 | LaHO174_312740 | Conserved hypothetical protein |
| LaHO174_312750 | Phosphoglycan beta 1,3 galactosyltransferase 5 | ||||||
| 33* | 2.2 | 1,034,500 | 1,040,973 | 6,473 | None | ||
| 10 | 2.5 | 490,001 | 500,889 | 10,888 | 2 | LaHO174_101360 | Phosphate-repressible phosphate permease-like protein |
| LaHO174_101370 | Pteridine transporter (folate/biopterin transporter) | ||||||
| LaHO174_101380 | Delta-12 fatty acid desaturase | ||||||
| 17 | 3.6 | 22,673 | 42,500 | 19,827 | 2 | LaHO174_170090 | Elongation factor 1 -alpha |
| LaHO174_170100 | Receptor-type adenylate cyclase a | ||||||
| LaHO174_170110 | Receptor-type adenylate cyclase b | ||||||
| LaHO174_170120 | Receptor-type adenylate cyclase | ||||||
| 17* | 2 | 518,715 | 521,099 | 2,384 | None | ||
| 26 | 2.9 | 931,670 | 941,785 | 10,115 | 3 | LaHO174_262430 | Protein kinase |
| LaHO174_262440 | Conserved hypothetical protein | ||||||
| LaHO174_262450 | Paraquat-inducible protein-A ( | ||||||
| 31 | 3.34 | 1,181,426 | 1,187,096 | 5,670 | 4 | LaHO174_312740 | Conserved hypothetical protein |
| LaHO174_312750 | Phosphoglycan beta 1,3 galactosyltransferase 5 | ||||||
*CNVs unique to each strain.