| Literature DB >> 25165620 |
Devendra K Biswal1, Anupam Chatterjee2, Alok Bhattacharya3, Veena Tandon4.
Abstract
Among helminth parasites, Paragonimus (Entities:
Keywords: Bayesian analysis; Mitochondria; Next generation sequencing; Paragonimus; Transfer RNA
Year: 2014 PMID: 25165620 PMCID: PMC4137670 DOI: 10.7717/peerj.484
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Summary of illumina and Ion-Torrent quality control reads.
|
| ||
| S.No | 1 | 2 |
| Fastq file name | processed_reads.fastq | mapped_mito.fastq |
| Fastq file size | 239.71 MB | 71.55 MB |
| Time taken for analysis | 8.75 s | 2.76 s |
| Maximum read length | 260 | 260 |
| Minimum read length | 35 | 35 |
| Mean Read Length | 121 | 117 |
| Total number of reads | 890,504 | 292,832 |
| Total number of HQ reads 1* | 890,442 | 292,822 |
| Percentage of HQ reads | 99.993% | 99.997% |
| Total number of bases | 107,866,584 bases | 34,145,801 bases |
| Total number of bases in Mb | 107.8666 Mb | 34.1458 Mb |
| Total number of HQ bases 2* | 105,216,008 bases | 33,218,357 bases |
| Total number of HQ bases in Mb | 105.2160 Mb | 33.2184 Mb |
| Percentage of HQ bases | 97.543% | 97.284% |
| Total number of non-ATGC characters | 0 bases | 0 bases |
| Total number of non-ATGC characters in Mb | 0.000000 Mb | 0.000000 Mb |
| Percentage of non-ATGC characters | 0.000% | 0.000% |
| Number of reads with non-ATGC characters | 0 | 0 |
| Percentage of reads with non-ATGC characters | 0.000% | 0.000% |
|
| ||
| S.No | 1 | |
| Fastq file name | SE_ill.fastq | |
| Fastq file size | 14.56 MB | |
| Time taken for analysis | 0.48 s | |
| Maximum read length | 100 | |
| Minimum read length | 50 | |
| Mean read length | 96 | |
| Total number of reads | 62,874 | |
| Total number of HQ reads 1* | 62,874 | |
| Percentage of HQ reads | 100.000% | |
| Total number of bases | 6,053,872 bases | |
| Total number of bases in Mb | 6.0539 Mb | |
| Total number of HQ bases 2* | 5,982,733 bases | |
| Total number of HQ bases in Mb | 5.9827 Mb | |
| Percentage of HQ bases | 98.825% | |
| Total number of non-ATGC characters | 410 bases | |
| Total number of non-ATGC characters in Mb | 0.000410 Mb | |
| Percentage of non-ATGC characters | 0.007% | |
| Number of reads with non-ATGC characters | 240 | |
| Percentage of reads with non-ATGC characters | 0.382% | |
mt DNA nucleotide sequence statistics information of representative helminth parasites.
|
| DNA | DNA | DNA | DNA | DNA | DNA | DNA | DNA | DNA | DNA | DNA | DNA | DNA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 14,118 bp | 14,462 bp | 15,004 bp | 14,277 bp | 14,014 bp | 13,875 bp | 14,478 bp | 14,415 bp | 14,085 bp | 13,670 bp | 13,709 bp | 14,281 bp | 14,284 bp |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Submitted to |
|
|
|
|
|
|
|
|
|
|
|
|
|
| submitted | 01-FEB-2010 | submitted | 18-AUG-2010 | 14-JAN-2014 | 01-JUL-2010 | 01-MAY-2014 | 14-APR-2009 | 01-FEB-2010 | 14-APR-2009 | 01-FEB-2010 | 01-DEC-2011 | 11-MAR-2010 |
|
| 4,396.507 | 4,499.496 | 4,666.455 | 4,437.683 | 4,363.551 | 4,311.834 | 4,504.913 | 4,482.165 | 4,371.002 | 4,242.425 | 4,251.992 | 4,428.619 | 4,429.981 |
|
| 8,721.667 | 8,934.244 | 9,270.244 | 8,820.283 | 8,657.348 | 8,571.888 | 8,944.06 | 8,904.302 | 8,700.11 | 8,443.711 | 8,467.723 | 8,443.711 | 8,822.899 |
|
| |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| CDS | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 |
| Gene | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 | 12 |
| Misc. | 1 | 1 | – | – | – | – | 1 | 1 | – | – | 1 | 2 | |
| rRNA | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| tRNA | 22 | 22 | 24 | 22 | 22 | 22 | 22 | 23 | 23 | 22 | 22 | 22 | 22 |
Figure 1Comparative Synteny map of the representative species for the helminth mtDNA illustrating the protein coding genes, tRNAs, rRNAs etc.
Figure 2Circular genome map of Paragonimus westermani mtDNA.
The manual and in-silico annotations with appropriate regions for P. westermani mtDNA and annotated GenBank flat file for P. westermani were drawn into a circular graph in GenomeVX depicting the 12 PCGs and 24tRNAs.
P. westermani mtDNA annotations showing PCGs and tRNA in dot bracket format.
| Name | Start | Stop | Strand | Length | Structure |
|---|---|---|---|---|---|
| cox3 | 658 | 1134 | + | 477 | |
| trnH(gtg) | 1147 | 1209 | + | 63 | (((((((..((((......)))).(((((.......)))))....((.(..).))))))))). |
| cob-a | 1213 | 1860 | + | 648 | |
| cob-b | 1922 | 2311 | + | 390 | |
| nad4l | 2393 | 2644 | + | 252 | |
| nad4_0-a | 2922 | 3167 | + | 246 | |
| nad4_0-b | 3163 | 3465 | + | 303 | |
| nad4_1-b | 3564 | 3710 | − | 147 | |
| nad4_1-a | 3725 | 3805 | − | 81 | |
| trnQ(—) | 3882 | 3944 | + | 63 | (((((((..((((......)))).(((((......)))))....((.......))))))))). |
| trnF(gaa) | 3951 | 4020 | + | 70 | ((((.((..((((........)))).(((((.......)))))....(((.........))))).)))). |
| trnM(cat) | 4027 | 4092 | + | 66 | (((((((..((((........)))).(((((.......)))))....((((...))))))))))). |
| atp6 | 4326 | 4583 | + | 258 | |
| nad2 | 4627 | 5262 | + | 636 | |
| trnV(tac) | 5470 | 5531 | + | 62 | (((((.(..((((.....)))).(((((.......)))))....(((....)))).))))). |
| trnA(tgc) | 5539 | 5610 | + | 72 | (((((((..((((............)))).(((((.......)))))....(((((...)))))))))))). |
| trnD(gtc) | 5615 | 5681 | + | 67 | (((((((..((((.........)))).(((((.......)))))....(((.....)))))))))). |
| nad1-a | 5767 | 5877 | + | 111 | |
| nad1-b | 6077 | 6535 | + | 459 | |
| trnN(gtt) | 6606 | 6675 | + | 70 | (((((((..((((........)))).(((((.......)))))....(((((.....)))))))))))). |
| trnP(tgg) | 6676 | 6743 | + | 68 | ((((.(((..((((.......)))).(((((.......)))))....(((.......)))))))))). |
| trnI(gat) | 6749 | 6812 | + | 64 | ((((((.(..((((....)))).(((((.......)))))....(((((..)))))))))))). |
| trnK(ctt) | 6815 | 6880 | + | 66 | (((((((..((((......)))).(((((.......)))))....(((((...)))))))))))). |
| nad3 | 7001 | 7231 | + | 231 | |
| trnS1(gct) | 7244 | 7302 | + | 59 | (((((((.......(((((.......)))))....(((((......)))))))))))). |
| trnW(tca) | 7308 | 7375 | + | 68 | (((((((..((((......)))).(((((.......)))))....((((.......))))))))))). |
| cox1 | 7379 | 8872 | + | 1494 | |
| trnT(tgt) | 8914 | 8977 | + | 64 | ((((((...((((.......)))).(((((.......)))))....(((....))).)))))). |
| rrnL | 9067 | 9181 | + | 115 | ((...((((((((.....((.(((((((.((((...))))... |
| rrnL | 9417 | 9951 | + | 535 | .................(((....)))....(.....)..................((................)).. |
| trnC(gca) | 9961 | 10025 | + | 65 | (((((((..((((...))))...(((((.......)))))....(((((...)))))))))))). |
| rrnS | 10028 | 10751 | + | 724 | ...(((((.......))))).(((((((...((((((((....((..(....)....... (((.................. |
| cox2-a | 11020 | 11094 | + | 75 | |
| cox2-b | 11112 | 11204 | + | 93 | |
| cox2-c | 11201 | 11338 | + | 138 | |
| nad6 | 11410 | 11772 | + | 363 | |
| trnY(gta) | 11814 | 11876 | + | 63 | .((((((..((((.......)))).(((((.......)))))....((((.)))))))))).. |
| trnL1(tag) | 11883 | 11947 | + | 65 | .((((((..(((.......))).(((((.......)))))....(((.(...).))))))))).. |
| trnL2(—) | 12025 | 12086 | + | 62 | (((((((..(((.......))).((((........))))....(((((.)))))))))))). |
| trnR(tcg) | 12091 | 12154 | + | 64 | (((((((((......)))).(((((.......)))))....(((((.......)))))))))). |
| nad5_0-a | 12430 | 12927 | + | 498 | |
| nad5_0-b | 12989 | 13285 | + | 297 | |
| nad5_1 | 13506 | 13733 | + | 228 | |
| trnG(tcc) | 13751 | 13820 | + | 70 | (((((((..((((..........)))).(((.(.......).)))....((((.....))))))))))). |
| trnE(ttc) | 14358 | 14422 | + | 65 | (((((((..((((.......)))).(((((.......)))))....((((...))))))))))). |
Figure 3Inferred Phylogenetic relationship among the representative helminth mtDNA species of the concatenated 12 protein coding genes.
Trees were inferred using MrBayes v3.1. Posterior support values are given at nodes. Differences in the gene order in the mitochondrial genomes of parasitic flatworms from the Trematoda and Cestoda and taking Nematoda (Ascaridida) as an outgroup are indicated on the phylogenetic leaf nodes. See text for more details.
Figure 4Summarized 12 protein coding genes and their blast hit protein plots.
The protein plot depicts the quality value for each gene and each position if it is above the threshold and the different genes are differentiated with a range of colour codes. The hits used in MITOS correspond to the “mountains” in this protein plot that visualizes the signal from the BLAST searches. The arrows shown on the top of the plot depict gene order annotation and the quality values are shown on a log scale.
Figure 5Dot plot matrix and sequence similarity map depicting the the sequence similarity regions between the assembled and reference mtDNA.
(A) Dot plot matrix between the reference and assembled mt DNA. X-axis represents the assembled sequence, whereas y-axis represents the reference sequence. (B) Visual output of the mapped assembled mtDNA against the reference mtDNA using standalone blast and Artemis Comparison Tool (ACT).
Comparative nucleotide sequence statistics of mtDNA between P. westermani Indian and Korean isolates.
|
| ||
|
| ||
| Sequence type | DNA | DNA |
| Length | 15,004 | 14,965 |
| Weight (single-stranded) | 4666.455 | 4652.101 |
| Weight (double-stranded) | 9270.244 | 9246.535 |
|
| ||
| [ | ||
| 0.1 | 83.53 | 84.71 |
| 0.2 | 88.53 | 89.7 |
| 0.3 | 91.45 | 92.63 |
| 0.4 | 93.53 | 94.7 |
| 0.5 | 95.14 | 96.31 |
|
| ||
|
| ||
| CDS | 12 | 12 |
| Gene | 12 | 12 |
| Source | 1 | 1 |
| rRNA | 1 | 1 |
| tRNA | 24 | 23 |
|
| ||
| Ambiguous residues are omitted in atom counts. | ||
|
| ||
| Hydrogen (H) | 185,664 | 184,951 |
| Carbon (C) | 147,756 | 147,080 |
| Nitrogen (N) | 53,610 | 53,530 |
| Oxygen (O) | 93,068 | 92,834 |
| Phosphorus (P) | 15,004 | 14,963 |
|
| ||
| Ambiguous residues are omitted in atom counts. | ||
|
| ||
| Hydrogen (H) | 368,285 | 366,846 |
| Carbon (C) | 293,261 | 292,031 |
| Nitrogen (N) | 111,847 | 111,970 |
| Oxygen (O) | 180,050 | 179,556 |
| Phosphorus (P) | 30,008 | 29,926 |
|
| ||
|
| ||
| Ambiguous residues are omitted in atom counts. | ||
|
| ||
| Hydrogen (H) | 0.375 | 0.375 |
| Carbon (C) | 0.298 | 0.298 |
| Nitrogen (N) | 0.108 | 0.109 |
| Oxygen (O) | 0.188 | 0.188 |
| Phosphorus (P) | 0.03 | 0.03 |
|
| ||
| Ambiguous residues are omitted in atom counts. | ||
|
| ||
| Hydrogen (H) | 0.374 | 0.374 |
| Carbon (C) | 0.298 | 0.298 |
| Nitrogen (N) | 0.114 | 0.114 |
| Oxygen (O) | 0.183 | 0.183 |
| Phosphorus (P) | 0.031 | 0.031 |
|
| ||
|
| ||
| Adenine (A) | 2571 | 2339 |
| Cytosine (C) | 2284 | 2550 |
| Guanine (G) | 4535 | 4679 |
| Thymine (T) | 5614 | 5395 |
| Any nucleotide (N) | 0 | 2 |
| C + G | 6819 | 7229 |
| A + T | 8185 | 7734 |
|
| ||
|
| ||
| Adenine (A) | 0.171 | 0.156 |
| Cytosine (C) | 0.152 | 0.17 |
| Guanine (G) | 0.302 | 0.313 |
| Thymine (T) | 0.374 | 0.361 |
| Any nucleotide (N) | 0 | 0 |
| C + G | 0.454 | 0.483 |
| A + T | 0.546 | 0.517 |
Figure 6Comparative histogram of the nucleotide frequences of the Indian and Korean P. westermani isolates.
Blue coloured bars indicate Indian isolate while dark coloured bars indicate the reference Korean isolate deposited in GenBank.
Codon usage for Paragonimus westermani mt DNA.
| AmAcid | Codon | Number | /1000 | Fraction |
|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 724 tRNA secondary structures predicted using ARWEN.