| Literature DB >> 32455229 |
Tripti Pandey1, Arpita Ghosh2, Vivek N Todur2, Vijayakumar Rajendran1, Parismita Kalita1, Jupitara Kalita1, Rohit Shukla1, Purna B Chetri1, Harish Shukla1, Amit Sonkar1, Denzelle Lee Lyngdoh1, Radhika Singh1, Heena Khan1, Joplin Nongkhlaw1, Kanhu Charan Das1, Timir Tripathi1.
Abstract
Fascioliasis, a neglected foodborne disease caused by liver flukes (genus Fasciola), affects more than 200 million people worldwide. Despite technological advances, little is known about the molecular biology and biochemistry of these flukes. We present the draft genome of Fasciola gigantica for the first time. The assembled draft genome has a size of ∼1.04 Gb with an N50 and N90 of 129 and 149 kb, respectively. A total of 20 858 genes were predicted. The de novo repeats identified in the draft genome were 46.85%. The pathway included all of the genes of glycolysis, Krebs cycle, and fatty acid metabolism but lacked the key genes of the fatty acid biosynthesis pathway. This indicates that the fatty acid required for survival of the fluke may be acquired from the host bile. It may be hypothesized that the relatively larger F. gigantica genome did not evolve through genome duplications but rather is interspersed with many repetitive elements. The genomic information will provide a comprehensive resource to facilitate the development of novel interventions for fascioliasis control.Entities:
Year: 2020 PMID: 32455229 PMCID: PMC7241025 DOI: 10.1021/acsomega.0c00980
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Assembly Features
| description | |
|---|---|
| genome assembly size | 1040 230 724 bp [1.04 Gb] |
| number of scaffolds | 40 381 |
| longest scaffold length | 1 127 280 bp |
| average size of scaffolds | 25 760 bp |
| number of genes | 20 858 |
| mean protein length | 264 aa |
| number of coding exons | 54 948 |
| mean number of coding exons per gene | 3 |
| coding exons combined length | 16 599 815 |
| number of introns | 35 695 |
| mean intron length | 2612 |
Comparison of the Nuclear Genome Assemblies of F. gigantica and Related Parasitic Flukes
| genome size | 1.04 Gb | 1.13 Gb | 1.13 Gb | 1.27 Gb | 634.5 Mb | 320.5 Mb | 397 Mb | 385 Mb | 364 Mb |
| number of genes | 20 858 | 13 940 | 14 851 | 22 676 | 16 379 | 28 407 | 13 469 | 13 073 | 13 184 |
| mean number of exons per gene | 3 | 5.9 | 3.18 | 5.3 | 5.8 | 7.7 | 5.3 | 5.4 | 6 |
| mean exon length (bp) | 302.5 | 1376 | 257 | 303 | 254 | 312 | 222 | 246 | 222 |
| mean intron length (bp) | 2612 | 3982 | NA | 3700 | 3531 | 359 | 2059 | 2442 | 2407 |
| total GC content (%) | 43.76 | 41.80 | 44 | 47.80 | 34.06 | 33.50 | 34.30 | 34.70 |
Summary of the De Novo Repeats Identified
| description | number of elements | length occupied in bp |
|---|---|---|
| SINEs | 67 024 | 11 130 748 |
| MIRs | 1689 | 186 234 |
| LINEs | 469 965 | 221 172 212 |
| LINE2 | 12 439 | 4 408 849 |
| L3/CR1 | 134 130 | 66 193 633 |
| LTR elements | 130 496 | 70 357 405 |
| ERV_class I | 344 | 32 937 |
| ERV_class II | 2413 | 579 624 |
| DNA elements | 63 140 | 18 072 908 |
| TcMar-Tigger | 176 | 53 598 |
| unclassified | 697 673 | 157 005 240 |
| total interspersed repeats | 477 738 513 | |
| small RNA | 35 107 | 6 310 113 |
| satellites | 45 457 | 7 519 158 |
| simple repeats | 68 849 | 3 254 749 |
| low complexity | 2024 | 92 148 |
Figure 1Graphical representation of the distribution of genes assigned to GO terms. The proportion of 5371 F. gigantica proteins with functional information in different GO categories is shown as the biological process, molecular function, and cellular component.
Figure 2GO classification of genes in cellular components, molecular function, and biological process.
Figure 3Representation of the 20 most abundant InterPro domains revealed by InterProScan (IPS) annotation.
Figure 4Representation of the 20 most abundant InterPro families revealed by InterProScan annotation.
Figure 5Venn diagram showing the phylogenetic distribution of orthologous protein families. (A) Between F. gigantica and F. hepatica. (B) Between F. hepatica, F. gigantica, S. mansoni, and O. viverrini.