| Literature DB >> 30181897 |
Yang An1,2, Akane Kawaguchi3,4, Chen Zhao5,2,6, Atsushi Toyoda7, Ali Sharifi-Zarchi8,9,10, Seyed Ahmad Mousavi8, Reza Bagherzadeh1,9,11,12, Takeshi Inoue1,12, Hajime Ogino3,13, Asao Fujiyama7, Hamidreza Chitsaz8, Hossein Baharvand9,11, Kiyokazu Agata1,12.
Abstract
BACKGROUND: Planarians are non-parasitic Platyhelminthes (flatworms) famous for their regeneration ability and for having a well-organized brain. Dugesia japonica is a typical planarian species that is widely distributed in the East Asia. Extensive cellular and molecular experimental methods have been developed to identify the functions of thousands of genes in this species, making this planarian a good experimental model for regeneration biology and neurobiology. However, no genome-level information is available for D. japonica, and few gene regulatory networks have been identified thus far.Entities:
Keywords: Conserved non-coding elements; Dugesia japonica; Genome; Nou-darake; Planarian
Year: 2018 PMID: 30181897 PMCID: PMC6114478 DOI: 10.1186/s40851-018-0102-2
Source DB: PubMed Journal: Zoological Lett ISSN: 2056-306X Impact factor: 2.836
Summary of library construction and sequencing
| Resource | Insert Length (bp) | Average Read Length (bp) | Total Raw Data (Gb) |
|---|---|---|---|
| Illumina | 180 | 150 × 2 | 37.98 |
| 200 | 150 × 2 | 26.54 | |
| 250 | 150 × 2 | 24.77 | |
| Illumina | 250 | 100 × 2 | 125.53 |
| 450 | 100 × 2 | 58.81 | |
| 450 | 34 × 2 | 43.51 | |
| 3000 | 34 × 2 | 69.78 | |
| 8000 | 34 × 2 | 46.22 | |
| 20,000 | 34 × 2 | 36.91 | |
| Roche | 3000 | 350 × 2 | 0.79 |
| 8000 | 270 × 2 | 0.64 | |
| Shotgun | 400 | 2.62 | |
| Shotgun | 600 | 2.31 | |
| Sanger | 35,000 | 1000 × 2 | 0.07 |
| Total | 476.48 |
Fig. 1Kmer (k = 17) frequency analysis. a Kmer species frequency graph. The horizontal axis shows the depth of kmer species, and the vertical axis shows the percentage of each kmer species value (blue curve). b. Kmer individuals frequency graph. The horizontal axis shows depth of kmer individuals, the left vertical axis shows the percentage of each kmer individual’s value (blue curve), and the right vertical axis shows the accumulative frequency of the kmer individuals (red curve)
Fig. 2One fosmid insert sequence of the gene Djth (DJF-016O13). This figure shows the alignment of genomic DNA and RNA sequencing reads to one fosmid insert sequence of the gene Djth (DJF-016O13) and the repeated elements annotation of this fosmid sequence by Repbase. Green arrowheads show the10th and 11th exon of the Djth gene on the fosmid insert sequence. Grey spots show matches between the sequencing reads and the reference fosmid sequence, and black spots show mismatches. Red bars represent repetitive sequences
Fig. 3D. japonica genome assembly workflow
Summary of the D. japonica genome assembly
| Terms | Contigs | Scaffolds | RNA-guided Super-scaffolds |
|---|---|---|---|
| N50 (bp) | 1408 | 23,204 | 27,741 |
| Longest (bp) | 186,265 | 760,010 | 760,010 |
| Total number (> 1Kb) | 286,283 | 135,705 | 126,524 |
| Total number (> 10Kb) | 466 | 43,707 | 38,208 |
| Total number | 951,280 | 213,090 | 202,925 |
| Total size (Gb) | 0.9 | 1.56 | 1.56 |
Fig. 4Alignment between the Djth fosmid insert sequence and its corresponding genome scaffold. In the illustration of the Djth fosmid structure, green arrowheads show the 10th and 11th exons of the Djth gene in the fosmid (see Fig. 2). In the illustration of the Djth fosmid repeated sequences, red bars represent repetitive sequences. In the illustration of the alignment between Djth fosmid and scaffold, pink blocks show matched sequences between the fosmid and the corresponding scaffold, while white blocks show mismatches, which were mainly caused by gaps in the scaffold
Summary of the D. japonica transcriptome assembly and genome scaffolds coverage
| Term | Value |
|---|---|
| Statistics for isotig length | |
| Min isotig length (bp) | 62 |
| Max isotig length (bp) | 17,446 |
| N50 isotig length (bp) | 1792 |
| Statistics for numbers of isotigs | |
| Number of isotig groups | 19,543 |
| Number of isotigs | 25,566 |
| Number of isotigs > = 1 kb | 13,256 |
| Genome scaffold coverage of isotig number | ~ 99.4% |
| Statistics for bases in the isotigs | |
| Number of bases in all isotigs | 34,777,653 |
| Number of bases in isotigs > = 1 kb | 27,150,697 |
| Genome scaffold coverage of isotig bases | ~ 97.8% |
Summary of repeated elements in D. japonica genome
| Repeated Elements | Numbers of elementsa | Length (bp) | Percentage of genome (%) |
|---|---|---|---|
| Retrotransposon | 260,765 | 122,888,499 | 7.85% |
| LTR-Retrotransposon | 194,395 | 101,831,050 | 6.51% |
| Non-LTR Retrotransposon | 66,370 | 21,057,449 | 1.35% |
| DNA Transposon | 323,715 | 109,645,290 | 7.01% |
| Unclassified | 1,602,282 | 355,858,117 | 22.74% |
| Small RNA | 4762 | 993,638 | 0.06% |
| Simple repeats | 411,512 | 27,237,668 | 1.74% |
| Low complexity | 80,856 | 4,603,818 | 0.29% |
| Total count | 2,683,892 | 621,227,030 | 39.69% |
aMost repeats fragmented by insertions or deletions were counted as one element
Fig. 5Category of gene ontology annotation
The number of hits within the D. japonica genome assembly by different programs of MAKER genome annotation platform
| Program | Number of hits |
|---|---|
| Blastn | 2,502,462 |
| Blastx | 1,108,696 |
| Est2genome | 2,368,346 |
| Protein2genome | 721,473 |
| Repeat masker | 2,255,084 |
| Repeat runner | 2212 |
Fig. 6Conserved non-coding elements between D. japonica and S. mediterranea
Fig. 7Transgenic experiments suggested that Djndk CNE3 might be a regulatory element. a The arrowhead shows CNE3 inserted actGFP vector, CNE (140)-actGFP driven GFP to express in the anterior region at the end of gastrulation in transgenic Xenopus embryos (n = 132). b Putative transcription factor-binding motifs are boxed in different colors; those subjected to mutation analysis are indicated by asterisks. The detailed point mutation design of three transcription factor-binding motifs are shown by red colored words. c Mutation analysis of CNE3 (the 140 bp element). actGFP is an empty reporter construct that contains the β-actin basal promoter; wt (140) is the construct of CNE (140)-actGFP used in (a); mt1, m2 and mt3 are point mutations (Msx (M), Tcf/Lef (T), and Jun/Fos (J)) generated from wt (140), and the detailed mutation design is shown in (b). The bar chart shows the percentage of the embryos that showed GFP expression in the neural plate among total developed embryos injected with the vector constructs. Actual numbers of GFP-positive cases and total numbers of scored embryos are indicated in parentheses. The chi-square test showed that the percentage of positive cases in the wt (140) and the Jun/Fos mutant constructs are significantly different (P < 0.0001), whereas the differences observed in other cases were not significant (P > 0.05)