| Literature DB >> 27529330 |
Kazuharu Arakawa1,2, Yuki Yoshida1,2, Masaru Tomita1,2.
Abstract
Tardigrades are ubiquitous microscopic animals that play an important role in the study of metazoan phylogeny. Most terrestrial tardigrades can withstand extreme environments by entering an ametabolic desiccated state termed anhydrobiosis. Due to their small size and the non-axenic nature of laboratory cultures, molecular studies of tardigrades are prone to contamination. To minimize the possibility of microbial contaminations and to obtain high-quality genomic information, we have developed an ultra-low input library sequencing protocol to enable the genome sequencing of a single tardigrade Hypsibius dujardini individual. Here, we describe the details of our sequencing data and the ultra-low input library preparation methodologies.Entities:
Mesh:
Year: 2016 PMID: 27529330 PMCID: PMC4986543 DOI: 10.1038/sdata.2016.63
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Details of sequenced data
| SAMD00047018 | Active Adult | 1 | TTAGGC | Whole Genome Shotgun | DRR055040 |
| SAMD00047018 | Active Adult | 2 | ATCACG | Whole Genome Shotgun | SRR3706607 |
| SAMD00047018 | Active Adult | 3 | TGACCA | Whole Genome Shotgun | SRR3706608 |
| SAMD00047018 | Active Adult | 4 | ACAGTG | Whole Genome Shotgun | SRR3706609 |
| SAMD00047018 | Active Adult | 5 | GCCAAT | Whole Genome Shotgun | SRR3706610 |
| SAMD00047019 | Egg 1 day | 1 | CGTACG | RNA-Seq | DRR055005 |
| SAMD00047019 | Egg 1 day | 2 | GAGTGG | RNA-Seq | DRR055006 |
| SAMD00047019 | Egg 1 day | 3 | GGTAGC | RNA-Seq | DRR055007 |
| SAMD00047020 | Egg 2 day | 1 | ACTGAT | RNA-Seq | DRR055008 |
| SAMD00047020 | Egg 2 day | 2 | ATGAGC | RNA-Seq | DRR055009 |
| SAMD00047020 | Egg 2 day | 3 | ATTCCT | RNA-Seq | DRR055010 |
| SAMD00047021 | Egg 3 day | 1 | CAAAAG | RNA-Seq | DRR055011 |
| SAMD00047021 | Egg 3 day | 2 | CAACTA | RNA-Seq | DRR055012 |
| SAMD00047021 | Egg 3 day | 3 | CACCGG | RNA-Seq | DRR055013 |
| SAMD00047022 | Egg 4 day | 1 | CACGAT | RNA-Seq | DRR055014 |
| SAMD00047022 | Egg 4 day | 2 | CACTCA | RNA-Seq | DRR055015 |
| SAMD00047022 | Egg 4 day | 3 | CAGGCG | RNA-Seq | DRR055016 |
| SAMD00047023 | Egg 5 day | 1 | CATGGC | RNA-Seq | DRR055017 |
| SAMD00047023 | Egg 5 day | 2 | CATTTT | RNA-Seq | DRR055018 |
| SAMD00047023 | Egg 5 day | 3 | CCAACA | RNA-Seq | DRR055019 |
| SAMD00047024 | Juvenile 1 day | 1 | CACGAT | RNA-Seq | DRR055020 |
| SAMD00047024 | Juvenile 1 day | 2 | CACTCA | RNA-Seq | DRR055021 |
| SAMD00047024 | Juvenile 1 day | 3 | CAGGCG | RNA-Seq | DRR055022 |
| SAMD00047025 | Juvenile 2 day | 1 | CATGGC | RNA-Seq | DRR055023 |
| SAMD00047025 | Juvenile 2 day | 2 | CATTTT | RNA-Seq | DRR055024 |
| SAMD00047025 | Juvenile 2 day | 3 | CCAACA | RNA-Seq | DRR055025 |
| SAMD00047026 | Juvenile 3 day | 1 | CGGAAT | RNA-Seq | DRR055026 |
| SAMD00047026 | Juvenile 3 day | 2 | CTAGCT | RNA-Seq | DRR055027 |
| SAMD00047026 | Juvenile 3 day | 3 | CTATAC | RNA-Seq | DRR055028 |
| SAMD00047027 | Juvenile 4 day | 1 | CTCAGA | RNA-Seq | DRR055029 |
| SAMD00047027 | Juvenile 4 day | 2 | GCGCTA | RNA-Seq | DRR055030 |
| SAMD00047027 | Juvenile 4 day | 3 | TAATCG | RNA-Seq | DRR055031 |
| SAMD00047028 | Juvenile 5 day | 1 | TACAGC | RNA-Seq | DRR055032 |
| SAMD00047028 | Juvenile 5 day | 2 | TATAAAT | RNA-Seq | DRR055033 |
| SAMD00047029 | Active Adult | 1 | CACTCA | RNA-Seq | DRR055034 |
| SAMD00047029 | Active Adult | 2 | CAGGCG | RNA-Seq | DRR055035 |
| SAMD00047029 | Active Adult | 3 | CATGGC | RNA-Seq | DRR055036 |
| SAMD00047030 | Tun Adult | 1 | CAACTA | RNA-Seq | DRR055037 |
| SAMD00047030 | Tun Adult | 2 | CACCGG | RNA-Seq | DRR055038 |
| SAMD00047030 | Tun Adult | 3 | CACGAT | RNA-Seq | DRR055039 |
Figure 1Comparison with existing assemblies and raw reads of H. dujardini.
The ultra-low input sequencing data described in this work showed greater genomic coverage, as represented by a higher mapping rate of UNC and Edinburgh reads to our assembly, than between that of UNC and Edinburgh. UNC mapped to our assembly at 84.72%, whereas 84.27% mapped to the Edinburgh assembly. Similarly, 77.10% of the Edinburgh reads mapped to our assembly, but only 73.95% mapped to UNC assembly. While the UNC and Edinburgh reads presumably contained 10~25% contamination, as suggested by the percentage of unmapped reads, 94.09 or 97.54% of the study reads mapped to the UNC or Edinburgh assemblies, respectively. These data indicated minimal contamination in our reads.
Mapping percentages of four replicates of genome sequence reads.
| SRR3706607 | 10,642,574 | 93.52 | 89.87 | 93.69 |
| SRR3706608 | 11,497,318 | 95.58 | 91.48 | 95.73 |
| SRR3706609 | 11,855,730 | 92.91 | 89.64 | 93.14 |
| SRR3706610 | 11,250,228 | 95.68 | 91.60 | 95.79 |
Figure 2Transcriptome abundances of active and tun (in anhydrobiosis) adults in three biological replicates.
Percentage of mapped reads is shown beneath the heatmap.
Figure 3Transcriptome abundances of juveniles (B1~B5) and eggs (E1E5) in the first 1~5 days after hatching or laying.
Sequences were obtained in three biological replicates except for B5, which only has two replicates. The percentage of mapped reads is shown beneath the heatmap, and samples with a mapping percentage greater than 50% are colored in red.