Literature DB >> 29541656

Draft genome sequence of Japanese wood mouse, Apodemus speciosus.

Masatoshi Matsunami1,2,3, Daiji Endo4, Naruya Saitou5, Hitoshi Suzuki1, Manabu Onuma2.   

Abstract

The wood mouse (genus Apodemus) is one of the most common rodents in broad-leaf forests in the temperate zone of the Palaearctic region. Molecular studies of wood mice have critically enhanced the understanding of their evolution and ancestral biogeographic events. However, their molecular data are currently only limited to partial mitochondrial sequences and a few genes. Therefore, we sequenced the wood mouse genome to facilitate the acquisition of useful resources for inferring their molecular evolution. We sampled a wild wood mouse at Tsukuba, Japan, and sequenced its whole-genome using the Illumina Hiseq. 2000. To reduce the risk of non-randomness, three paired-end libraries (insert sizes: 150, 300, and 500 bp) and, two mate-pair reads (insert sizes: 8 and 20 kbp) were constructed. In total, we generated approximately 210 Gbp data. From these sequences, we reconstructed 336,124 scaffolds. These data will enhance our understanding of the evolution and ecological factors that affect their genetic constitution. The genome scaffolds generated are available in the National Center Biotechnology Information (NCBI) BioProject with accession number PRJDB5914.

Entities:  

Keywords:  Apodemus speciosus; Evolution; Mammal; Mouse; Phylogeography; Rodent

Year:  2017        PMID: 29541656      PMCID: PMC5848110          DOI: 10.1016/j.dib.2017.10.063

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the Data The Japanese wood mouse could be a model mammal for investigating the natural histories of organisms inhabiting the Japanese Archipelago. We sequenced the Japanese wood mouse genome to facilitate the acquisition of useful resources for inferring the molecular evolution of this species. This data allows other researchers focusing on this species to start genome-wide analysis.

Data

The genus Apodemus, which is a member of the subfamily Murinae, is one of the most species rich taxa of the murine rodents, next to Rattus and Mus, which are the first and second speciose genera, respectively [1]. Species belonging to Apodemus, known as field or wood mice, are distributed in the temperate region of the Eurasian continent including the Japanese Archipelago. According to previous molecular phylogenetic analyses, Apodemus diversified in Asia and Europe, around the early Pliocene and early Pleistocene eras, respectively [2]. Deciphering the genome sequences of this genus would facilitate our understanding of the evolutionary trait of the group adapted to the temperate environment, with the aid of those of Rattus and Mus. The genome of a representative species of Apodemus from Europe, Apodemus sylvaticus, was sequenced by a research group at the Liverpool University according to the JGI GOLD database (https://gold.jgi.doe.gov/; project ID = Gp0143296), but its project status is "permanent draft." In Asian species, no genome sequence data are currently available. The large Japanese wood mouse, Apodemus speciosus, is widely distributed in Japan from Hokkaido to Kyushu, including the remote peripheral islands. This species inhabits forests and grasslands, eats insects and acorns, and shows seasonal breeding in the wild. A recent phylogeographic analysis revealed that this species experienced a rapid population growth in response to the effect of the Quaternary ice ages [3]. Thus, the species could be a model mammal for investigating the natural histories of organisms inhabiting the Japanese Archipelago [4]. We sequenced the Japanese wood mouse genome to facilitate the acquisition of useful resources for inferring the molecular evolution of this species. The genome was assembled using the ALLPATHS-LG program using two-step processes of assembling and scaffolding. The paired-end reads were first assembled into contigs, which were then joined to form concatenated scaffolds by using mate-pair reads. We reconstructed 647,983 contigs (N50 is 3.5 kbp). These contigs were joined in 336,124 scaffolds (Table 2), and the N50 value of the scaffolds was 47 kbp. The wood mouse genome sequenced is a potentially useful resource for the future molecular evolutionary analysis of rodents.
Table 2

General features of the Apodemus speciosus genome.

Number of contigs647,983
Number of contigs per megabase (Mb)189
Number of scaffolds336,124
Total contig length2,009,551,775
Total scaffold length, with gaps3,428,181,270
N50 contig size (kb)3.5
N50 scaffold size (kb)25
N50 scaffold size (kb), with gaps47
Number of scaffolds per megabase (Mb)98.05
Median size of gaps in scaffolds2846
Median deviation of gaps in scaffolds852

Experimental design, materials, and methods

Sample preparation and sequencing

We sampled the A. speciosus at Tsukuba, Japan (NIES ID; IB14-021, male). All rodent experiments were conducted in accordance with the guidelines for using wild mammals of the Mammal Society of Japan [5] and the rules of the National Institute for Environmental Studies for analysis and experimentation with environmental samples contaminated with radioactive materials. Genomic DNA was extracted from the liver using proteinase K and phenol:chloroform:isoamyl alcohol [6]. The library preparations and sequencings were conducted by the Eurofin Sequencing Service (Eurofin MWG Operon, Germany). Paired-end sequencing of each 100-bp end of the DNA fragments was performed using the Hiseq. 2000 sequencing system (Illumina, San Diego, CA, USA). We sequenced libraries showing five different insert sizes: 150 bp, 300 bp, 500 bp, 8 kbp, and 20 kbp.

De novo genome assembly

We sequenced three paired-end libraries (insert sizes: 150, 300, and 500 bp) and two mate-pair reads (insert sizes: 8, and 20 kbp). In total, we generated approximately 210 Gbp data. We obtained 362107447, 336419570, 369663105, 22370797, and 12377116 reads for each library showing 150-bp, 300-bp, 500-bp, 8-kbp, and 20-kbp insert sizes, respectively (Table 1). The sequenced reads were used for de novo genome assembly using the ALLPATHS-LG program ver. 51927 under default parameter settings [7].
Table 1

Data amount.

Insert size (bp)No. of read pairsNo. of readsRead length (bp)Total data (GB)
150362,107,447724,214,89410072.4
300336,419,570892,839,14010089.3
500369,663,105739,326,21010073.9
800022,370,79744,741,5941004.5
20,00012,377,11624,754,2321002.5
Data amount. General features of the Apodemus speciosus genome.
Subject areabiology
More specific subject areaGenomics
Type of dataGenome sequences
How data was acquiredHigh throughput DNA sequencing using Hiseq. 2000
Data formatFASTA format
Experimental factorsTotal DNA extraction from liver samples
Experimental featuresFreshly collected healthy liver was used for total DNA extraction
Data source locationTsukuba, Ibaraki, Japan
Data accessibilityThe genome scaffolds generated are available in the National Center Biotechnology Information (NCBI) BioProject with accession numberPRJDB5914.
Related research articleNot applicable
  5 in total

1.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data.

Authors:  Sante Gnerre; Iain Maccallum; Dariusz Przybylski; Filipe J Ribeiro; Joshua N Burton; Bruce J Walker; Ted Sharpe; Giles Hall; Terrance P Shea; Sean Sykes; Aaron M Berlin; Daniel Aird; Maura Costello; Riza Daza; Louise Williams; Robert Nicol; Andreas Gnirke; Chad Nusbaum; Eric S Lander; David B Jaffe
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-27       Impact factor: 11.205

2.  Ecological opportunity and incumbency in the diversification of repeated continental colonizations by muroid rodents.

Authors:  John J Schenk; Kevin C Rowe; Scott J Steppan
Journal:  Syst Biol       Date:  2013-08-06       Impact factor: 15.683

3.  A phylogenetic view on species radiation in Apodemus inferred from variation of nuclear and mitochondrial genes.

Authors:  K Serizawa; H Suzuki; K Tsuchiya
Journal:  Biochem Genet       Date:  2000-02       Impact factor: 1.890

4.  A trend of central versus peripheral structuring in mitochondrial and nuclear gene sequences of the Japanese wood mouse, Apodemus speciosus.

Authors:  Morihiko Tomozawa; Hitoshi Suzuki
Journal:  Zoolog Sci       Date:  2008-03       Impact factor: 0.931

5.  Estimating the molecular evolutionary rates of mitochondrial genes referring to Quaternary ice age events with inferred population expansions and dispersals in Japanese Apodemus.

Authors:  Yutaro Suzuki; Morihiko Tomozawa; Yuki Koizumi; Kimiyuki Tsuchiya; Hitoshi Suzuki
Journal:  BMC Evol Biol       Date:  2015-09-15       Impact factor: 3.260

  5 in total
  1 in total

1.  Ancient rivers shaped the current genetic diversity of the wood mouse (Apodemus speciosus) on the islands of the Seto Inland Sea, Japan.

Authors:  Jun J Sato; Kouki Yasuda
Journal:  Zoological Lett       Date:  2022-06-21       Impact factor: 3.157

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.