| Literature DB >> 22315334 |
Takeshi Takeuchi1, Takeshi Kawashima, Ryo Koyanagi, Fuki Gyoja, Makiko Tanaka, Tetsuro Ikuta, Eiichi Shoguchi, Mayuki Fujiwara, Chuya Shinzato, Kanako Hisata, Manabu Fujie, Takeshi Usami, Kiyohito Nagai, Kaoru Maeyama, Kikuhiko Okamoto, Hideo Aoki, Takashi Ishikawa, Tetsuji Masaoka, Atushi Fujiwara, Kazuyoshi Endo, Hirotoshi Endo, Hiromichi Nagasawa, Shigeharu Kinoshita, Shuichi Asakawa, Shugo Watabe, Nori Satoh.
Abstract
The study of the pearl oyster Pinctada fucata is key to increasing our understanding of the molecular mechanisms involved in pearl biosynthesis and biology of bivalve molluscs. We sequenced ~1150-Mb genome at ~40-fold coverage using the Roche 454 GS-FLX and Illumina GAIIx sequencers. The sequences were assembled into contigs with N50 = 1.6 kb (total contig assembly reached to 1024 Mb) and scaffolds with N50 = 14.5 kb. The pearl oyster genome is AT-rich, with a GC content of 34%. DNA transposons, retrotransposons, and tandem repeat elements occupied 0.4, 1.5, and 7.9% of the genome, respectively (a total of 9.8%). Version 1.0 of the P. fucata draft genome contains 23 257 complete gene models, 70% of which are supported by the corresponding expressed sequence tags. The genes include those reported to have an association with bio-mineralization. Genes encoding transcription factors and signal transduction molecules are present in numbers comparable with genomes of other metazoans. Genome-wide molecular phylogeny suggests that the lophotrochozoan represents a distinct clade from ecdysozoans. Our draft genome of the pearl oyster thus provides a platform for the identification of selection markers and genes for calcification, knowledge of which will be important in the pearl industry.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22315334 PMCID: PMC3325083 DOI: 10.1093/dnares/dss005
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.(A) The pearl oyster P. fucata and its pearl. Scale bar, 1 cm. (B) Flow cytometry of a mixture of sperm from Pinctata and Oryzias. The Pinctata genome, estimated to be ∼1150 Mb in size, is slightly larger than the Oryzias genome (∼1100 Mb).
Figure 2.Flow chart for sequencing and assembly of the P. fucata genome.
Summary of the Roche 454 and Illumina GAIIx data used for assembling P. fucata genome sequences
| Used data | Assembly | |||||||
|---|---|---|---|---|---|---|---|---|
| Totalsequences (Gb) | Number of reads(million) | Average readlength | Estimatedcoverage | Contig | Scaffold | |||
| Number | N50 (bp) | Number | N50 (bp) | |||||
| 454 | ||||||||
| Shotgun | 15.2 | 44.5 | 341 | 16.3x | 1 085 563 | 1629 | 800 982 | 14 455 |
| 4 kb paired enda | 1.1 | 3.8 | 286 | |||||
| 10 kb paired enda | 2.4 | 7.8 | 309 | |||||
| GAIIx | ||||||||
| Shotgun | 3.9 | 76 | 50 | 26.7x | ||||
| 3 kb mate-pair | 10.9 | 216 | 50 | |||||
| 10 kb mate-pair | 11.9 | 238 | 50 | |||||
aEach 454 raw reads were separated into ‘forward’ and ‘reverse’.
Figure 3.Distribution of the contig lengths (Y-axis) and their coverage depth (X-axis) in the current P. fucata genome assembly. A coverage range between 1 and 50 is shown for convenience. The 100 longest contigs (44–18 kb) are plotted as red dots; all others are plotted as gray dots. Two significant peaks appeared near coverage depths 10 and 20, respectively. The difference in read coverage of the contigs is likely to be derived from heterozygosity in the genome. The majority of the longest contigs (89 out of the top 100) are covered more than 15 times. See also Supplementary Fig. S3 for further distribution analysis.
Figure 4.Arrangement of P. fucata mitochondrial genes in (A) three contigs and (B) two cluster constructions of the genome.
Figure 5.The GC content of the P. fucata assembled genome (500 bp sliding window). The vertical axis shows counts of DNA sequence reads; the horizontal axis, percentage of GC in the reads. The GC content of the pearl oyster genome was estimated to be ∼34%. The GC contents of genomes of C. elegans (34%), Strongylocentrotus pulpulatus (35%), B. floridae (40%), and H. sapiens (38%) are shown for comparison.
The gene model of P. fucata
| Genome size (Mb) | 1150 |
| Total assembly (Mb) | 1413 |
| Number of gene models | 43 760 |
| Number of complete gene models (with start and stop codons) | 23 257 |
| Average length of transcripts (bp) | 1885 |
| Number of exons per gene | 3.2 |
| Average length of exons (bp) | 588.7 |
| Average length of genes (kb) | 6.7 |
| Average length of proteins (aa) | 274.1 |
| BLAST hit to transcriptome | 16 221/23 257 |
Figure 6.Repeat elements in the P. fucata genome.
The gene location in the P. fucata genome for reported cDNAs or proteins that are associated with pearl oyster shell matrix formation
| Protein | cDNA accession number | Scaffolds |
|---|---|---|
| Aspein | AB094512 | sca03_465.1 |
| Chitin synthase 1 | AB290881 | sca03_4962.1 |
| KRMP-3(MSI7) | AF516712 | sca03_229418.1 |
| MSI60 | D86074 | sca03_523.1 |
| N16 | AB023067 (#1) | sca03_1834.1 |
| N19 | AB332326 | sca03_2495.1 |
| Nacrein | D83523 | sca03_33972.1 |
| PFMG1 | DQ104255 | sca03_72180.1 |
| Pfty-1 | AB254132 | sca03_10251.1 |
| Pfty-2 | AB254133 | sca03_21093.1 |
| Pif177 | AB236929 | sca03_20175.1 |
| Prisilkin-39 | EU921665 | sca03_12887.1 |
| Prismalin-14 | AB159512 | sca03_15935.1 |
| Prismin | AB368930 | sca03_263.1 |
| Shematrin-1 | AB244419 | sca03_24266.1 |
| Shematrin-2 | AB244420 | sca03_89285.1 |
| Shematrin-3 | AB244421 | sca03_72160.1 |
| Shematrin-4 | AB244422 | sca03_16186.1 |
| Shematrin-5 | AB244423 | sca03_3950.1 |
| Shematrin-6 | AB244424 | sca03_14895.1 |
| Shematrin-7 | AB244425 | sca03_411.1 |
Figure 7.Molecular phylogeny of the pearl oyster. A total of 77 547 aligned amino acid positions of proteins encoded by 409 genes were obtained from the triploblasts T. castaneum, D. melanogaster, A. gambiae, A. pisum, P. fucata, B. floridae, C. intestinalis, D. rerio, T. rubripes, M. domestica, G. gallus, A. carolinensis, M. musculus, and H. sapiens. The sequences were analysed using ML methods, with the diploblasts A. digitifera and N. vectensis serving as an outgroup. The scale bar represents 0.05 expected substitutions per site in the aligned regions. The topology was supported by 100% bootstrap value. This analysis shows the phylogenic position of molluscs obtained using genome-wide data from all of these species.